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METHODS OF DIAGNOSIS OF ANGIO GENESIS, COMPOSITIONS 
AND METHODS OF SCREENING FOR ANGIOGENESIS 
5 MODULATORS 



CROSS-REFERENCES TO RELATED APPLICATIONS 
This application claims priority to USSN 09/784,356, filed February 14 2001; 
10 USSN 09/791,390, filed February 22, 2001; USSN 60/285,475, filed April 19, 2001, USSN 
60/310,025, filed August 3, 2001, and USSN 60/334,244, filed November 29, 2001, each of 
which is herein incorporated by reference in its entirety. 



FIELD OF THE INVENTION 
15 The invention relates to the identification of nucleic acid and protein 

expression profiles and nucleic acids, products, and antibodies thereto that are involved in 
angiogenesis; and to the use of such expression profiles and compositions in diagnosis and 
therapy of angiogenesis. The invention further relates to methods for identifying and using 
agents and/or targets that modulate angiogenesis. 

20 

BACKGROUND OF THE INVENTION 
Both vasculogenesis, the development of an interactive vascular system 
comprising arteries and veins, and angiogenesis, the generation of new blood vessels, play a 
role in embryonic development. In contrast, angiogenesis is limited in a normal adult to the 

25 placenta, ovary, endometrium and sites of wound healing. However, angiogenesis, or its 

absence, plays an important role in the maintenance of a variety of pathological states. Some 
of these states are characterized by neovascularization, e.g., cancer, diabetic retinopathy, 
glaucoma, and age related macular degeneration. Others, e.g., stroke, infertility, heart 
disease, ulcers, and scleroderma, are diseases of angiogenic insufficiency. 

30 Angiogenesis has a number of stages (see, e.g., Folkman, J.Natl Cancer Inst. 

82:4-6, 1990; Firestein, J Clin Invest 103:3-4, 1999; Koch, Arthritis RheumA\:95 1-62, 1998; 
Carter, Oncologist 5(Suppl l):51-4, 2000; Browder etal., Cancer Res. 60:1878-86, 2000; and 
Zhu and Witte, Invest New Dings 17:195-212, 1999). The early stages of angiogenesis 
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include endothelial cell protease production, migration of cells, and proliferation. The early 
stages also appear to require some growth factors, with VEGF, TGF-cc, angiostatin, and 
selected chemokines all putatively playing a role. Later stages of angiogenesis include 
population of the vessels with mural cells (pericytes or smooth muscle cells), basement 
5 membrane production, and the induction of vessel bed specializations. The final stages of 
vessel formation include what is known as "remodeling", wherein a forming vasculature 
becomes a stable, mature vessel bed. Thus, the process is highly dynamic, often requiring 
coordinated spatial and temporal waves of gene expression. 

Conversely, the complex process may be subject to disruption by interfering 

1 0 with one or more critical steps. Thus, the lack of understanding of the dynamics of 

angiogenesis prevents therapeutic intervention in serious diseases such as those indicated. It 
is an object of the invention to provide methods that can be used to screen compounds for the 
ability to modulate angiogenesis. Additionally, it is an object to provide molecular targets for 
therapeutic intervention in disease states which either have an undesirable excess or a deficit 

15 in angiogenesis. The present invention provides solutions to both. 



SUMMARY OF THE INVENTION 
The present invention provides compositions and methods for detecting or 
modulating angiogenesis associated sequences. 
20 In one aspect, the invention provides a method of detecting an angiogenesis- 

associated transcript in a cell in a patient, the method comprising contacting a biological 
sample from the patient with a polynucleotide that selectively hybridized to a sequence at 
least 80% identical to a sequence as shown in Tables 1-8. In one embodiment, the biological 
sample is a tissue sample. In another embodiment, the biological sample comprises isolated 
25 nucleic acids, which are often mRNA. 

In another embodiment, the method further comprises the step of amplifying 
nucleic acids before the step of contacting the biological sample with the polynucleotide. 
Often, the polynucleotide comprises a sequence as shown in Tables 1-8. The polynucleotide 
can be labeled, for example, with a fluorescent label and can be immobilized on a solid 
30 surface. 

In other embodiments the patient is undergoing a therapeutic regimen to treat a 
disease associated with angiogenesis or the patient is suspected of having an angiogenesis- 
associated disorder. 
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In another aspect, the invention comprises an isolated nucleic acid molecule 
consisting of a polynucleotide sequence as shown in Tables 1-8. The nucleic acid molecule 
can be labeled, for example, with a fluorescent label, 

In other aspects, the invention provides an expression vector comprising an 
5 isolated nucleic acid molecule consisting of a polynucleotide sequence as shown in Tables 1- 
8 or a host cell comprising the expression vector. 

In another embodiment, the isolated nucleic acid molecule encodes a 
polypeptide having an amino acid sequence as shown in Table 8. 

In another aspect, the invention provides an isolated polypeptide which is 
10 encoded by a nucleic acid molecule having polynucleotide sequence as shown in Tables 1*8. 
In one embodiment, the isolated polypeptide has an amino acid sequence as shown in Table 
8. 

In another embodiment, the invention provides an antibody that specifically 

binds a polypeptide that has an amino acid sequence as shown in Table 8 or which is encoded 
15 by a nucleotide sequence of Tables 1 -8 . The antibody can be conjugated or fused to an 

effector component such as a fluorescent label, a toxin, or a radioisotope. In some 

embodiments, the antibody is an antibody fragment or a humanized antibody. 

In another aspect, the invention provides a method of detecting a cell 

undergoing angiogenesis in a biological sample from a patient, the method comprising 
20 contacting the biological sample with an antibody that specifically binds to a polypeptide that 

has an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide 

sequence of Tables 1-8 . In some embodiments, the antibody is further conjugated or fused to 

an effector component, for example, a fluorescent label. 

In another embodiment, the invention provides a method of detecting 
25 antibodies specific to angiogenesis in a patient, the method comprising contacting a 

biological sample from the patient with a polypeptide which is encoded by a nucleotide 

sequence of Tables 1-8. 

The invention also provides a method of identifying a compound that 

modulates the activity of an angiogenesis-associated polypeptide, the method comprising the 
30 steps of: (i) contacting the compound with a polypeptide that comprises at least 80% identity 

to an amino acid sequence as shown in Table 8 or which is encoded by a nucleotide sequence 

of Tables 1-8; and (ii) detecting an increase or a decrease in the activity of the polypeptide. 

In one embodiment, the polypeptide has an amino acid sequence as shown in Table 8 or is a 
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polypeptide encoded by a nucleotide sequence of Tables 1-8. In another embodiment, the 
polypeptide is expressed in a cell 

The invention also provides a method of identifying a compound that 
modulates angiogenesis, the method comprising steps of: (i) contacting the compound with a 
cell undergoing angiogenesis; and (ii) detecting an increase or a decrease in the expression of 
a polypeptide sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8. In one embodiment, the detecting step comprises 
hybridizing a nucleic acid sample from the cell with a polynucleotide that selectively 
hybridizes to a sequence at least 80% identical to a sequence as shown in Tables 1-8. In 
another embodiment, the method further comprises detecting an increase or decrease in the 
expression of a second sequence as shown in Table 8 or a polypeptide which is encoded by a 
nucleotide sequence of Tables 1-8 . 

In another embodiment, the invention provides a method of inhibiting 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 
shown in Table 8 or which is 80% identical to a polypeptide encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
therapeutically effective amount of an inhibitor of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8 . In another embodiment, the inhibitor is an 
antibody. 

In other embodiments, the invention provides a method of activating 
angiogenesis in a cell that expresses a polypeptide at least 80% identical to a sequence as 
shown in Table 8 or at least 80% identical to a polypeptide which is encoded by a nucleotide 
sequence of Tables 1-8 , the method comprising the step of contacting the cell with a 
therapeutically effective amount of an activator of the polypeptide. In one embodiment, the 
polypeptide has an amino acid sequence shown in Table 8 or is a polypeptide which is 
encoded by a nucleotide sequence of Tables 1-8. 

Other aspects of the invention will become apparent to the skilled artisan by 
the following description of the invention. 

Tables 1-8 provide nucleotide sequence of genes that exhibit changes in 
expression levels as a function of time in tissue undergoing angiogenesis compared to tissue 
that is not. 
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DESCRIPTION OF THE SPECIFIC EMBODIMENTS 
In accordance with the objects outlined above, the present invention provides 
novel methods for diagnosis and treatment of disorders associated with angiogenesis 
(sometimes referred to herein as angiogenesis disorders or AD), as well as methods for 
5 screening for compositions which modulate angiogenesis. By "disorder associated with 

angiogenesis" or "disease associated with angiogenesis" herein is meant a disease state which 
is marked by either an excess or a deficit of blood vessel development. Angiogenesis 
disorders asociated with increased angiogenesis include, but are not limited to, cancer and 
proliferative diabetic retinopathy. Pathological states for which it may be desirable to 

10 increase angiogenesis include stroke, heart disease, infertility, ulcers, wound healing, 

ischemia, and scleradoma. Solid tumors typically require angiogenesis to support or sustain 
growth, e.g., breast, colon, lung, brain, bladder, and prostate tumors. Other AD include, e.g., 
arthritis, inflammatory bowel disease, diabetis retinopathy, macular degeneration, 
atherosclerosis, and psoriasis. Also provided are methods for treating AD. 

15 Definitions 

The term "angiogenesis protein" or "angiogenesis polynucleotide" refers to 
nucleic acid and polypeptide polymorphic variants, alleles, mutants, and interspecies 
homologs that: (1) have an amino acid sequence that has greater than about 60% amino acid 
sequence identity, 65%, 70%, 75%, 80%, 85%, 90%, preferably 91%, 92%, 93%, 94%, 95%, 

20 96%, 97%, 98% or 99% or greater amino acid sequence identity, preferably over a region of 
over a region of at least about 25, 50, 100, 200, 500, 1000, or more amino acids, to an 
angiogenesis protein sequence of Table 8; (2) bind to antibodies, e.g., polyclonal antibodies, 
raised against an immunogen comprising an amino acid sequence of Table 8, and 
conservatively modified variants thereof; (3) specifically hybridize under stringent 

25 hybridization conditions to an anti-sense strand corresponding to a nucleic acid sequence of 
Tables 1-8 and conservatively modified variants thereof; (4) have a nucleic acid sequence 
that has greater than about 95%, preferably greater than about 96%, 97%, 98%, 99%, or 
higher nucleotide sequence identity, preferably over a region of at least about 25, 50, 100, 
200, 500, 1000, or more nucleotides, to a sense sequence corresponding to one set out in 

30 Tables 1-8 . A polynucleotide or polypeptide sequence is typically from a mammal 

including, but not limited to, primate, e.g., human; rodent, e.g., rat, mouse, hamster; cow, pig, 
horse, sheep, or any mammal. An "angiogenesis polypeptide" and an "angiogenesis 
polynucleotide," include both naturally occurring or recombinant. 
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A "full length" angiogenesis protein or nucleic acid refers to an agiogenesis 
polypeptide or polynucleotide sequence, or a variant thereof, that contains all of the elements 
normally contained in one or more naturally occurring, wild type angiogenesis polynucleotide 
or polypeptide sequences. The "full length" may be prior to, or after, various stages of post- 
5 translation processing. 

"Biological sample" as used herein is a sample of biological tissue or fluid that 
contains nucleic acids or polypeptides, e.g., of an angiogenic protein. Such samples include, 
but are not limited to, tissue isolated from primates, e.g., humans, or rodents, e.g., mice, and 
rats. Biological samples may also include sections of tissues such as biopsy and autopsy 
10 samples, and frozen sections taken for histologic purposes. A biological sample is typically 
obtained from a eukaryotic organism, most preferably a mammal such as a primate e.g., 
chimpanzee or human; cow; dog; cat; a rodent, e.g., guinea pig, rat, mouse; rabbit; or a bird; 
reptile; or fish. 

"Providing a biological sample" means to obtain a biological sample for use in 
15 methods described in this invention. Most often, this will be done by removing a sample of 
cells from an animal, but can also be accomplished by using previously isolated cells (e.g., 
isolated by another person, at another time, and/or for another purpose), or by performing the 
methods of the invention in vivo. Archival tissues, having treatment or outcome histroy, will 
be particularly useful. 

20 The terms "identical" or percent "identity," in the context of two or more 

nucleic acids or polypeptide sequences, refer to two or more sequences or subsequences that 
are the same or have a specified percentage of amino acid residues or nucleotides that are the 
same (i.e., about 70% identity, preferably 75%, 80%, 85%, 90%, 91%, 92%, 93%, 94%, 95%, 
96%, 97%, 98%, 99%, or higher identity over a specified region (e.g., SEQ ID NOS: 1-229), 

25 when compared and aligned for maximum correspondence over a comparison window or 
designated region) as measured using a BLAST or BLAST 2.0 sequence comparison 
algorithms with default parameters described below, or by manual alignment and visual 
inspection (see, e.g., NCBI web site http://www.ncbi.nlm.nih.gov/BLAST/ or the like). Such 
sequences are then said to be "substantially identical." This definition also refers to, or may 

30 be applied to, the compliment of a test sequence. The definition also includes sequences that 
have deletions and/or additions, as well as those that have substitutions. As described below, 
the preferred algorithms can account for gaps and the like. Preferably, identity exists over a 
region that is at least about 25 amino acids or nucleotides in length, or more preferably over a 
region that is 50-100 amino acids or nucleotides in length. 

6 
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For sequence comparison, typically one sequence acts as a reference sequence, 
to which test sequences are compared. When using a sequence comparison algorithm, test 
and reference sequences are entered into a computer, subsequence coordinates are designated, 
if necessary, and sequence algorithm program parameters are designated. Preferably, default 
5 program parameters can be used, or alternative parameters can be designated. The sequence 
comparison algorithm then calculates the percent sequence identities for the test sequences 
relative to the reference sequence, based on the program parameters. 

A "comparison window", as used herein, includes reference to a segment of 
any one of the number of contiguous positions selected from the group consisting of from 20 

10 to 600, usually about 50 to about 200, more usually about 100 to about 150 in which a 
sequence may be compared to a reference sequence of the same number of contiguous 
positions after the two sequences are optimally aligned. Methods of alignment of sequences 
for comparison are well-known in the art. Optimal alignment of sequences for comparison 
can be conducted, e.g., by the local homology algorithm of Smith & Waterman, Adv. Appl. 

15 Math. 2:482 (1981), by the homology alignment algorithm of Needleman & Wunsch, J. Mol. 
Biol. 48:443 (1970), by the search for similarity method of Pearson & Lipman, Proc. Nat 'I. 
Acad. Sci. USA 85:2444 (1988), by computerized implementations of these algorithms 
(GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, 
Genetics Computer Group, 575 Science Dr., Madison, WI), or by manual alignment and 

20 visual inspection (see, e.g., Current Protocols in Molecular Biology (Ausubel et al. 9 eds. 
1995 supplement)). 

A preferred example of algorithm that is suitable for determining percent 
sequence identity and sequence similarity are the BLAST and BLAST 2.0 algorithms, which 
are described in Altschul et al. t Nuc. Acids Res. 25:3389-3402 (1977) and Altschul et al. f J. 

25 Mol. Biol. 215:403-410 (1990), respectively. BLAST and BLAST 2.0 are used, with the 

parameters described herein, to determine percent sequence identity for the nucleic acids and 
proteins of the invention. Software for performing BLAST analyses is publicly available 
through the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). 
This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying 

30 short words of length W in the query sequence, which either match or satisfy some positive- 
valued threshold score T when aligned with a word of the same length in a database 
sequence. T is referred to as the neighborhood word score threshold (Altschul et aL, supra). 
These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs 
containing them. The word hits are extended in both directions along each sequence for as 



7 



WO 02/079492 PCT/US02/04915 

far as the cumulative alignment score can be increased. Cumulative scores are calculated 
using, for nucleotide sequences, the parameters M (reward score for a pair of matching 
residues; always > 0) and N (penalty score for mismatching residues; always < 0). For amino 
acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the 
5 word hits in each direction are halted when: the cumulative alignment score falls off by the 
quantity X from its maximum achieved value; the cumulative score goes to zero or below, 
due to the accumulation of one or more negative-scoring residue alignments; or the end of 
either sequence is reached. The BLAST algorithm parameters W, T, and X determine the 
sensitivity and speed of the alignment. The BLASTN program (for nucleotide sequences) 

10 uses as defaults a wordlength (W) of 1 1 , an expectation (E) of 10, M=5, N— 4 and a 
comparison of both strands. For amino acid sequences, the BLASTP program uses as 
defaults a wordlength of 3, and expectation (E) of 10, and the BLQSUM62 scoring matrix 
(see Henikoff & Henikoff, Proc. Natl. Acad. Sci USA 89:10915 (1989)) alignments (B) of 
50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands. 

15 The BLAST algorithm also performs a statistical analysis of the similarity 

between two sequences (see, e.g., Karlin & Altschul, Proc. Natl. Acad. Sci. USA 90:5873- 
5787 (1993)). One measure of similarity provided by the BLAST algorithm is the smallest 
sum probability (P(N)), which provides an indication of the probability by which a match 
between two nucleotide or amino acid sequences would occur by chance. For example, a 

20 nucleic acid is considered similar to a reference sequence if the smallest sum probability in a 
comparison of the test nucleic acid to the reference nucleic acid is less than about 0.2, more 
preferably less than about 0.01, and most preferably less than about 0.001. 

An indication that two nucleic acid sequences or polypeptides are substantially 
identical is that the polypeptide encoded by the first nucleic acid is immunologically cross 

25 reactive with the antibodies raised against the polypeptide encoded by the second nucleic 
acid, as described below. Thus, a polypeptide is typically substantially identical to a second 
polypeptide, for example, where the two peptides differ only by conservative substitutions. 
Another indication that two nucleic acid sequences are substantially identical is that the two 
molecules or their complements hybridize to each other under stringent conditions, as 

30 described below. Yet another indication that two nucleic acid sequences are substantially 
identical is that the same primers can be used to amplify the sequences. 

A "host cell" is a naturally occurring cell or a transformed cell that contains an 
expression vector and supports the replication or expression of the expression vector. Host 
cells may be cultured cells, explants, cells in vivo, and the like. Host cells may be 

8 
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prokaryotic cells such as E. coli, or eukaryotic cells such as yeast, insect, amphibian, or 
mammalian cells such as CHO, HeLa, and the like (see, e.g., the American Type Culture 
Collection catalog or web site, www.atcc.org). 

The terms "polypeptide," "peptide" and "protein" are used interchangeably 
5 herein to refer to a polymer of amino acid residues. The terms apply to amino acid polymers 
in which one or more amino acid residue is an artificial chemical mimetic of a corresponding 
naturally occurring amino acid, as well as to naturally occurring amino acid polymers and 
non-naturally occurring amino acid polymer. 

The term "amino acid" refers to naturally occurring and synthetic amino acids, 

10 as well as amino acid analogs and amino acid mimetics that function in a manner similar to 
the naturally occurring amino acids. Naturally occurring amino acids are those encoded by 
the genetic code, as well as those amino acids that are later modified, e.g., hydroxyproline, y- 
carboxyglutamate, and O-phosphoserine. Amino acid analogs refers to compounds that have 
the same basic chemical structure as a naturally occurring amino acid, i.e., an a carbon that is 

15 bound to a hydrogen, a carboxyl group, an amino group, and an R group, e.g., homoserine, 
norleucine, methionine sulfoxide, methionine methyl sulfonium. Such analogs have modified 
R groups (e.g., norleucine) or modified peptide backbones, but retain the same basic chemical 
structure as a naturally occurring amino acid. Amino acid mimetics refers to chemical 
compounds that have a structure that is different from the general chemical structure of an 

20 amino acid, but that functions in a manner similar to a naturally occurring amino acid. 

Amino acids may be referred to herein by either their commonly known three 
letter symbols or by the one-letter symbols recommended by the IUPAC-IUB Biochemical 
Nomenclature Commission. Nucleotides, likewise, may be referred to by their commonly 
accepted single-letter codes. 

25 "Conservatively modified variants" applies to both amino acid and nucleic 

acid sequences. With respect to particular nucleic acid sequences, conservatively modified 
variants refers to those nucleic acids which encode identical or essentially identical amino 
acid sequences, or where the nucleic acid does not encode an amino acid sequence, to 
essentially identical sequences. Because of the degeneracy of the genetic code, a large 

30 number of functionally identical nucleic acids encode any given protein. For instance, the 
codons GCA, GCC, GCG and GCU all encode the amino acid alanine. Thus, at every 
position where an alanine is specified by a codon, the codon can be altered to any of the 
corresponding codons described without altering the encoded polypeptide. Such nucleic acid 
variations are "silent variations," which are one species of conservatively modified 
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variations. Every nucleic acid sequence herein which encodes a polypeptide also describes 
every possible silent variation of the nucleic acid. One of skill will recognize that each codon 
in a nucleic acid (except AUG, which is ordinarily the only codon for methionine, and TGG, 
which is ordinarily the only codon for tryptophan) can be modified to yield a functionally 
5 identical molecule. Accordingly, each silent variation of a nucleic acid which encodes a 
polypeptide is implicit in each described sequence with respect to the expression product, but 
not with respect to actual probe sequences. 

As to amino acid sequences, one of skill will recognize that individual 
substitutions, deletions or additions to a nucleic acid, peptide, polypeptide, or protein 

10 sequence which alters, adds or deletes a single amino acid or a small percentage of amino 
acids in the encoded sequence is a "conservatively modified variant" where the alteration 
results in the substitution of an amino acid with a chemically similar amino acid. 
Conservative substitution tables providing functionally similar amino acids are well known in 
the art. Such conservatively modified variants are in addition to and do not exclude 

15 polymorphic variants, interspecies homologs, and alleles of the invention. 

The following eight groups each contain amino acids that are conservative 
substitutions for one another: 1) Alanine (A), Glycine (G); 2) Aspartic acid (D), Glutamic acid 
(E); 3) Asparagine (N), Glutamine (Q); 4) Arginine (R), Lysine (K); 5) Isoleucine (I), 
Leucine (L), Methionine (M), Valine (V); 6) Phenylalanine (F), Tyrosine (Y), Tryptophan 

20 (W); 7) Serine (S), Threonine (T); and 8) Cysteine (C), Methionine (M) (see, e.g., Creighton, 
Proteins (1984)). 

Macromolecular structures such as polypeptide structures can be described in 
terms of various levels of organization. For a general discussion of this organization, see, 
e.g., Alberts et al., Molecular Biology of the Cell (3 rd ed., 1994) and Cantor and Schimmel, 

25 Biophysical Chemistry Part I: The Conformation of Biological Macromolecules (1980). 
"Primary structure" refers to the amino acid sequence of a particular peptide. "Secondary 
structure" refers to locally ordered, three dimensional structures within a polypeptide. These 
structures are commonly known as domains. Domains are portions of a polypeptide that 
form a compact unit of the polypeptide and are typically 25 to approximately 500 amino 

30 acids long. Typical domains are made up of sections of lesser organization such as stretches 
of (3-sheet and a-helices. "Tertiary structure" refers to the complete three dimensional 
structure of a polypeptide monomer. "Quaternary structure" refers to the three dimensional 
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structure formed, usually by the noncovalent association of independent tertiary units. 
Anisotropic terms are also known as energy terms. 

A "label" or a "detectable moiety" is a composition detectable by 
spectroscopic, photochemical, biochemical, immunochemical, chemical, or other physical 
5 means. For example, useful labels include 32 P, fluorescent dyes, electron-dense reagents, 
enzymes (e.g., as commonly used in an ELISA), biotin, digoxigenin, or haptens and proteins 
which can be made detectable, e.g., by incorporating a radiolabel into the peptide or used to 
detect antibodies specifically reactive with the peptide. 

An "effector" or "effector moiety" or "effector component" is a molecule that 

10 is bound (or linked, or conjugated), either covalently, through a linker or a chemical bond, or 
noncovalently, through ionic, van der Waals, electrostatic, or hydrogen bonds, to an antibody. 
The "effector" can be a variety of molecules including, for example, detection moieties 
including radioactive compounds, fluroescent compounds, an enzyme or substrate, tags such 
as epitope tags, a toxin; a chemotherapeutic agent; a lipase; an antibiotic; or a radioisotope 

1 5 emitting "hard" e.g. , beta radiation. 

A "labeled nucleic acid probe or oligonucleotide" is one that is bound, either 
covalently, through a linker or a chemical bond, or noncovalently, through ionic, van der 
Waals, electrostatic, or hydrogen bonds to a label such that the presence of the probe may be 
detected by detecting the presence of the label bound to the probe. Alternatively, method 

20 using high affinity interactions may achieve the same results where one of a pair of binding 
partners binds to the other, e.g. , biotin, streptavidin. 

As used herein a "nucleic acid probe or oligonucleotide" is defined as a 
nucleic acid capable of binding to a target nucleic acid of complementary sequence through 
one or more types of chemical bonds, usually through complementary base pairing, usually 

25 through hydrogen bond formation. As used herein, a probe may include. natural (i.e., A, G, C, 
or T) or modified bases (7-deazaguanosine, inosine, etc.). In addition, the bases in a probe 
may be joined by a linkage other than a phosphodiester bond, so long as it does not interfere 
with hybridization. Thus, for example, probes may be peptide nucleic acids in which the 
constituent bases are joined by peptide bonds rather than phosphodiester linkages. It will be 

30 understood by one of skill in the art that probes may bind target sequences lacking complete 
complementarity with the probe sequence depending upon the stringency of the hybridization 
conditions. The probes are preferably directly labeled as with isotopes, chromophores, 
lumiphores, chromogens, or indirectly labeled such as with biotin to which a streptavidin 
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complex may later bind. By assaying for the presence or absence of the probe, one can detect 
the presence or absence of the select sequence or subsequence. 

The term "recombinant" when used with reference, e.g., to a cell, or nucleic 
acid, protein, or vector, indicates that the cell, nucleic acid, protein or vector, has been 
5 modified by the introduction of a heterologous nucleic acid or protein or the alteration of a 
native nucleic acid or protein, or that the cell is derived from a cell so modified. Thus, for 
example, recombinant cells express genes that are not found within the native (non- 
recombinant) form of the cell or express native genes that are otherwise abnormally 
expressed, under expressed or not expressed at all. 

10 The term "heterologous" when used with reference to portions of a nucleic 

acid indicates that the nucleic acid comprises two or more subsequences that are not found in 
the same relationship to each other in nature. For instance, the nucleic acid is typically 
recombinantly produced, having two or more sequences from unrelated genes arranged to 
make a new functional nucleic acid, e.g., a promoter from one source and a coding region 

1 5 from another source. Similarly, a heterologous protein indicates that the protein comprises 
two or more subsequences that are not found in the same relationship to each other in nature 
(e.g., a fusion protein). 

A "promoter" is defined as an array of nucleic acid control sequences that 
direct transcription of a nucleic acid. As used herein, a promoter includes necessary nucleic 

20 acid sequences near the start site of transcription, such as, in the case of a polymerase II type 
promoter, a TATA element. A promoter also optionally includes distal enhancer or repressor 
elements, which can be located as much as several thousand base pairs from the start site of 
transcription. A "constitutive" promoter is a promoter that is active under most 
environmental and developmental conditions. An "inducible" promoter is a promoter that is 

25 active under environmental or developmental regulation. The term "operably linked" refers 
to a functional linkage between a nucleic acid expression control sequence (such as a 
promoter, or array of transcription factor binding sites) and a second nucleic acid sequence, 
wherein the expression control sequence directs transcription of the nucleic acid 
corresponding to the second sequence. 

30 An "expression vector" is a nucleic acid construct, generated recombinantly or 

synthetically, with a series of specified nucleic acid elements that permit transcription of a 
particular nucleic acid in a host cell. The expression vector can be part of a plasmid, virus, or 
nucleic acid fragment. Typically, the expression vector includes a nucleic acid to be 
transcribed operably linked to a promoter. 
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The phrase "selectively (or specifically) hybridizes to" refers to the binding, 
duplexing, or hybridizing of a molecule only to a particular nucleotide sequence under 
stringent hybridization conditions when that sequence is present in a complex mixture (e.g., 
total cellular or library DNA or RNA). 
5 The phrase "stringent hybridization conditions" refers to conditions under 

which a probe will hybridize to its target subsequence, typically in a complex mixture of 
nucleic acids, but to no other sequences. Stringent conditions are sequence-dependent and 
will be different in different circumstances. Longer sequences hybridize specifically at 
higher temperatures. An extensive guide to the hybridization of nucleic acids is found in 

10 Tijssen, Techniques in Biochemistry and Molecular Biology— Hybridization with Nucleic 
Probes, "Overview of principles of hybridization and the strategy of nucleic acid assays" 
(1993). Generally, stringent conditions are selected to be about 5-1 0°C lower than the 
thermal melting point (T m ) for the specific sequence at a defined ionic strength pH. The T m is 
the temperature (under defined ionic strength, pH, and nucleic concentration) at which 50% 

15 of the probes complementary to the target hybridize to the target sequence at equilibrium (as 
the target sequences are present in excess, at T m , 50% of the probes are occupied at 
equilibrium). Stringent conditions will be those in which the salt concentration is less than 
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion concentration (or other 
salts) at pH 7.0 to 8.3 and the temperature is at least about 30°C for short probes (e.g., 10 to 

20 50 nucleotides) and at least about 60°C for long probes (e.g., greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 
formamide. For selective or specific hybridization, a positive signal is at least two times 
background, preferably 10 times background hybridization. Exemplary stringent 
hybridization conditions can be as following: 50% formamide, 5x SSC, and 1% SDS, 

25 incubating at 42°C, or, 5x SSC, 1% SDS, incubating at 65°C, with wash in 0.2x SSC, and 
0.1% SDS at 65°C. For PCR, a temperature of about 36°C is typical for low stringency 
amplification, although annealing temperatures may vary between about 32°C and 48°C 
depending on primer length. For high stringency PCR amplification, a temperature of about 
62°C is typical, although high stringency annealing temperatures can range from about 50°C 

30 to about 65°C, depending on the primer length and specificity. Typical cycle conditions for 
both high and low stringency amplifications include a denaturation phase of 90°C - 95°C for 
30 sec - 2 min., an annealing phase lasting 30 sec. - 2 min., and an extension phase of about 
72°C for 1 - 2 min. Protocols and guidelines for low and high stringency amplification 
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reactions are provided, e.g., in Innis et ah (1990) PCR Protocols, A Guide to Methods and 
Applications, Academic Press, Inc. N.Y.). 

Nucleic acids that do not hybridize to each other under stringent conditions are 
still substantially identical if the polypeptides which they encode are substantially identical. 
5 This occurs, for example, when a copy of a nucleic acid is created using the maximum codon 
degeneracy permitted by the genetic code. In such cases, the nucleic acids typically hybridize 
under moderately stringent hybridization conditions. Exemplary "moderately stringent 
hybridization conditions" include a hybridization in a buffer of 40% formamide, 1 M NaCl, 
1% SDS at 37°C, and a wash in IX SSC at 45°C. A positive hybridization is at least twice 

10 background. Those of ordinary skill will readily recognize that alternative hybridization and 
wash conditions can be utilized to provide conditions of similar stringency. Additional 
guidelines for determining hybridization parameters are provided in numerous reference, e.g. , 
and Current Protocols in Molecular Biology, ed. Ausubel, et al 

The phrase "functional effects" in the context of assays for testing compounds 

1 5 that modulate activity of an angiogenesis protein includes the determination of a parameter 
that is indirectly or directly under the influence of the angiogenesis protein, e.g., a functional, 
physical, or chemical effect, such as the ability to increase or decrease angiogenesis. It 
includes binding activity, the ability of cells to proliferate, expression in cells undergoing 
angiogenesis, and other characteristics of angiogenic cells. "Functional effects 5 ' include in 

20 vitro, in vivo, and ex vivo activities. 

By "determining the functional effect" is meant assaying for a compound that 
increases or decreases a parameter that is indirectly or directly under the influence of an 
angiogenesis protein sequence, e.g., functional, physical and chemical effects. Such 
functional effects can be measured by any means known to those skilled in the art, e.g., 

25 changes in spectroscopic characteristics (e.g., fluorescence, absorbance, refractive index), 
hydrodynamic (e.g., shape), chromatographic, or solubility properties for the protein, 
measuring inducible markers or transcriptional activation of the angiogenesis protein; 
measuring binding activity or binding assays, e.g. binding to antibodies, and measuring 
cellular proliferation, particularly endothelial cell proliferation, cell viability, cell division 

30 especially of endothelial cells, lumen formation and capillary or vessel growth or formation. 
Determination of the functional effect of a compound on angiogenesis can also be performed 
using angiogenesis assays known to those of skill in the art such as an in vitro assays, e.g., in 
vitro endothelial cell tube formation assays, and other assays such as the chick CAM assay, 
the mouse corneal assay, and assays that assess vascularization of an implanted tumor. The 
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functional effects can be evaluated by many means known to those skilled in the art, e.g., 
microscopy for quantitative or qualitative measures of alterations in morphological features, 
e.g. , tube or blood vessel formation, measurement of changes in RNA or protein levels for 
angiogenesis-associated sequences, measurement of RNA stability, identification of 
5 downstream or reporter gene expression (CAT, luciferase, |3-gal, GFP and the like), e.g., via 
chemiluminescence, fluorescence, colorimetric reactions, antibody binding, inducible 
markers, and ligand binding assays. 

"Inhibitors", "activators", and "modulators" of angiogenic polynucleotide and 
polypeptide sequences are used to refer to activating, inhibitory, or modulating molecules 

1 0 identified using in vitro and in vivo assays of angiogenic polynucleotide and polypeptide 
sequences. Inhibitors are compounds that, e.g., bind to, partially or totally block activity, 
decrease, prevent, delay activation, inactivate, desensitize, or down regulate the activity or 
expression of angiogenesis proteins, e.g., antagonists. "Activators" are compounds that 
increase, open, activate, facilitate, enhance activation, sensitize, agonize, or up regulate 

15 angiogenesis protein activity. Inhibitors, activators, or modulators also include genetically 
modified versions of angiogenesis proteins, e.g., versions with altered activity, as well as 
naturally occurring and synthetic ligands, antagonists, agonists, antibodies, small chemical 
molecules and the like. Such assays for inhibitors and activators include, e.g., expressing the 
angiogenic protein in vitro, in cells, or cell membranes, applying putative modulator 

20 compounds, and then determining the functional effects on activity, as described above. 
Activators and inhibitors of angiogenesis can also be identified by incubating angiogenic 
cells with the test compound and determining increases or decreases in the expression of 1 or 
more angiogenesis proteins, e.g., 1, 2, 3, 4, 5, 10, 15, 20, 25, 30, 40, 50 or more angiogenesis 
proteins, such as angiogenesis proteins comprising the sequences set out in Table 8. 

25 Samples or assays comprising angiogenesis proteins that are treated with a 

potential activator, inhibitor, or modulator are compared to control samples without the 
inhibitor, activator, or modulator to examine the extent of inhibition. Control samples 
(untreated with inhibitors) are assigned a relative protein activity value of 100%. Inhibition 
of a polypeptide is achieved when the activity value relative to the control is about 80%, 

30 preferably 50%, more preferably 25-0%. Activation of an angiogenesis polypeptide is 

achieved when the activity value relative to the control (untreated with activators) is 1 10%, 
more preferably 1 50%, more preferably 200-500% (i.e., two to five fold higher relative to the 
control), more preferably 1000-3000%) higher. 
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"Antibody" refers to a polypeptide comprising a framework region from an 
immunoglobulin gene or fragments thereof that specifically binds and recognizes an antigen. 
The recognized immunoglobulin genes include the kappa, lambda, alpha, gamma, delta, 
epsilon, and mu constant region genes,, as well as the myriad immunoglobulin variable region 
5 genes. Light chains are classified as either kappa or lambda. Heavy chains are classified as 
gamma, mu, alpha, delta, or epsilon, which in turn define the immunoglobulin classes, IgG, 
IgM, IgA, IgD and IgE, respectively. Typically, the antigen-binding region of an antibody 
will be most critical in specificity and affinity of binding. 

An exemplary immunoglobulin (antibody) structural unit comprises a 

10 . tetramer. Each tetramer is composed of two identical pairs of polypeptide chains, each pair 
having one "light" (about 25 kD) and one "heavy" chain (about 50-70 kD). The N-terminus 
of each chain defines a variable region of about 1 00 to 1 1 0 or more amino acids primarily 
responsible for antigen recognition. The terms variable light chain (Vl) and variable heavy 
chain (V H ) refer to these light and heavy chains respectively. 

15 Antibodies exist, e.g., as intact immunoglobulins or as a number of well- 

characterized fragments produced by digestion with various peptidases. Thus, for example, 
pepsin digests an antibody below the disulfide linkages in the hinge region to produce F(ab) J 2) 
a dimer of Fab which itself is a light chain joined to Vh-Ch1 by a disulfide bond. The F(ab)'9 
may be reduced under mild conditions to break the disulfide linkage in the hinge region, 

20 thereby converting the F(ab)'2 dimer into an Fab' monomer. The Fab' monomer is 

essentially Fab with part of the hinge region (see Fundamental Immunology (Paul ed., 3d ed. 
1993). While various antibody fragments are defined in terms of the digestion of an intact 
antibody, one of skill will appreciate that such fragments may be synthesized de novo either 
chemically or by using recombinant DNA methodology. Thus, the term antibody, as used 

25 herein, also includes antibody fragments either produced by the modification of whole 

antibodies, or those synthesized de novo using recombinant DNA methodologies (e.g., single 
chain Fv) or those identified using phage display libraries (see, e.g., McCafferty et ah, Nature 
348:552-554(1990)) 

For preparation of antibodies, e.g., recombinant, monoclonal, or polyclonal 

30 antibodies, many technique known in the art can be used (see, e.g., Kohler & Milstein, 

Nature 256:495-497 (1975); Kozbor et al, Immunology Today 4: 72 (1983); Cole et al, pp. 
77-96 in Monoclonal Antibodies and Cancer Tlterapy, Alan R. Liss, Inc. (1985); Coligan, 
Current Protocols in Immunology (1991); Harlow & Lane, Antibodies, A Laboratory Manual 
(1988); and Goding, Monoclonal Antibodies: Principles and Practice (2d ed. 1986)). 
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Techniques for the production of single chain antibodies (U.S. Patent 4,946,778) can be 
adapted to produce antibodies to polypeptides of this invention. Also, transgenic mice, or 
other organisms such as other mammals, may be used to express humanized antibodies. 
Alternatively, phage display technology can be used to identify antibodies and heteromeric 
Fab fragments that specifically bind to selected antigens (see, e.g., McCafferty et ah, Nature 
348:552-554 (1990); Marks et al, Biotechnology 10:779-783 (1992)). 

A "chimeric antibody" is an antibody molecule in which (a) the constant 
region, or a portion thereof, is altered, replaced or exchanged so that the antigen binding site 
(variable region) is linked to a constant region of a different or altered class, effector function 
and/or species, or an entirely different molecule which confers new properties to the chimeric 
antibody, e.g., an enzyme, toxin, hormone, growth factor, drug, etc.; or (b) the variable 
region, or a portion thereof, is altered, replaced or exchanged with a variable region having a 
different or altered antigen specificity. 

The detailed description of the invention includes discussion of the following 
aspects of the invention: Expression of angiogenesis-associated sequences 

Informatics 

Angiogenesis-associated sequences 

Detection of angiogenesis sequence for diagnostic and 
therapeutic applications 

Modulators of angiogenesis 

Methods of identifying variant angiogenesis-associated 
sequences 

Administration of pharmaceutical and vaccine compositions 
Kits for use in diagnostic and/or prognostic applications. 

Expression of angiogenesis-associated sequences 

In one aspect, the expression levels of genes are determined in different 
patient samples for which diagnosis information is desired, to provide expression profiles. 
An expression profile of a particular sample is essentially a "fingerprint" of the state of the 
sample; while two states may have any particular gene similarly expressed, the evaluation of 
a number of genes simultaneously allows the generation of a gene expression profile that is 
unique to the state of the cell. That is, normal tissue may be distinguished from AD tissue. 
By comparing expression profiles of tissue in known different angiogenesis states, 
information regarding which genes are important (including both up- and down-regulation of 
genes) in each of these states is obtained. The identification of sequences that are 
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differentially expressed in angiogenic versus non-angiogenic tissue allows the use of this 
information in a number of ways. For example, a particular treatment regime may be 
evaluated: does a chemotherapeutic drug act to down-regulate angiogenesis, and thus tumor 
growth or recurrence, in a particular patient. Similarly, diagnosis and treatment outcomes 
5 may be done or confirmed by comparing patient samples with the known expression profiles. 
Angiogenic tissue can also be analyzed to determine the stage of angiogenesis in the tissue. 
Furthermore, these gene expression profiles (or individual genes) allow screening of drug 
candidates with an eye to mimicking or altering a particular expression profile; for example, 
screening can be done for drugs that suppress the angiogenic expression profile. This may be 

10 done by making biochips comprising sets of the important angiogenesis genes, which can 
then be used in these screens. These methods can also be done on the protein basis; that is, 
protein expression levels of the angiogenic proteins can be evaluated for diagnostic purposes 
or to screen candidate agents. In addition, the angiogenic nucleic acid sequences can be 
administered for gene therapy purposes, including the administration of antisense nucleic 

1 5 acids, or the angiogenic proteins (including antibodies and other modulators thereof) 
administered as therapeutic drugs. 

Thus the present invention provides nucleic acid and protein sequences that 
are differentially expressed in angiogenesis, herein termed "angiogenesis sequences". As 
outlined below, angiogenesis sequences include those that are up-regulated (i.e. expressed at 

20 a higher level) in disorders associated with angiogenesis, as well as those that are down- 
regulated (i.e. expressed at a lower level). In a preferred embodiment, the angiogenesis 
sequences are from humans; however, as will be appreciated by those in the art, angiogenesis 
sequences from other organisms may be useful in animal models of disease and drug 
evaluation; thus, other angiogenesis sequences are provided, from vertebrates, including 

25 mammals, including rodents (rats, mice, hamsters, guinea pigs, etc.), primates, farm animals 
(including sheep, goats, pigs, cows, horses, etc). Angiogenesis sequences from other 
organisms may be obtained using the techniques outlined below. 

Angiogenesis sequences can include both nucleic acid and amino acid 
sequences. In a preferred embodiment, the angiogenesis sequences are recombinant nucleic 

30 acids. By the term "recombinant nucleic acid" herein is meant nucleic acid, originally formed 
in vitro, in general, by the manipulation of nucleic acid e.g., using polymerases and 
endonucleases, in a form not normally found in nature. Thus an isolated nucleic acid, in a 
linear form, or an expression vector formed in vitro by ligating DNA molecules that are not 
normally joined, are both considered recombinant for the purposes of this invention. It is 



18 



WO 02/079492 PCT/US02/04915 

understood that once a recombinant nucleic acid is made and reintroduced into a host cell or 
organism, it will replicate non-recombinantly, i.e. using the in vivo cellular machinery of the 
host cell rather than in vitro manipulations; however, such nucleic acids, once produced 
recombinantly, although subsequently replicated non-recombinantly, are still considered 
5 recombinant for the purposes of the invention. 

Similarly, a "recombinant protein" is a protein made using recombinant 
techniques, /. e. through the expression of a recombinant nucleic acid as depicted above. A 
recombinant protein is distinguished from naturally occurring protein by at least one or more 
characteristics. For example, the protein may be isolated or purified away from some or all 

10 of the proteins and compounds with which it is normally associated in its wild type host, and 
thus may be substantially pure. For example, an isolated protein is unaccompanied by at least 
some of the material with which it is normally associated in its natural state, preferably 
constituting at least about 0.5%, more preferably at least about 5% by weight of the total 
protein in a given sample. A substantially pure protein comprises at least about 75% by 

15 weight of the total protein, with at least about 80% being preferred, and at least about 90% 
being particularly preferred. The definition includes the production of an angiogenesis protein 
from one organism in a different organism or host cell. Alternatively, the protein may be 
made at a significantly higher concentration than is normally seen, through the use of an 
inducible promoter or high expression promoter, such that the protein is made at increased 

20 concentration levels. Alternatively, the protein may be in a form not normally found in 
nature, as in the addition of an epitope tag or amino acid substitutions, insertions and 
deletions, as discussed below. 

In a preferred embodiment, the angiogenesis sequences are nucleic acids. As 
will be appreciated by those in the art and is more fully outlined below, angiogenesis 

25 sequences are useful in a variety of applications, including diagnostic applications, which will 

r 

detect naturally occurring nucleic acids, as well as screening applications; for example, 
biochips comprising nucleic acid probes to the angiogenesis sequences can be generated. In 
the broadest sense, then, by "nucleic acid" or "oligonucleotide" or grammatical equivalents 
herein means at least two nucleotides covalently linked together. A nucleic acid of the 
30 present invention will generally contain phosphodiester bonds, although in some cases, 
nucleic acid analogs are included that may have alternate backbones, comprising, for 
example, phosphoramidate, phosphorothioate, phosphorodithioate, or O- 
methylphophoroamidite linkages (see Eckstein, Oligonucleotides and Analogues: A Practical 
Approach, Oxford University Press); and peptide nucleic acid backbones and linkages. Other 
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analog nucleic acids include those with positive backbones; non-ionic backbones, and non- 
ribose backbones, including those described in U.S. Patent Nos. 5,235,033 and 5,034,506, 
and Chapters 6 and 7, ASC Symposium Series 580, "Carbohydrate Modifications in 
Antisense Research", Ed. Y.S. Sanghui and P. Dan Cook. Nucleic acids containing one or 
5 more carbocyclic sugars are also included within one definition of nucleic acids. 

Modifications of the ribose-phosphate backbone may be done for a variety of reasons, for 
example to increase the stability and half-life of such molecules in physiological 
environments or as probes on a biochip. 

As will be appreciated by those in the art, nucleic acid analogs may find use in 

10 the present invention. In addition, mixtures of naturally occurring nucleic acids and analogs 
can be made; alternatively, mixtures of different nucleic acid analogs, and mixtures of 
naturally occurring nucleic acids and analogs may be made. 

Particularly preferred are peptide nucleic acids (PNA) which includes peptide 
nucleic acid analogs. These backbones are substantially non-ionic under neutral conditions, in 

1 5 contrast to the highly charged phosphodiester backbone of naturally occurring nucleic acids. 
This results in two advantages. First, the PNA backbone exhibits improved hybridization 
kinetics. PNAs have larger changes in the melting temperature (Tm) for mismatched versus 
perfectly matched basepairs. DNA and RNA typically exhibit a 2-4°C drop in T m for an 
internal mismatch. With the non-ionic PNA backbone, the drop is closer to 7-9°C. Similarly, 

20 due to their non-ionic nature, hybridization of the bases attached to these backbones is 

relatively insensitive to salt concentration. In addition, PNAs are not degraded by cellular 
enzymes, and thus can be more stable. 

The nucleic acids may be single stranded or double stranded, as specified, or 
contain portions of both double stranded or single stranded sequence. As will be appreciated 

25 by those in the art, the depiction of a single strand also defines the sequence of the 

complementary strand; thus the sequences described herein also provide the complement of 
the sequence. The nucleic acid may be DNA, both genomic and cDNA, RNA or a hybrid, 
where the nucleic acid may contain combinations of deoxyribo- and ribo-nucleotides, and 
combinations of bases, including uracil, adenine, thymine, cytosine, guanine, inosine, 

30 xanthine hypoxanthine, isocytosine, isoguanine, etc. As used herein, the term "nucleoside" 
includes nucleotides and nucleoside and nucleotide analogs, and modified nucleosides such 
as amino modified nucleosides. In addition, "nucleoside" includes non-naturally occurring 
analog structures. Thus for example the individual units of a peptide nucleic acid, each 
containing a base, are referred to herein as a nucleoside. 
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An angiogenesis sequence can be initially identified by substantial nucleic 
acid and/or amino acid sequence homology to the angiogenesis sequences outlined herein. 
Such homology can be based upon the overall nucleic acid or amino acid sequence, and is 
generally determined as outlined below, using either homology programs or hybridization 
5 conditions. 

For identifying angiogenesis-associated sequences, the angiogenesis screen 
typically includes comparing genes identified in a modification of an in vitro model of 
angiogenesis as described in Hiraoka, Cell 95:365 (1998) with genes identified in controls. 
Samples of normal tissue and tissue undergoing angiogenesis are applied to biochips 

10 comprising nucleic acid probes. The samples are first microdissected, if applicable, and 
treated as is known in the art for the preparation of mRNA. Suitable biochips are 
commercially available, for example from Affymetrix. Gene expression profiles as described 
herein are generated and the data analyzed. 

In a preferred embodiment, the genes showing changes in expression as 

15 between normal and disease states are compared to genes expressed in other normal tissues, 
including, but not limited to lung, heart, brain, liver, breast, kidney, muscle, prostate, small 
intestine, large intestine, spleen, bone and placenta. In a preferred embodiment, those genes 
identified during the angiogenesis screen that are expressed in any significant amount in other 
tissues are removed from the profile, although in some embodiments, this is not necessary. 

20 That is, when screening for drugs, it is usually preferable that the target be disease specific, to 
minimize possible side effects. 

In a preferred embodiment, angiogenesis sequences are those that are up- 
regulated in angiogenesis disorders; that is, the expression of these genes is higher in the 
disease tissue as compared to normal tissue. "Up-regulation" as used herein means at least 

25 about a two-fold change, preferably at least about a three fold change, with at least about 
five-fold or higher being preferred. All accession numbers herein are for the GenBank 
sequence database and the sequences of the accession numbers are hereby expressly 
incorporated by reference. GenBank is known in the art, see, e.g., Benson, DA, et al., 
Nucleic Acids Research 26:1-7 (1998) and http://www.ncbi.nlm.nili.gov/. Sequences are also 

30 avialable in other databases, e.g., European Molecular Biology Laboratory (EMBL) and 
DNA Database of Japan (DDBJ). In addition, most preferred genes were found to be 
expressed in a limited amount or not at all in heart, brain, lung, liver, breast, kidney, prostate, 
small intestine and spleen. 
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In another preferred embodiment, angiogenesis sequences are those that are 
down-regulated in the angiogenesis disorder; that is, the expression of these genes is lower in 
angiogenic tissue as compared to normal tissue. "Down-regulation" as used herein means at 
least about a two-fold change, preferably at least about a three fold change, with at least about 
5 five-fold or higher being preferred. 

Angiogenesis sequences according to the invention may be classified into 
discrete clusters of sequences based on common expression profiles of the sequences. 
Expression levels of angiogenesis sequences may increase or decrease as a function of time in 
a manner that correlates with the induction of angiogenesis. Alternatively, expression levels 

10 of angiogenesis sequences may both increase and decrease as a function of time. For 
example, expression levels of some angiogenesis sequences are temporarily induced or 
diminished during the switch to the angiogenesis phenotype, followed by a return to baseline 
expression levels. Tables 1-8 provides genes, the mRNA expression of which varies as a 
function of time in angiogenesis tissue when compared to normal tissue. 

15 In a particularly preferred embodiment, angiogenesis sequences are those that 

are induced for a period of time, typically by positive angiogenic factors, followed by a return 
to the baseline levels. Sequences that are temporarily induced provide a means to target 
angiogenesis tissue, for example neovascularized tumors, at a particular stage of 
angiogenesis, while avoiding rapidly growing tissue that require perpetual vascularization. 

20 Such positive angiogenic factors include ocFGF, (3FGF, VEGF, angiogenin and the like. 

Induced angiogenesis sequences also are further categorized with respect to 
the timing of induction. For example, some angiogenesis genes may be induced at an early 
time period, such as within 10 minutes of the induction of angiogenesis. Others may be 
induced later, such as between 5 and 60 minutes, while yet others may be induced for a time 

25 period of about two hours or more followed by a return to baseline expression levels. 

In another preferred embodiment are angiogenesis sequences that are inhibited 
or reduced as a function of time followed by a return to "normal" expression levels. 
Inhibitors of angiogenesis are examples of molecules that have this expression profile. These 
sequences also can be further divided into groups depending on the timing of diminished 

30 expression. For example, some molecules may display reduced expression within 10 minutes 
of the induction of angiogenesis. Others may be diminished later, such as between 5 and 60 
minutes, while others may be diminished for a time period of about two hours or more 
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followed by a return to baseline. Examples of such negative angiogenic factors include 
thrombospondin and endostatin to name a few. 

In yet another preferred embodiment are angiogenesis sequences that are 
induced for prolonged periods. These sequences are typically associated with induction of 
5 angiogenesis and may participate in induction and/or maintenance of the angiogenesis 

T 

phenotype. 

In another preferred embodiment are angiogenesis sequences, the expression 
of which is reduced or diminished for prolonged periods in angiogenic tissue. These 
sequences are typically angiogenesis inhibitors and their diminution is correlated with an 
10 increase in angiogenesis. 

Informatics 

The ability to identify genes that undergo changes in expression with time 
during angiogenesis can additionally provide high-resolution, high-sensitivity datasets which 

1 5 can be used in the areas of diagnostics, therapeutics, drug development, biosensor 

development, and other related areas. For example, the expression profiles can be used in 
diagnostic or prognostic evaluation of patients with angiogenesis-associated disease. Or as 
another example, subcellular toxicological information can be generated to better direct drug 
structure and activity correlation {see, Anderson, L., "Pharmaceutical Proteomics: Targets, 

20 Mechanism, and Function/' paper presented at the IBC Proteomics conference, Coronado, 
CA (June 1 1-12, 1998)). Subcellular toxicological information can also be utilized in a 
biological sensor device to predict the likely toxicological effect of chemical exposures and 
likely tolerable exposure thresholds {see, U.S. Patent No. 5,81 1,231). Similar advantages 
accrue from datasets relevant to other biomolecules and bioactive agents {e.g., nucleic acids, 

25 saccharides, lipids, drugs, and the like). 

Thus, in another embodiment, the present invention provides a database that 
includes at least one set of data assay data. The data contained in the database is acquired , 
e.g., using array analysis either singly or in a library format. The database can be in 
substantially any form in which data can be maintained and transmitted, but is preferably an 

30 electronic database. The electronic database of the invention can be maintained on any 
electronic device allowing for the storage of and access to the database, such as a personal 
computer, but is preferably distributed on a wide area network, such as the World Wide Web. 
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The focus of the present section on databases that include peptide sequence 
data is for clarity of illustration only. It will be apparent to those of skill in the art that similar 
databases can be assembled for any assay data acquired using an assay of the invention. 

The compositions and methods for identifying and/or quantitating the relative 
5 and/or absolute abundance of a variety of molecular and macromolecular species from a 
biological sample undergoing angiogenesis, i.e., the identification of angiogenesis-associated 
sequences described herein, provide an abundance of information, which can be correlated 
with pathological conditions, predisposition to disease, drug testing, therapeutic monitoring, 
gene-disease causal linkages, identification of correlates of immunity and physiological 

10 status, among others. Although the data generated from the assays of the invention is suited 
for manual review and analysis, in a preferred embodiment, prior data processing using high- 
speed computers is utilized. 

An array of methods for indexing and retrieving biomolecular information is 
known in the art. For example, U.S. Patents 6,023,659 and 5,966,712 disclose a relational 

15 database system for storing biomolecular sequence information in a manner that allows 
sequences to be catalogued and searched according to one or more protein function 
hierarchies. U.S. Patent 5,953,727 discloses a relational database having sequence records 
containing information in a format that allows a collection of partial-length DNA sequences 
to be catalogued and searched according to association with one or more sequencing projects 

20 for obtaining full-length sequences from the collection of partial length sequences. U.S. 
Patent 5,706,498 discloses a gene database retrieval system for making a retrieval of a gene 
sequence similar to a sequence data item in a gene database based on the degree of similarity 
between a key sequence and a target sequence. U.S. Patent 5,538,897 discloses a method 
using mass spectroscopy fragmentation patterns of peptides to identify amino acid sequences 

25 in computer databases by comparison of predicted mass spectra with experimentally-derived 
mass spectra using a closeness-of-fit measure. U.S. Patent 5,926,818 discloses a multi- 
dimensional database comprising a functionality for multi-dimensional data analysis 
described as on-line analytical processing (OLAP), which entails the consolidation of 
projected and actual data according to more than one consolidation path or dimension. U.S. 

30 Patent 5,295,261 reports a hybrid database structure in which the fields of each database 
record are divided into two classes, navigational and informational data, with navigational 
fields stored in a hierarchical topological map which can be viewed as a tree structure or as 
the merger of two or more such tree structures. 
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The present invention provides a computer database comprising a computer 
and software for storing in computer-retrievable form assay data records cross-tabulated, e.g. , 
with data specifying the source of the target-containing sample from which each sequence 
specificity record was obtained. 

In an exemplary embodiment, at least one of the sources of target-containing 
sample is from a control tissue sample known to be free of pathological disorders. In a 
variation, at least one of the sources is a known pathological tissue specimen, e.g., a 
neoplastic lesion or another tissue specimen to be analyzed for angiogenesis. In another 
variation, the assay records cross-tabulate one or more of the following parameters for each 
target species in a sample: (1) a unique identification code, which can include, e.g., a target 
molecular structure and/or characteristic separation coordinate (e.g., electrophoretic 
coordinates); (2) sample source; and (3) absolute and/or relative quantity of the target species 
present in the sample. 

The invention also provides for the storage and retrieval of a collection of 
target data in a computer data storage apparatus, which can include magnetic disks, optical 
disks, magneto-optical disks, DRAM, SRAM, SGRAM, SDRAM, RDRAM, DDR RAM, 
magnetic bubble memory devices, and other data storage devices, including CPU registers 
and on-CPU data storage arrays. Typically, the target data records are stored as a bit pattern 
in an array of magnetic domains on a magnetizable medium or as an array of charge states or 
transistor gate states, such as an array of cells in a DRAM device (e.g., each cell comprised of 
a transistor and a charge storage area, which may be on the transistor). In one embodiment, 
the invention provides such storage devices, and computer systems built therewith, 
comprising a bit pattern encoding a protein expression fingerprint record comprising unique 
identifiers for at least 10 target data records cross-tabulated with target source. 

When the target is a peptide or nucleic acid, the invention preferably provides 
a method for identifying related peptide or nucleic acid sequences, comprising performing a 
computerized comparison between a peptide or nucleic acid sequence assay record stored in 
or retrieved from a computer storage device or database and at least one other sequence. The 
comparison can include a sequence analysis or comparison algorithm or computer program 
embodiment thereof (e.g., FASTA, TFASTA, GAP, BESTFIT) and/or the comparison may 
be of the relative amount of a peptide or nucleic acid sequence in a pool of sequences 
determined from a polypeptide or nucleic acid sample of a specimen. 

The invention also preferably provides a magnetic disk, such as an IBM- 
compatible (DOS, Windows, Windows95/98/2000, Windows NT, OS/2) or other format 
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/ 

(e.g., Linux, SunOS, Solaris, AIX, SCO Unix, VMS, MV, Macintosh, etc.). floppy diskette or 
hard (fixed, Winchester) disk drive, comprising a bit pattern encoding data from an assay of 
the invention in a file format suitable for retrieval and processing in a computerized sequence 
analysis, comparison, or relative quantitation method. 
5 The invention also provides a network, comprising a plurality of computing 

devices linked via a data link, such as an Ethernet cable (coax or lOBaseT), telephone line, 
ISDN line, wireless network, optical fiber, or other suitable signal tranmission medium, 
whereby at least one network device (e.g., computer, disk array, etc.) comprises a pattern of 
magnetic domains (e.g., magnetic disk) and/or charge domains (e.g., an array of DRAM 

1 0 cells) composing a bit pattern encoding data acquired from an assay of the invention. 

The invention also provides a method for transmitting assay data that includes 
generating an electronic signal on an electronic communications device, such as a modem, 
ISDN terminal adapter, DSL, cable modem, ATM switch, or the like, wherein the signal 
includes (in native or encrypted format) a bit pattern encoding data from an assay or a 

1 5 database comprising a plurality of assay results obtained by the method of the invention. 

In a preferred embodiment, the invention provides a computer system for 
comparing a query target to a database containing an array of data structures, such as an assay 
result obtained by the method of the invention, and ranking database targets based on the 
degree of identity and gap weight to the target data. A central processor is preferably 

20 initialized to load and execute the computer program for alignment and/or comparison of the 
assay results. Data for a query target is entered into the central processor via an I/O device. 
Execution of the computer program results in the central processor retrieving the assay data 
from the data file, which comprises a binary description of an assay result. 

The target data or record and the computer program can be transferred to 

25 secondary memory, which is typically random access memory (e.g. , DRAM, SRAM, 
SGRAM, or SDRAM). Targets are ranked according to the degree of correspondence 
between a selected assay characteristic (e.g., binding to a selected affinity moiety) and the 
same characteristic of the query target and results are output via an I/O device. For example, 
a central processor can be a conventional computer (e.g., Intel Pentium, PowerPC, Alpha, 

30 PA-8000, SPARC, MIPS 4400, MIPS 10000, VAX, etc.); a program can be a commercial or 
public domain molecular biology software package (e.g., UWGCG Sequence Analysis 
Software, Darwin); a data file can be an optical or magnetic disk, a data server, a memory 
device (e.g., DRAM, SRAM, SGRAM, SDRAM, EPROM, bubble memory, flash memory, 
etc.)', an I/O device can be a terminal comprising a video display and a keyboard, a modem, 
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an ISDN terminal adapter, an Ethernet port, a punched card reader, a magnetic strip reader, or 
other suitable I/O device. 

The invention also preferably provides the use of a computer system, such as 
that described above, which comprises: (1) a computer; (2) a stored bit pattern encoding a 
5 collection of peptide sequence specificity records obtained by the methods of the invention, 
which may be stored in the computer; (3) a comparison target, such as a query target; and (4) 
a program for alignment and comparison, typically with rank-ordering of comparison results 
on the basis of computed similarity values. 

1 0 Angiogenesis-associated sequences 

Angiogenesis proteins of the present invention may be classified as secreted 
proteins, transmembrane proteins or intracellular proteins. In one embodiment,the 
angiogenesis protein is an intracellular protein. Intracellular proteins may be found in the 
cytoplasm and/or in the nucleus or associated with the intracellular side of the plasma 

15 membrane. Intracellular proteins are involved in all aspects of cellular function and 

replication (including, e.g. , signaling pathways); aberrant expression of such proteins often 
results in unregulated or disregulated cellular processes (see, e.g., Molecular Biology of the 
Cell, 3rd Edition, Alberts, Ed., Garland Pub., 1994). For example, many intracellular 
proteins have enzymatic activity such as protein kinase activity, protein phosphatase activity, 

20 protease activity, nucleotide cyclase activity, polymerase activity and the like. Intracellular 
•proteins also serve as docking proteins that are involved in organizing complexes of proteins, 
or targeting proteins to various subcellular localizations, and are involved in maintaining the 
structural integrity of organelles. 

An increasingly appreciated concept in characterizing proteins is the presence 

25 in the proteins of one or more motifs for which defined functions have been attributed. In 

addition to the highly conserved sequences found in the enzymatic domain of proteins, highly 
conserved sequences have been identified in proteins that are involved in protein-protein 
interaction. For example, Src-homology-2 (SH2) domains bind tyrosine-phosphorylated 
targets in a sequence dependent manner. PTB domains, which are distinct from SH2 

30 domains, also bind tyrosine phosphorylated targets. SH3 domains bind to proline-rich 

targets. In addition, PH domains, tetratricopeptide repeats and WD domains to name only a 
few, have been shown to mediate protein-protein interactions. Some of these may also be 
involved in binding to phospholipids or other second messengers. As will be appreciated by 
one of ordinary skill in the art, these motifs can be identified on the basis of primary 
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sequence; thus, an analysis of the sequence of proteins may provide insight into both the 
enzymatic potential of the molecule and/or molecules with which the protein may associate. 

In another embodiment, the angiogenesis sequences are transmembrane 
proteins. Transmembrane proteins are molecules that span a phospholipid bilayer of a cell. 
5 They may have an intracellular domain, an extracellular domain, of both. The intracellular 
domains of such proteins may have a number of functions including those already described 
for intracellular proteins. For example, the intracellular domain may have enzymatic activity 
and/or may serve as a binding site for additional proteins. Frequently the intracellular 
domain of transmembrane proteins serves both roles. For example certain receptor tyrosine 

10 kinases have both protein kinase activity and SH2 domains. In addition, autophosphorylation 
of tyrosines on the receptor molecule itself, creates binding sites for additional SH2 domain 
containing proteins. 

Transmembrane proteins may contain from one to many transmembrane 
domains. For example, receptor tyrosine kinases, certain cytokine receptors, receptor 

1 5 guanylyl cyclases and receptor serine/threonine protein kinases contain a single 

transmembrane domain. However, various other proteins including channels and adenylyl 
cyclases contain numerous transmembrane domains. Many important cell surface receptors 
such as G protein coupled receptors (GPCRs) are classified as "seven transmembrane 
domain" proteins, as they contain 7 membrane spanning regions. Characteristics of 

20 transmembrane domains include approximately 20 consecutive hydrophobic amino acids that 
may be followed or flanked by charged amino acids. Therefore, upon analysis of the amino 
acid sequence of a particular protein, the localization and number of transmembrane domains 
within the protein may be predicted (see, e.g. PSORT web site http://psort.nibb.ac.jp/). 

The extracellular domains of transmembrane proteins are diverse; however, 

25 conserved motifs are found repeatedly among various extracellular domains. Conserved 
structure and/or functions have been ascribed to different extracellular motifs. Many 
extracellular domains are involved in binding to other molecules. In one aspect, extracellular 
domains are found on receptors. Factors that bind the receptor domain include circulating 
ligands, which may be peptides, proteins, or small molecules such as adenosine and the like. 

30 For example, growth factors such as EGF, FGF and PDGF are circulating growth factors that 
bind to their cognate receptors to initiate a variety of cellular responses. Other factors include 
cytokines, mitogenic factors, neurotrophic factors and the like. Extracellular domains also 
bind to cell-associated molecules. In this respect, they mediate cell-cell interactions. Cell- 
associated ligands can be tethered to the cell for example via a glycosylphosphatidylinositol 
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(GPI) anchor, or may themselves be transmembrane proteins. Extracellular domains also 

* 

associate with the extracellular matrix and contribute to the maintenance of the cell structure. 

Angiogenesis proteins that are transmembrane are particularly preferred in the 
present invention as they are readily accessible targets for immunotherapeutics, as are 
5 described herein. In addition, as outlined below, transmembrane proteins can be also useful 
in imaging modalities. Antibodies may be used to label such readily accessible proteins in 
situ. Alternatively, antibodies can also label intracellular proteins, in which case samples are 
typically permeablized to provide acess to intracellular proteins. 

It will also be appreciated by those in the art that a transmembrane protein can 

10 be made soluble by removing transmembrane sequences, for example through recombinant 
methods. Furthermore, transmembrane proteins that have been made soluble can be made to 
be secreted through recombinant means by adding an appropriate signal sequence. 

In another embodiment, the angiogenesis proteins are secreted proteins; the 
secretion of which can be either constitutive or regulated. These proteins have a signal 

1 5 peptide or signal sequence that targets the molecule to the secretory pathway. Secreted 

proteins are involved in numerous physiological events; by virtue of their circulating nature, 
they serve to transmit signals to various other cell types. The secreted protein may function in 
an autocrine manner (acting on the cell that secreted the factor), a paracrine manner (acting 
on cells in close proximity to the cell that secreted the factor) or an endocrine manner (acting 

20 on cells at a distance). Thus secreted molecules find use in modulating or altering numerous 
aspects of physiology. Angiogenesis proteins that are secreted proteins are particularly 
preferred in the present invention as they serve as good targets for diagnostic markers, e.g., 
for blood or serum tests. 

An angiogenesis sequence is typically initially identified by substantial nucleic 

25 acid and/or amino acid sequence homology or linkage to the angiogenesis sequences outlined 
herein. Such homology can be based upon the overall nucleic acid or amino acid sequence, 
and is generally determined as outlined below, using either homology programs or 
hybridization conditions. Typically, linked sequences on a mRNA are found on the same 
molecule. 

30 As detailed in the definitions, percent identity can be determined using an 

algorithm such as BLAST. A preferred method utilizes the BLASTN module of WTJ- 
BLAST-2 set to the default parameters, with overlap span and overlap fraction set to 1 and 
0.125, respectively. The alignment may include the introduction of gaps in the sequences to 
be aligned. In addition, for sequences which contain either more or fewer nucleotides than 
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those of the nucleic acids of the figures, it is understood that the percentage of homology will 
be determined based on the number of homologous nucleosides in relation to the total number 
of nucleosides. Thus, for example, homology of sequences shorter than those of the 
sequences identified herein and as discussed below, will be determined using the number of 

* 

5 nucleosides in the shorter sequence. 

In one embodiment, the nucleic acid homology is determined through 
hybridization studies. Thus, e.g., nucleic acids which hybridize under high stringency to a 
nucleic acid of Tables 1-8 , or its complement, or is also found on naturally occurring 
mRNAs is considered an angiogenesis sequence. In another embodiment, less stringent 

1 0 hybridization conditions are used; for example, moderate or low stringency conditions may 
be used, as are known in the art; see Ausubel, supra, and Tijssen, supra. 

In addition, the angiogenesis nucleic acid sequences of the invention, e.g, the 
sequence in Tables 1-8 , are fragments of larger genes, i.e. they are nucleic acid segments. 
"Genes" in this context includes coding regions, non-coding regions, and mixtures of coding 

1 5 and non-coding regions. Accordingly, as will be appreciated by those in the art, using the 
sequences provided herein, extended sequences, in either direction, of the angiogenesis genes 
can be obtained, using techniques well known in the art for cloning either longer sequences or 
the full length sequences; see Ausubel, et al, supra. Much can be done by informatics and 
many sequences can be clustered to include multiple sequences, e.g., systems such as 

20 UniGene (see, http://www.ncbi.nlm.nih.gov/UniGene/). 

Once the angiogenesis nucleic acid is identified, it can be cloned and, if 
necessary, its constituent parts recombined to form the entire angiogenesis nucleic acid 
coding regions or the entire mRNA sequence. Once isolated from its natural source, e.g., 
contained within a plasmid or other vector or excised therefrom as a linear nucleic acid 

25 segment, the recombinant angiogenesis nucleic acid can be further-used as a probe to identify 
and isolate other angiogenesis nucleic acids, for example extended coding regions. It can 
also be used as a "precursor" nucleic acid to make modified or variant angiogenesis nucleic 
acids and proteins. 

The angiogenesis nucleic acids of the present invention are used in several 
30 ways. In a first embodiment, nucleic acid probes to the angiogenesis nucleic acids are made 
and attached to biochips to be used in screening and diagnostic methods, as outlined below, 
or for administration, for example for gene therapy, vaccine, and/or antisense applications. 
Alternatively, the angiogenesis nucleic acids that include coding regions of angiogenesis 
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proteins can be put into expression vectors for the expression of angiogenesis proteins, again 
for screening purposes or for administration to a patient. 

In a preferred embodiment, nucleic acid probes to angiogenesis nucleic acids 
(both the nucleic acid sequences outlined in the figures and/or the complements thereof) are 
5 made. The nucleic acid probes attached to the biochip are designed to be substantially 

* 

complementary to the angiogenesis nucleic acids, i.e. the target sequence (either the target 
sequence of the sample or to other probe sequences, for example in sandwich assays), such 
that hybridization of the target sequence and the probes of the present invention occurs. As 
outlined below, this complementarity need not be perfect; there may be any number of base 

1 0 pair mismatches which will interfere with hybridization between the target sequence and the 
single stranded nucleic acids of the present invention. However, if the number of mutations 
is so great that no hybridization can occur under even the least stringent of hybridization 
conditions, the sequence is not a complementary target sequence. Thus, by "substantially 
complementary" herein is meant that the probes are sufficiently complementary to the target 

1 5 sequences to hybridize under normal reaction conditions, particularly high stringency 
conditions, as outlined herein. 

A nucleic acid probe is generally single stranded but can be partially single 
and partially double stranded. The strandedness of the probe is dictated by the structure, 
composition, and properties of the target sequence. In general, the nucleic acid probes range 

20 from about 8 to about 100 bases long, with from about 10 to about 80 bases being preferred, 
and from about 30 to about 50 bases being particularly preferred. That is, generally whole 
genes are not used. In some embodiments, much longer nucleic acids can be used, up to 
hundreds of bases. 

In a preferred embodiment, more than one probe per sequence is used, with 
25 either overlapping probes or probes to different sections of the target being used. That is, 

two, three, four or more probes, with three being preferred, are used to build in a redundancy 
for a particular target. The probes can be overlapping (i.e. have some sequence in common), 
or separate. In some cases, PCR primers may be used to amplify signal for higher sensitivity. 

As will be appreciated by those in the art, nucleic acids can be attached or 
30 immobilized to a solid support in a wide variety of ways. By "immobilized" and grammatical 
equivalents herein is meant the association or binding between the nucleic acid probe and the 
solid support is sufficient to be stable under the conditions of binding, washing, analysis, and 
removal as outlined below. The binding can typically be covalent or non-covalent. By "non- 
covalent binding" and grammatical equivalents herein is meant one or more of electrostatic, 
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hydrophilic, and hydrophobic interactions. Included in non-covalent binding is the covalent 
attachment of a molecule, such as, streptavidin to the support and the non-covalent binding of 
the biotinylated probe to the streptavidin. By "covalent binding" and grammatical 
equivalents herein is meant that the two moieties, the solid support and the probe, are 
5 attached by at least one bond, including sigma bonds, pi bonds and coordination bonds. 
Covalent bonds can be formed directly between the probe and the solid support or can be 
formed by a cross linker or by inclusion of a specific reactive group on either the solid 
support or the probe or both molecules. Immobilization may also involve a combination of 
covalent and non-covalent interactions. 
10 In general, the probes are attached to the biochip in a wide variety of ways, as 

will be appreciated by those in the art. As described herein, the nucleic acids can either be 
synthesized first, with subsequent attachment to the biochip, or can be directly synthesized on 
the biochip. 

The biochip comprises a suitable solid substrate. By "substrate" or "solid 

15 support" or other grammatical equivalents herein is meant a material that can be modified to 
contain discrete individual sites appropriate for the attachment or association of the nucleic 
acid probes and is amenable to at least one detection method. As will be appreciated by those 
in the art, the number of possible substrates are very large, and include, but are not' limited to, 
glass and modified or functionalized glass, plastics (including acrylics, polystyrene and 

20 copolymers of styrene and other materials, polypropylene, polyethylene, polybutylene, 

polyurethanes, TeflonJ, etc.), polysaccharides, nylon or nitrocellulose, resins, silica or silica- 
based materials including silicon and modified silicon, carbon, metals, inorganic glasses, 
plastics, etc. In general, the substrates allow optical detection and do not appreciably 
fluorescese. A preferred substrate is described in copending application entitled Reusable 

25 Low Fluorescent Plastic Biochip, U.S. Application Serial No. 09/270,214, filed March 15, 
1999, herein incorporated by reference in its entirety. 

Generally the substrate is planar, although as will be appreciated by those in 
the art, other configurations of substrates may be used as well. For example, the probes may 
be placed on the inside surface of a tube, for flow-through sample analysis to minimize 

30 sample volume. Similarly, the substrate may be flexible, such as a flexible foam, including 

* 

closed cell foams made of particular plastics. 

In a preferred embodiment, the surface of the biochip and the probe may be 
derivatized with chemical functional groups for subsequent attachment of the two. Thus, for 
example, the biochip is derivatized with a chemical functional group including, but not 
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limited to, amino groups, carboxy groups, oxo groups and thiol groups, with amino groups 
being particularly preferred. Using these functional groups, the probes can be attached using 
functional groups on the probes. For example, nucleic acids containing amino groups can be 
attached to surfaces comprising amino groups, for example using linkers as are known in the 
5 art; for example, homo-or hetero-bifunctional linkers as are well known (see 1994 Pierce 
Chemical Company catalog, technical section on cross-linkers, pages 155-200, incorporated 
herein by reference). In addition, in some cases, additional linkers, such as alkyl groups 
(including substituted and heteroalkyl groups) may be used. 

In this embodiment, oligonucleotides are synthesized as is known in the art, 

1 0 and then attached to the surface of the solid support. As will be appreciated by those skilled 
in the art, either the 5' or 3' terminus may be attached to the solid support, or attachment may 
be via an internal nucleoside. 

In another embodiment, the immobilization to the solid support may be very 
strong, yet non-covalent. For example, biotinylated oligonucleotides can be made, which 

1 5 bind to surfaces covalently coated with streptavidin, resulting in attachment. 

Alternatively, the oligonucleotides may be synthesized on the surface, as is 
known in the art. For example, photoactivation techniques utilizing photopolymerization 
compounds and techniques are used. In a preferred embodiment, the nucleic acids can be 
synthesized in situ, using well known photolithographic techniques, such as those described 

20 in WO 95/251 16; WO 95/35505; U.S. Patent Nos. 5,700,637 and 5,445,934; and references 
cited within, all of which are expressly incorporated by reference; these methods of 
attachment form the basis of the Affimetrix GeneChip™ technology. 

Often, amplification-based assays are performed to measure the expression 
level of angiogenesis-associated sequences. These assays are typically perfonned in 

25 conjunction with reverse transcription. In such assays, an angiogenesis-associated nucleic 
acid sequence acts as a template in an amplification reaction (e.g., Polymerase Chain 
Reaction, or PCR). In a quantitative amplification, the amount of amplification product will 
be proportional to the amount of template in the original sample. Comparison to appropriate 
controls provides a measure of the amount of angiogenesis-associated RNA Methods of 

30 quantitative amplification are well known to those of skill in the art. Detailed protocols for 
quantitative PCR are provided, e.g., in Innis et al (1990) PCR Protocols, A Guide to Methods 
and Applications , Academic Press, Inc. N.Y.). 

In some embodiments, a TaqMan based assay is used to measure expression. 
TaqMan based assays use a fluorogenic oligonucleotide probe that contains a 5' fluorescent 
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dye and a 3' quenching agent. The probe hybridizes to a PCR product, but cannot itself be 
extended due to a blocking agent at the 3' end. When the PCR product is amplified in 
subsequent cycles, the 5' nuclease activity of the polymerase, e.g., AmpliTaq, results in the 
cleavage of the TaqMan probe. This cleavage separates the 5' fluorescent dye and the 3' 
5 quenching agent, thereby resulting in an increase in fluorescence as a function of 

amplification (see, for example, literature provided by Perkin-Elmer, e.g., www2.perkin- 
elmer.com). 

Other suitable amplification methods include, but are not limited to, ligase 
chain reaction (LCR) {see, Wu and Wallace (1989) Genomics 4: 560, Landegren et al (1988) 

10 Science 241 : 1077, and Barringer et al (1990) Gene 89: 117), transcription amplification 

■ 

(Kwoh etal (1989) Proc. Natl Acad. Set USA 86: 1173), self-sustained sequence replication 
(Guatelli et al. (1990) Proc. Nat. Acad. Sci. USA 87: 1874), dot PCR, and linker adapter PCR, 
etc. 

In a preferred embodiment, angiogenesis nucleic acids, e.g., encoding 

1 5 angiogenesis proteins are used to make a variety of expression vectors to express 

angiogenesis proteins which can then be used in screening assays, as described below. 
Expression vectors and recombinant DNA technology are well known to those of skill in the 
art (see, e.g., Ausubel, supra, and Gene Expression Systems, Fernandez & Hoeffler, Eds, 
Academic Press, 1999) and are used to express proteins. The expression vectors may be 

20 either self-replicating extrachromosomal vectors or vectors which integrate into a host 
genome. Generally, these expression vectors include transcriptional and translational 
regulatory nucleic acid operably linked to the nucleic acid encoding the angiogenesis protein. 
The term "control sequences" refers to DNA sequences used for the expression of an 
operably linked coding sequence in a particular host organism. Control sequences that are 

25 suitable for prokaryotes, for example, include a promoter, optionally an operator sequence, 
and a ribosome binding site. Eukaryotic cells are known to utilize promoters, 
polyadenylation signals, and enhancers. 

Nucleic acid is "operably linked" when it is placed into a functional 
relationship with another nucleic acid sequence. For example, DNA for a presequence or 

30 secretory leader is operably linked to DNA for a polypeptide if it is expressed as a preprotein 
that participates in the secretion of the polypeptide; a promoter or enhancer is operably linked 
to a coding sequence if it affects the transcription of the sequence; or a ribosome binding site 
is operably linked to a coding sequence if it is positioned so as to facilitate translation. 
Generally, "operably linked" means that the DNA sequences being linked are contiguous, 
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and, in the case of a secretory leader, contiguous and in reading phase. However, enhancers 
do not have to be contiguous. Linking is typically accomplished by ligation at convenient 
restriction sites. If such sites do not exist, synthetic oligonucleotide adaptors or linkers are 
used in accordance with conventional practice. Transcriptional and translational regulatory 
5 nucleic acid will generally be appropriate to the host cell used to express the angiogenesis 
protein; for example, transcriptional and translational regulatory nucleic acid sequences from 
Bacillus are preferably used to express the angiogenesis protein in Bacillus. Numerous types 
of appropriate expression vectors, and suitable regulatory sequences are known in the art for 
a variety of host cells. 

1 0 In general, transcriptional and translational regulatory sequences may include, 

but are not limited to, promoter sequences, ribosomal binding sites, transcriptional start and 
stop sequences, translational start and stop sequences, and enhancer or activator sequences. 
In a preferred embodiment, the regulatory sequences include a promoter and transcriptional 
start and stop sequences. 

1 5 Promoter sequences encode either constitutive or inducible promoters. The 

promoters may be either naturally occurring promoters or hybrid promoters. Hybrid 
promoters, which combine elements of more than one promoter, are also known in the art, 
and are useful in the present invention. 

In addition, an expression vector may comprise additional elements. For 

20 example, the expression vector may have two replication systems, thus allowing it to be 

maintained in two organisms, for example in mammalian or insect cells for expression and in 
a procaryotic host for cloning and amplification. Furthermore, for integrating expression 
vectors, the expression vector contains at least one sequence homologous to the host cell 
genome, and preferably two homologous sequences which flank the expression construct. 

25 The integrating vector may be directed to a specific locus in the host cell by selecting the 
appropriate homologous sequence for inclusion in the vector. Constructs for integrating 
vectors are well known in the art (e.g., Fernandez & Hoeffler, supra). See also Kitamura, et 
al. (1995) PNAS 92:9146-9150. 

In addition, in a preferred embodiment, the expression vector contains a 

30 selectable marker gene to allow the selection of transformed host cells. Selection genes are 
well known in the art and will vary with the host cell used. 

The angiogenesis proteins of the present invention are produced by culturing a 
host cell transformed with an expression vector containing nucleic acid encoding an 
angiogenesis protein, under the appropriate conditions to induce or cause expression of the 
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angiogenesis protein. Conditions appropriate for angiogenesis protein expression will vary 
with the choice of the expression vector and the host cell, and will be easily ascertained by 
one skilled in the art through routine experimentation or optimization. For example, the use 
of constitutive promoters in the expression vector will require optimizing the growth and 
proliferation of the host cell, while the use of an inducible promoter requires the appropriate 
growth conditions for induction. In addition, in some embodiments, the timing of the harvest 
is important. For example, the baculo viral systems used in insect cell expression are lytic 
viruses, and thus harvest time selection can be crucial for product yield. 

Appropriate host cells include yeast, bacteria, archaebacteria, fungi, and insect 
and animal cells, including mammalian cells. Of particular interest are Saccharomyces 
cerevisiae and other yeasts, E. coli, Bacillus subtilis, Sf9 cells, CI 29 cells, 293 cells, 
Neurospora, BHK, CHO, COS, HeLa cells, HUVEC (human umbilical vein endothelial 
cells), THP1 cells (a macrophage cell line) and various other human cells and cell lines. 

In a preferred embodiment, the angiogenesis proteins are expressed in 
mammalian cells. Mammalian expression systems are also known in the art, and include 
retroviral and adenoviral systems. Of particular use as mammalian promoters are the 
promoters from mammalian viral genes, since the viral genes are often highly expressed and 
have a broad host range. Examples include the S V40 early promoter, mouse mammary tumor 
virus LTR promoter, adenovirus major late promoter, herpes simplex virus promoter, and the 
CMV promoter (see, e.g., Fernandez & Hoeffler, supra). Typically, transcription termination 
and polyadenylation sequences recognized by mammalian cells are regulatory regions located 
3' to the translation stop codon and thus, together with the promoter elements, flank the 
coding sequence. Examples of transcription terminator and polyadenlytion signals include 
those derived form SV40. 

The methods of introducing exogenous nucleic acid into mammalian hosts, as 
well as other hosts, is well known in the art, and will vary with the host cell used. 
Techniques include dextran-mediated transfection, calcium phosphate precipitation, 
polybrene mediated transfection, protoplast fusion, electroporation, viral infection, 

■ 

encapsulation of the polynucleotide(s) in liposomes, and direct microinjection of the DNA 
into nuclei. 

In a preferred embodiment, angiogenesis proteins are expressed in bacterial 
systems. Bacterial expression systems are well known in the art. Promoters from 
bacteriophage may also be used and are known in the art. In addition, synthetic promoters 
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and hybrid promoters are also useful; for example, the tac promoter is a hybrid of the trp and 
lac promoter sequences. Furthermore, a bacterial promoter can include naturally occurring 
promoters of non-bacterial origin that have the ability to bind bacterial RNA polymerase and 
initiate transcription. In addition to a functioning promoter sequence, an efficient ribosome 
5 binding site is desirable. The expression vector may also include a signal peptide sequence 
that provides for secretion of the angiogenesis protein in bacteria. The protein is either 
secreted into the growth media (gram-positive bacteria) or into the periplasmic space, located 
between the inner and outer membrane of the cell (gram-negative bacteria). The bacterial 
expression vector may also include a selectable marker gene to allow for the selection of 

10 bacterial strains that have been transformed. Suitable selection genes include genes which 
render the bacteria resistant to drugs such as ampicillin, chloramphenicol, erythromycin, 
kanamycin, neomycin and tetracycline. Selectable markers also include biosynthetic genes, 
such as those in the histidine, tryptophan and leucine biosynthetic pathways. These 
components are assembled into expression vectors. Expression vectors for bacteria are well 

15 known in the art, and include vectors for Bacillus subtilis, E. coli, Streptococcus cremoris, 
and Streptococcus lividans, among others {e.g., Fernandez & Hoeffler, supra). The bacterial 
expression vectors are transformed into bacterial host cells using techniques well known in 
the art, such as calcium chloride treatment, electroporation, and others. 

In one embodiment, angiogenesis proteins are produced in insect cells. 

20 Expression vectors for the transformation of insect cells, and in particular, baculovirus-based 
expression vectors, are well known in the art. 

In a preferred embodiment, angiogenesis protein is produced in yeast cells. 
Yeast expression systems are well known in the art, and include expression vectors for 
Saccharomyces cerevisiae, Candida albicans and C. maltosa, Hansenula polymorpha, 

25 Kluyveromyces fragilis and K. lactis, Pichia giiillerimondii and P. pastoris, 
Schizosaccharomyces pombe, and Yarroma lipolytica. 

The angiogenesis protein may also be made as a fusion protein, using 
techniques well known in the art. Thus, for example, for the creation of monoclonal 
antibodies, if the desired epitope is small, the angiogenesis protein may be fused to a carrier 

30 protein to form an immunogen. Alternatively, the angiogenesis protein may be made as a 
fusion protein to increase expression, or for other reasons. For example, when the 
angiogenesis protein is an angiogenesis peptide, the nucleic acid encoding the peptide may be 
linked to another nucleic acid for expression purposes. Fusion with detection epitope tags 
can be made, e.g., with FLAG, His 6, myc, HA, etc. 
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In one embodiment, the angiogenesis nucleic acids, proteins and antibodies of 
the invention are labeled. By "labeled" herein is meant that a compound has at least one 
element, isotope or chemical compound attached to enable the detection of the compound. In 
general, labels fall into three classes: a) isotopic labels, which may be radioactive or heavy 
5 isotopes; b) immune labels, which may be antibodies, antigens, or epitope tags and c) colored 
or fluorescent dyes. The labels may be incorporated into the angiogenesis nucleic acids, 
proteins and antibodies at any position. For example, the label should be capable of 
producing, either directly or indirectly, a detectable signal. The detectable moiety may be a 
radioisotope, such as 3 H, l4 C, 32 P, 35 S, or l25 I, a fluorescent or chemiluminescent compound, 

1 0 such as fluorescein isothiocyanate, rhodamine, or luciferin, or an enzyme, such as alkaline 
phosphatase, beta-galactosidase or horseradish peroxidase. Any method known in the art for 
conjugating the antibody to the label may be employed, including those methods described by 
Hunter et al, Nature, 144:945 (1962); David et al, Biochemistry, 13:1014 (1974); Pain et 
al., J. Immunol. Meth., 40:219 (1981); andNygren, J. Histochem. and Cytochem., 30:407 

15 (1982). 

Accordingly, the present invention also provides angiogenesis protein 
sequences. An angiogenesis protein of the present invention may be identified in several 
ways. "Protein" in this sense includes proteins, polypeptides, and peptides. As will be 
appreciated by those in the art, the nucleic acid sequences of the invention can be used to 

20 generate protein sequences. There are a variety of ways to do this, including cloning the 
entire gene and verifying its frame and amino acid sequence, or by comparing it to known 
sequences to search for homology to provide a frame, assuming the angiogenesis protein has 
an identifiable motif or homology to some protein in the database being used. Generally, the 
nucleic acid sequences are input into a program that will search all three frames for 

25 homology. This is done in a preferred embodiment using the following NCBI Advanced 

BLAST parameters. The program is blastx or blastn. The database is nr. The input data is as 
"Sequence in FASTA format". The organism list is "none". The "expect" is 10; the filter is 
default. The "descriptions" is 500, the "alignments" is 500, and the "alignment view" is 
pairwise. The "Query Genetic Codes" is standard (1). The matrix is BLOSUM62; gap 

30 existence cost is 1 1 , per residue gap cost is 1 ; and the lambda ratio is .85 default. This 
results in the generation of a putative protein sequence. 

Also included within one embodiment of angiogenesis proteins are amino acid 
variants of the naturally occurring sequences, as determined herein. Preferably, the variants 
are preferably greater than about 75% homologous to the wild-type sequence, more 
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preferably greater than about 80%, even more preferably greater than about 85% and most 
preferably greater than 90%. In some embodiments the homology will be as high as about 93 
to 95 or 98%, As for nucleic acids, homology in this context means sequence similarity or 
identity, with identity being preferred. This homology will be determined using standard 
5 techniques well known in the art as are outlined above for the nucleic acid homologies. 

Angiogenesis proteins of the present invention may be shorter or longer than 
the wild type amino acid sequences. Thus, in a preferred embodiment, included within the 
definition of angiogenesis proteins are portions or fragments of the wild type sequences, 
herein. In addition, as outlined above, the angiogenesis nucleic acids of the invention may be 

10 used to obtain additional coding regions, and thus additional protein sequence, using 
techniques known in the art. 

In a preferred embodiment, the angiogenesis proteins are derivative or variant 
angiogenesis proteins as compared to the wild-type sequence. That is, as outlined more fully 
below, the derivative angiogenesis peptide will often contain at least one amino acid 

15 substitution, deletion or insertion, with amino acid substitutions being particularly preferred. 
The amino acid substitution, insertion or deletion may occur at any residue within the 
angiogenesis peptide. 

Also included within one embodiment of angiogenesis proteins of the present 
invention are amino acid sequence variants. These variants typically fall into one or more of 

20 three classes: substitutional, insertional or deletional variants. These variants ordinarily are 
prepared by site specific mutagenesis of nucleotides in the DNA encoding the angiogenesis 
protein, using cassette or PCR mutagenesis or other techniques well known in the art, to 
produce DNA encoding the variant, and thereafter expressing the DNA in recombinant cell 
culture as outlined above. However, variant angiogenesis protein fragments having up to 

25 about 100-150 residues may be prepared by in vitro synthesis using established techniques. 
Amino acid sequence variants are characterized by the predetermined nature of the variation, 
a feature that sets them apart from naturally occurring allelic or interspecies variation of the 
angiogenesis protein amino acid sequence. The variants typically exhibit the same qualitative 
biological activity as the naturally occurring analogue, although variants can also be selected 

30 which have modified characteristics as will be more fiilly outlined below. 

While the site or region for introducing an amino acid sequence variation is 
predetermined, the mutation per se need not be predetermined. For example, in order to 
optimize the performance of a mutation at a given site, random mutagenesis may be 
conducted at the target codon or region and the expressed angiogenesis variants screened for 
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the optimal combination of desired activity. Techniques for making substitution mutations at 
predetermined sites in DNA having a known sequence are well known, for example, Ml 3 

* 

primer mutagenesis and PCR mutagenesis. Screening of the mutants is done using assays of 
angiogenesis protein activities. 

Amino acid substitutions are typically of single residues; insertions usually 
will be on the order of from about 1 to 20 amino acids, although considerably larger 
insertions may be tolerated. Deletions range from about 1 to about 20 residues, although in 
some cases deletions may be much larger. 

Substitutions, deletions, insertions or any combination thereof may be used to 
arrive at a final derivative. Generally these changes are done on a few amino acids to 
minimize the alteration of the molecule. However, larger changes may be tolerated in certain 
circumstances. When small alterations in the characteristics of the angiogenesis protein are 
desired, substitutions are generally made in accordance with the amino acid substitution chart 
provided in the definition section. 

Substantial changes in function or immunological identity are made by 
selecting substitutions that are less conservative than those provided in the definition of 
"conservative substitution". For example, substitutions may be made which more 
significantly affect: the structure of the polypeptide backbone in the area of the alteration, for 
example the alpha-helical or beta-sheet structure; the charge or hydrophobicity of the 
molecule at the target site; or the bulk of the side chain. The substitutions which in general 
are expected to produce the greatest changes in the polypeptide's properties are those in 
which (a) a hydrophilic residue, e.g. seryl or threonyl, is substituted for (or by) a hydrophobic 
residue, e.g. leucyl, isoleucyl, phenylalanyl, valyl or alanyl; (b) a cysteine or proline is 
substituted for (or by) any other residue; (c) a residue having an electropositive side chain, 
e.g. lysyl, arginyl, or histidyl, is substituted for (or by) an electronegative residue, e.g. 
glutamyl or aspartyl; or (d) a residue having a bulky side chain, e.g. phenylalanine, is 
substituted for (or by) one not having a side chain, e.g. glycine. 

The variants typically exhibit the same qualitative biological activity and will 
elicit the same immune response as the naturally-occurring analog, although variants also are 
selected to modify the characteristics of the angiogenesis proteins as needed. Alternatively, 
the variant may be designed such that the biological activity of the angiogenesis protein is 
altered. For example, glycosylation sites may be altered or removed. 

Covalent modifications of angiogenesis polypeptides are included within the 
scope of this invention. One type of covalent modification includes reacting targeted amino 
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acid residues of an angiogenesis polypeptide with an organic derivatizing agent that is 
capable of reacting with selected side chains or the N-or C-terminal residues of an 
angiogenesis. polypeptide. Derivatization with bifunctional agents is useful, for instance, for 
crosslinking angiogenesis polypeptides to a water-insoluble support matrix or surface for use 
5 in the method for purifying anti-angiogenesis polypeptide antibodies or screening assays, as 
is more fully described below. Commonly used crosslinking agents include, e.g., 1,1- 
bis(diazoacetyl)-2-phenylethane, glutaraldehyde, N-hydroxysuccinimide esters, for example, 
esters with 4-azidosalicylic acid, homobifunctional imidoesters, including disuccinimidyl 
esters such as 3,3'-dithiobis(succinimidylpropionate), bifunctional maleimides such as bis-N- 

10 maleimido-l,8-octane and agents such as methyl-3-[(p-azidophenyl)dithio]propioimidate. 

Other modifications include deamidation of glutaminyl and asparaginyl 
residues to the corresponding glutamyl and aspartyl residues, respectively, hydroxylation of 
proline and lysine, phosphorylation of hydroxyl groups of seryl, threonyl or tyrosyl residues, 
methylation of the y-amino groups of lysine, arginine, and histidine side chains [T.E. 

15 Creighton, Proteins: Structure and Molecular Properties, W.H. Freeman & Co., San 

Francisco, pp. 79-86 (1983)], acetylation of the N-terminal amine, and amidation of any C- 
terminal carboxyl group. 

Another type of covalent modification of the angiogenesis polypeptide 
included within the scope of this invention comprises altering the native glycosylation pattern 

20 of the polypeptide. "Altering the native glycosylation pattern" is intended for purposes herein 
to mean deleting one or more carbohydrate moieties found in native sequence angiogenesis 
polypeptide, and/or adding one or more glycosylation sites that are not present in the native 
sequence angiogenesis polypeptide. Glycosylation patterns can be altered in many ways. For 
example the use of different cell types to express angiogenesis-associated sequences can 

25 result in different glycosylation patterns. 

Addition of glycosylation sites to angiogenesis polypeptides may also be 
accomplished by altering the amino acid sequence thereof. The alteration may be made, for 
example, by the addition of, or substitution by, one or more serine or threonine residues to the 
native sequence angiogenesis polypeptide (for O-linked glycosylation sites). The 

30 angiogenesis amino acid sequence may optionally be altered through changes at the DNA 

level, particularly by mutating the DNA encoding the angiogenesis polypeptide at preselected 
bases such that codons are generated that will translate into the desired amino acids. 
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Another means of increasing the number of carbohydrate moieties on the 
angiogenesis polypeptide is by chemical or enzymatic coupling of glycosides to the 
polypeptide. Such methods are described in the art, e.g., in WO 87/05330 published 1 1 
September 1987, and in Aplin and Wriston, CRC Crit. Rev. Biochem., pp. 259-306 (1981). 
5 Removal of carbohydrate moieties present on the angiogenesis polypeptide 

may be accomplished chemically or enzymatically or by mutational substitution of codons 
encoding for amino acid residues that serve as targets for glycosylation. Chemical 
deglycosylation techniques are known in the art and described, for instance, by Hakimuddin, 
et al., Arch. Biochem. Biophys., 259:52 (1987) and by Edge et al., Anal. Biochem., 118:131 

10 (1981). Enzymatic cleavage of carbohydrate moieties on polypeptides can be achieved by the 
use of a variety of endo-and exo-glycosidases as described by Thotakura et al., Meth. 
Enzymol., 138:350(1987). 

Another type of covalent modification of angiogenesis comprises linking the 
angiogenesis polypeptide to one of a variety of nonproteinaceous polymers, e.g., 

1 5 polyethylene glycol, polypropylene glycol, or polyoxyalkylenes, in the manner set forth in 
U.S. Patent Nos. 4,640,835; 4,496,689; 4,301,144; 4,670,417; 4,791,192 or 4,179,337. 

Angiogenesis polypeptides of the present invention may also be modified in a 
way to form chimeric molecules comprising an angiogenesis polypeptide fused to another, 
heterologous polypeptide or amino acid sequence. In one embodiment, such a chimeric 

20 molecule comprises a fusion of an angiogenesis polypeptide with a tag polypeptide which 
provides an epitope to which an anti-tag antibody can selectively bind. The epitope tag is 
generally placed at the amino-or carboxyl-terminus of the angiogenesis polypeptide. The 
presence of such epitope-tagged forms of an angiogenesis polypeptide can be detected using 
an antibody against the tag polypeptide. Also, provision of the epitope tag enables the 

25 angiogenesis polypeptide to be readily purified by affinity purification using an anti-tag 
antibody or another type of affinity matrix that binds to the epitope tag. In an alternative 
embodiment, the chimeric molecule may comprise a fusion of an angiogenesis polypeptide 
with an immunoglobulin or a particular region of an immunoglobulin. For a bivalent form of 
the chimeric molecule, such a fusion could be to the Fc region of an IgG molecule. 

30 Various tag polypeptides and their respective antibodies are well known in the 

art. Examples include poly-histidine (poly-his) or poly-histidine-glycine (poly-his-gly) tags; 
HIS6 and metal chelation tags, the flu HA tag polypeptide and its antibody 12CA5 [Field et 
al, Mol Cell Biol, 8:2159-2165 (1988)]; the c-myc tag and the 8F9, 3C7, 6E10, G4, B7 and 
9E10 antibodies thereto [Evan et al, Molecular and Cellular Biology, 5:3610-3616 (1985)]; 
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and the Herpes Simplex virus glycoprotein D (gD) tag and its antibody [Paborsky et aL, 
Protein Engineering, 3(6):547-553 (1990)]. Other tag polypeptides include the Flag-peptide 
[Hopp et aL, BioTechnology, 6:1204-1210 (1988)]; the KT3 epitope peptide [Martin et aL, 
Science, 255:192-194 (1992)]; tubulin epitope peptide [Skinner et aL, 1 BioL Chem., 
5 266:15163-15166 (1991)]; and the T7 gene 10 protein peptide tag [Lutz-Freyermuth et aL, 
Proc. Natl. Acad. Sci. USA, 87:6393-6397 (1990)]. 

Also included with an embodiment of angiogenesis protein are other 
angiogenesis proteins of the angiogenesis family, and angiogenesis proteins from other 
organisms, which are cloned and expressed as outlined below. Thus, probe or degenerate 

10 polymerase chain reaction (PCR) primer sequences may be used to find other related 

angiogenesis proteins from humans or other organisms. As will be appreciated by those in 
the art, particularly useful probe and/or PCR primer sequences include the unique areas of the 
angiogenesis nucleic acid sequence. As is generally known in the art, preferred PCR primers 
are from about 15 to about 35 nucleotides in length, with from about 20 to about 30 being 

15 preferred, and may contain inosine as needed. The conditions for the PCR reaction are well 
known in the art (e.g. , Innis, PCR Protocols, sapra). 

In addition, as is outlined herein, angiogenesis proteins can be made that are 
longer than those encoded by the nucleic acids of the figures, e.g., by the elucidation of 
extended sequences, the addition of epitope or purification tags, the addition of other fusion 

20 sequences, etc. 

Angiogenesis proteins may also be identified as being encoded by 
angiogenesis nucleic acids. Thus, angiogenesis proteins are encoded by nucleic acids that 
will hybridize to the sequences of the sequence listings, or their complements, as outlined 
herein. 

25 In a preferred embodiment, when the angiogenesis protein is to be used to 

generate antibodies, e.g., for immunotherapy or immunodiagnosis, the angiogenesis protein 
should share at least one epitope or determinant with the full length protein. By "epitope" or 
"determinant" herein is typically meant a portion of a protein which will generate and/or bind 
an antibody or T-cell receptor in the context of MHC. Thus, in most instances, antibodies 

30 made to a smaller angiogenesis protein will be able to bind to the full-length protein, 
particularly linear epitopes. In a preferred embodiment, the epitope is unique; that is, 
antibodies generated to a unique epitope show little or no cross-reactivity. In a preferred 
embodiment, the epitope is selected from a protein sequence set out in Table 8. 
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Methods of preparing polyclonal antibodies are known to the skilled artisan 
{e.g., Coligan, supra; and Harlow & Lane, supra). Polyclonal antibodies can be raised in a 
mammal, e.g., by one or more injections of an immunizing agent and, if desired, an adjuvant. 
Typically, the immunizing agent and/or adjuvant will be injected in the mammal by multiple 
5 subcutaneous or intraperitoneal injections. The immunizing agent may include a protein 
encoded by a nucleic acid of the figures or fragment thereof or a fusion protein thereof. It 
may be useful to conjugate the immunizing agent to a protein known to be immunogenic in 
the mammal being immunized. Examples of such immunogenic proteins include but are not 
limited to keyhole limpet hemocyanin, serum albumin, bovine thyro globulin, and soybean 

10 trypsin inhibitor. Examples of adjuvants which may be employed include Freund's complete 
adjuvant and MPL-TDM adjuvant (monophosphoryl Lipid A, synthetic trehalose 
dicorynomycolate). The immunization protocol may be selected by one skilled in the art 
without undue experimentation. 

The antibodies may, alternatively, be monoclonal antibodies. Monoclonal 

1 5 antibodies may be prepared using hybridoma methods, such as those described by Kohler and 
Milstein, Nature, 256:495 (1975). In a hybridoma method, a mouse, hamster, or other 
appropriate host animal, is typically immunized with an immunizing agent to elicit 
lymphocytes that produce or are capable of producing antibodies that will specifically bind to 
the immunizing agent. Alternatively, the lymphocytes may be immunized in vitro. The 

20 immunizing agent will typically include a polypeptide encoded by a nucleic acid of Tables 1- 
8 , or fragment thereof, or a fusion protein thereof. Generally, either peripheral blood 
lymphocytes ("PBLs") are used if cells of human origin are desired, or spleen cells or lymph 
node cells are used if non-human mammalian sources are desired. The lymphocytes are then 
fused with an immortalized cell line using a suitable fusing agent, such as polyethylene 

25 glycol, to form a hybridoma cell [Goding, Monoclonal Antibodies: Principles and Practice, 
Academic Press, (1986) pp. 59-103]. Immortalized cell lines are usually transformed 
mammalian cells, particularly myeloma cells of rodent, bovine and human origin. Usually, 
rat or mouse myeloma cell lines are employed. The hybridoma cells may be cultured in a 
suitable culture medium that preferably contains one or more substances that inhibit the 

30 growth or survival of the unfused, immortalized cells. For example, if the parental cells lack 
the enzyme hypoxanthine guanine phosphoribosyl transferase (HGPRT or HPRT), the culture 
medium for the hybridomas typically will include hypoxanthine, aminopterin, and thymidine 
("HAT medium"), which substances prevent the growth of HGPRT-deficient cells. 
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In one embodiment, the antibodies are bispecific antibodies. Bispecific 
antibodies are monoclonal, preferably human or humanized, antibodies that have binding 
specificities for at least two different antigens or that have binding specificities for two 
epitopes on the same antigen. In one embodiment, one of the binding specificities is for a 
5 protein encoded by a nucleic acid Tables 1-8 or a fragment thereof, the other one is for any 
other antigen, and preferably for a cell-surface protein or receptor or receptor subunit, 
preferably one that is tumor specific. Alternatively, tetramer-type technology may create 
multivalent reagents. 

In a preferred embodiment, the antibodies to angiogenesis protein are capable 

10 of reducing or eliminating a biological function of an angiogenesis protein, as is described 
below. That is, the addition of anti-angiogenesis protein antibodies (either polyclonal or 
preferably monoclonal) to angiogenic tissue (or cells containing angiogenesis) may reduce or 
eliminate the angiogenesis activity. Generally, at least a 25% decrease in activity is 
preferred, with at least about 50% being particularly preferred and about a 95-100% decrease 

1 5 being especially preferred. 

In a preferred embodiment the antibodies to the angiogenesis proteins are 
humanized antibodies (e.g., Xenerex Biosciences, Mederex, Inc., Abgenix, Inc., Protein 
Design Labs,Inc.) Humanized forms of non-human (e.g., murine) antibodies are chimeric 
molecules of immunoglobulins, immunoglobulin chains or fragments thereof (such as Fv, 

20 Fab, Fab', F(ab f )2 or other antigen-binding subsequences of antibodies) which contain 

minimal sequence derived from non-human immunoglobulin. Humanized antibodies include 
human immunoglobulins (recipient antibody) in which residues form a complementary 
determining region (CDR) of the recipient are replaced by residues from a CDR of a non- 
human species (donor antibody) such as mouse, rat or rabbit having the desired specificity, 

25 affinity and capacity. In some instances, Fv framework residues of the human 

immunoglobulin are replaced by corresponding non-human residues. Humanized antibodies 
may also comprise residues which are found neither in the recipient antibody nor in the 
imported CDR or framework sequences. In general, a humanized antibody will comprise 
substantially all of at least one, and typically two, variable domains, in which all or 

30 substantially all of the CDR regions correspond to those of a non-human immunoglobulin 
and all or substantially all of the framework (FR) regions are those of a human 
immunoglobulin consensus sequence. The humanized antibody optimally also will comprise 
at least a portion of an immunoglobulin constant region (Fc), typically that of a human 
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immunoglobulin [Jones et al, Nature, 321:522-525 (1986); Riechmann et al., Nature, 
332:323-329 (1988); andPresta, Curr. Op. Struct. Biol., 2:593-596 (1992)]. 

Methods for humanizing non-human antibodies are well known in the art. 
Generally, a humanized antibody has one or more amino acid residues introduced into it from 
5 a source which is non-human. These non-human amino acid residues are often referred to as 
import residues, which are typically taken from an import variable domain. Humanization 
can be essentially performed following the method of Winter and co-workers [Jones et al., 
Nature, 321:522-525 (1986); Riechmann et al., Nature, 332:323-327 (1988); Verhoeyen et 
al., Science, 239:1534-1536 (1988)], by substituting rodent CDRs or CDR sequences for the 

10 corresponding sequences of a human antibody. Accordingly, such humanized antibodies are 
chimeric antibodies (U.S. Patent No. 4,816,567), wherein substantially less than an intact 
human variable domain has been substituted by the corresponding sequence from a non- 
human species. In practice, humanized antibodies are typically human antibodies in which 
some CDR residues and possibly some FR residues are substituted by residues from 

15 analogous sites in rodent antibodies. 

Human antibodies can also be produced using various techniques known in the 
art, including phage display libraries [Hoogenboom and Winter, J. Mol. Biol., 227:381 
(1991); Marks et al, J. Mol. Biol., 222:581 (1991)]. The techniques of Cole et al. and 
Boerner et al. are also available for the preparation of human monoclonal antibodies (Cole et 

20 al., Monoclonal Antibodies and Cancer Therapy, Alan R. Liss, p. 77 (1985) and Boerner et 
al., J. Immunol, 147(l):S6-95 (1991)]. Similarly, human antibodies can be made by 
introducing of human immunoglobulin loci into transgenic animals, e.g., mice in which the 
endogenous immunoglobulin genes have been partially or completely inactivated. Upon 

* 

challenge, human antibody production is observed, which closely resembles that seen in 
25 humans in all respects, including gene rearrangement, assembly, and antibody repertoire. 

This approach is described, for example, in U.S. Patent Nos. 5,545,807; 5,545,806; 

5,569,825; 5,625,126; 5,633,425; 5,661,016, and in the following scientific publications: 

Marks et al., Bio/Technology 10, 779-783 (1992); Lonberg et al., Nature 368 856-859 (1994); 

Morrison, Nature 368, 812-13 (1994); Fishwild et al., Nature Biotechnology 14, 845-51 
30 (1996); Neuberger, Nature Biotechnology 14, 826 (1996); Lonberg and Huszar, Intern. Rev. 

Immunol. 13 65-93 (1995). 

By immunotherapy is meant treatment of angiogenesis with an antibody raised 

against angiogenesis proteins. As used herein, immunotherapy can be passive or active. 

Passive immunotherapy as defined herein is the passive transfer of antibody to a recipient 
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(patient). Active immunization is the induction of antibody and/or T-cell responses in a 
recipient (patient). Induction of an immune response is the result of providing the recipient 
with an antigen to which antibodies are raised. As appreciated by one of ordinary skill in the 
art, the antigen may be provided by injecting a polypeptide against which antibodies are 
5 desired to be raised into a recipient, or contacting the recipient with a nucleic acid capable of 
expressing the antigen and under conditions for expression of the antigen, leading to an 
immune response. 

In a preferred embodiment the angiogenesis proteins against which antibodies 
are raised are secreted proteins as described above. Without being bound by theory, 

10 antibodies used for treatment, bind and prevent the secreted protein from binding to its 
receptor, thereby inactivating the secreted angiogenesis protein. 

In another preferred embodiment, the angiogenesis protein to which antibodies 
are raised is a transmembrane protein. Without being bound by theory, antibodies used for 
treatment, bind the extracellular domain of the angiogenesis protein and prevent it from 

15 binding to other proteins, such as circulating ligands or cell-associated molecules. The 

antibody may cause down-regulation of the transmembrane angiogenesis protein. As will be 
appreciated by one of ordinary skill in the art, the antibody may be a competitive, non- 
competitive or uncompetitive inhibitor of protein binding to the extracellular domain of the 
angiogenesis protein. The antibody is also an antagonist of the angiogenesis protein. 

20 Further, the antibody prevents activation of the transmembrane angiogenesis protein. In one 
aspect, when the antibody prevents the binding of other molecules to the angiogenesis 
protein, the antibody prevents growth of the cell. The antibody may also be used to target or 
sensitize the cell to cytotoxic agents, including, but not limited to TNF-ot, TNF-p, IL-1, INF-y 
and IL-2, or chemotherapeutic agents including 5FU, vinblastine, actinomycin D, cisplatin, 

25 methotrexate, and the like. In some instances the antibody belongs to a sub-type that 
activates serum complement when complexed with the transmembrane protein thereby 
mediating cytotoxicity or antigen-dependent cytotoxicity (ADCC). Thus, angiogenesis is 
treated by administering to a patient antibodies directed against the transmembrane 
angiogenesis protein. Antibody-labeling may activate a co-toxin, localize a toxin payload, or 

30 otherwise provide means to locally ablate cells. 

In another preferred embodiment, the antibody is conjugated or fused to an 
effector moiety. The effector moiety can be any number of molecules, including labelling 
moieties such as radioactive labels or fluorescent labels, or can be a therapeutic moiety. In 
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one aspect the therapeutic moiety is a small molecule that modulates the activity of the 
angiogenesis protein. In another aspect the therapeutic moiety modulates the activity of 
molecules associated with or in close proximity to the angiogenesis protein. The therapeutic 
moiety may inhibit enzymatic activity such as protease or collagenase activity associated with 
5 angiogenesis, or be an attractant of other cells, such as NK cells. 

In a preferred embodiment, the therapeutic moiety can also be a cytotoxic 
agent. In this method, targeting the cytotoxic agent to angiogenesis tissue or cells, results in a 
reduction in the number of afflicted cells, thereby reducing symptoms associated with 
angiogenesis. Cytotoxic agents are numerous and varied and include, but are not limited to, 

10 cytotoxic drugs or toxins or active fragments of such toxins. Suitable toxins and their 

corresponding fragments include diphtheria A chain, exotoxin A chain, ricin A chain, abrin A 
chain, curcin, crotin, phenomycin, enomycin and the like. Cytotoxic agents also include 
radiochemicals made by conjugating radioisotopes to antibodies raised against angiogenesis 
proteins, or binding of a radionuclide to a chelating agent that has been covalently attached to 

15 the antibody. Targeting the therapeutic moiety to transmembrane angiogenesis proteins not 
only serves to increase the local concentration of therapeutic moiety in the angiogenesis 
afflicted area, but also serves to reduce deleterious side effects that may be associated with 
the therapeutic moiety. 

hi another preferred embodiment, the angiogenesis protein against which the 

20 antibodies are raised is an intracellular protein. In this case, the antibody may be conjugated 
or fused to a protein which facilitates entry into the cell. In one case, the antibody enters the 
cell by endocytosis. In another embodiment, a nucleic acid encoding the antibody is 
administered to the individual or cell. Moreover, wherein the angiogenesis protein can be 
targeted within a cell, i.e., the nucleus, an antibody thereto contains a signal for that target 

25 localization, i.e., a nuclear localization signal. 

The angiogenesis antibodies of the invention specifically bind to angiogenesis 
proteins. By "specifically bind" herein is meant that the antibodies bind to the protein with a 
Kd of at least about 0.1 mM, more usually at least about 1 pM, preferably at least about 0.1 
pM or better, and most preferably, 0.01 pM or better. Selectivity of binding is also 

30 important. 

In a preferred embodiment, the angiogenesis protein is purified or isolated 
after expression. Angiogenesis proteins may be isolated or purified in a variety of ways 
known to those skilled in the art depending on what other components are present in the 
sample. Standard purification methods include electrophoretic, molecular, immunological 
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and chromatographic techniques, including ion exchange, hydrophobic, affinity, and reverse- 
phase HPLC chromatography, and chromatofocusing. For example, the angiogenesis protein 
may be purified using a standard anti-angiogenesis protein antibody column. Ultrafiltration 
and diafiltration techniques, in conjunction with protein concentration, are also useful. For 
5 general guidance in suitable purification techniques, see Scopes, R., Protein Purification, 
Springer- Verlag, NY (1982). The degree of purification necessary will vary depending on 
the use of the angiogenesis protein. In some instances no purification will be necessary. 

Once expressed and purified if necessary, the angiogenesis proteins and 

i m 

nucleic acids are useful in a number of applications. They may be used as immunoselection 
1 0 reagents, as vaccine reagents, as screening agents, etc. 

Detection of angiogenesis sequence for diagnostic and therapeutic applications 

In one aspect, the RNAexpression levels of genes are determined for different 
cellular states in the angiogenesis phenotype. Expression levels of genes in normal tissue 

15 (i.e., not undergoing angiogenesis) and in angiogenesis tissue (and in some cases, for varying 
severities of angiogenesis that relate to prognosis, as outlined below) are evaluated to provide 
expression profiles. An expression profile of a particular cell state or point of development is 
essentially a "fingerprint" of the state. While two states may have any particular gene 
similarly expressed, the evaluation of a number of genes simultaneously allows the 

20 generation of a gene expression profile that is reflective of the state of the cell. By comparing 
expression profiles of cells in different states, information regarding which genes are 
important (including both up- and down-regulation of genes) in each of these states is 
obtained. Then, diagnosis may be performed or confirmed to determine whether a tissue 
sample has the gene expression profile of normal or angiogenesic tissue. This will provide 

25 for molecular diagnosis of related conditions. 

"Differential expression," or grammatical equivalents as used herein, refers to 
qualitative or quantitative differences in the temporal and/or cellular gene expression 
patterns within and among cells and tissue. Thus, a differentially expressed gene can 
qualitatively have its expression altered, including an activation or inactivation, in, e.g. , 

30 normal versus angiogenic tissue. Genes may be turned on or turned off in a particular state, 
relative to another state thus permitting comparison of two or more statese. A qualitatively 
regulated gene will exhibit an expression pattern within a state or cell type which is 
detectable by standard techniques. Some genes will be expressed in one state or cell type, but 
not in both. Alternatively, the difference in expression may be quantitative, e.g., in that 
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expression is increased or decreased; i.e., gene expression is either upregulated, resulting in 
an increased amount of transcript, or downregulated, resulting in a decreased amount of 
transcript. The degree to which expression differs need only be large enough to quantify via 
standard characterization techniques as outlined below, such as by use of Affymetrix 
5 GeneChip™ expression arrays, Lockhart, Nature Biotechnology, 14:1675-1680 (1996), 

hereby expressly incorporated by reference. Other techniques include, but are not limited to, 
quantitative reverse transcriptase PCR, Northern analysis and RNase protection. As outlined 
above, preferably the change in expression (i.e., upregulation or downregulation) is at least 
about 50%, more preferably at least about 100%, more preferably at least about 150%, more 

10 preferably at least about 200%, with from 300 to at least 1000% being especially preferred. 

Evaluation may be at the gene transcript, or the protein level. The amount of 
gene expression may be monitored using nucleic acid probes to the DNA or RNA equivalent 
of the gene transcript, and the quantification of gene expression levels, or, alternatively, the 
final gene product itself (protein) can be monitored, e.g. , with antibodies to the angiogenesis 

15 protein and standard immunoassays (ELISAs, etc.) or other techniques, including mass 
spectroscopy assays, 2D gel electrophoresis assays, etc. Proteins corresponding to 
angiogenesis genes, i.e., those identified as being important in an angiogenesis phenotype, 
can be evaluated in an angiogenesis diagnostic test. 

In a preferred embodiment, gene expression monitoring is performed 

20 simultaneously on a number of genes. Multiple protein expression monitoring can be 

performed as well. Similarly, these assays may be performed on an individual basis as well. 

In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
particular cell. The assays are further described below in the example. PCR techniques can 

25 be used to provide greater sensitivity. 

In a preferred embodiment nucleic acids encoding the angiogenesis protein are 
detected. Although DNA or RNA encoding the angiogenesis protein may be detected, of 
particular interest are methods wherein an mRNA encoding an angiogenesis protein is 
detected. Probes to detect mRNA can be a nucleotide/deoxynucleotide probe that is 

30 complementary to and hybridizes with the mRNA and includes, but is not limited to, 

oligonucleotides, cDNA or RNA. Probes also should contain a detectable label, as defined 
herein. In one method the mRNA is detected after immobilizing the nucleic acid to be 
examined on a solid support such as nylon membranes and hybridizing the probe with the 
sample. Following washing to remove the non-specifically bound probe, the label is 
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detected. In another method detection of the mRNA is performed in situ. In this method 
permeabilized cells or tissue samples are contacted with a detectably labeled nucleic acid 
probe for sufficient time to allow the probe to hybridize with the target mRNA. Following 
washing to remove the non-specifically bound probe, the label is detected. For example a 
5 digoxygenin labeled riboprobe (RNA probe) that is complementary to the mRNA encoding 
' an angiogenesis protein is detected by binding the digoxygenin with an anti-digoxygenin 
secondary antibody and developed with nitro blue tetrazolium and 5-bromo-4-chloro-3- 
indoyl phosphate. 

In a preferred embodiment, various proteins from the three classes of proteins 
10 as described herein (secreted, transmembrane or intracellular proteins) are used in diagnostic 
assays. The angiogenesis proteins, antibodies, nucleic acids, modified proteins and cells 
containing angiogenesis sequences are used in diagnostic assays. This can be performed on 
an individual gene or corresponding polypeptide level. In a preferred embodiment, the 
expression profiles are used, preferably in conjunction with high throughput screening 
1 5 techniques to allow monitoring for expression profile genes and/or corresponding 
polypeptides. 

As described and defined herein, angiogenesis proteins, including 
intracellular, transmembrane or secreted proteins, find use as markers of angiogenesis. 
Detection of these proteins in putative angiogenesis tissue allows for detection or diagnosis of 

20 angiogenesis. In one embodiment, antibodies are used to detect angiogenesis proteins. A 
preferred method separates proteins from a sample by electrophoresis on a gel (typically a 
denaturing and reducing protein gel, but may be another type of gel, including isoelectric 
focusing gels and the like). Following separation of proteins, the angiogenesis protein is 
detected, e.g., by immunoblotting with antibodies raised against the angiogenesis protein. 

25 Methods of immunoblotting are well known to those of ordinary skill in the art. 

In another preferred method, antibodies to the angiogenesis protein find use in 
in situ imaging techniques, e.g., in histology (e.g., Methods in Cell Biology: Antibodies in 
Cell Biology, volume 37 (Asai, ed. 1993)). In this method cells are contacted with from one 
to many antibodies to the angiogenesis protein(s). Following washing to remove non-specific 

30 antibody binding, the presence of the antibody or antibodies is detected. In one embodiment 
the antibody is detected by incubating with a secondary antibody that contains a detectable 
label. In another method the primary antibody to the angiogenesis protein(s) contains a 
detectable label, for example an enzyme marker that can act on a substrate. In another 
preferred embodiment each one of multiple primary antibodies contains a distinct and 
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detectable label. This method finds particular use in simultaneous screening for a plurality of 
angiogenesis proteins. As will be appreciated by one of ordinary skill in the art, many other 
histological imaging techniques are alsoprovided by the invention. 

In a preferred embodiment the label is detected in a fluorometer which has the 
5 ability to detect and distinguish emissions of different wavelengths. In addition, a 
fluorescence activated cell sorter (FACS) can be used in the method. 

In another preferred embodiment, antibodies find use in diagnosing 
angiogenesis from biological samples, such as blood, urine, sputum, or other bodily fluids. 
As previously described, certain angiogenesis proteins are secreted/circulating molecules. 

10 Blood samples, therefore, are useful as samples to be probed or tested for the presence of 
secreted angiogenesis proteins. Antibodies can be used to detect an angiogenesis protein by 
previously described immunoassay techniques including ELIS A, immunoblotting (W estern 
blotting), immunoprecipitation, BIACORE technology and the like. Conversely, the presence 
of antibodies may indicate an immune response against an endogenous angiogenesis protein. 

15 In a preferred embodiment, in situ hybridization of labeled angiogenesis 

nucleic acid probes to tissue arrays is done. For example, arrays of tissue samples, including 
angiogenesis tissue and/or normal tissue, are made. In situ hybridization (see, e.g., Ausubel, 
supra) is then performed. When comparing the fingerprints between an individual and a 
standard, the skilled artisan can make a diagnosis, a prognosis, or a prediction based on the 

20 findings. It is further understood that the genes which indicate the diagnosis may differ from 
those which indicate the prognosis and molecular profiling of the condition of the cells may 
lead to distinctions between responsive or refractory conditions or may be predictive of 
outcomes. 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic 
25 acids, modified proteins and cells containing angiogenesis sequences are used in prognosis 
assays. As above, gene expression profiles can be generated that correlate to angiogenesis 
severity, in terms of long term prognosis. Again, this may be done on either a protein or gene 
level, with the use of genes being preferred. As above, angiogenesis probes may be attached 
to biochips for the detection and quantification of angiogenesis sequences in a tissue or 

■ 

30 patient. The assays proceed as outlined above for diagnosis. PCR method may provide more 
sensitive and accurate quantification. 

In a preferred embodiment members of the three classes of proteins as 
described herein are used in drug screening assays. The angiogenesis proteins, antibodies, 
nucleic acids, modified proteins and cells containing angiogenesis sequences are used in drug 
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screening assays or by evaluating the effect of drug candidates on a "gene expression profile" 
or expression profile of polypeptides. In a preferred embodiment, the expression profiles are 
used, preferably in conjunction with high throughput screening techniques to allow 
monitoring for expression profile genes after treatment with a candidate agent (e.g., 
5 Zlokarnik, et al., Science 279, 84-8 (1 998); Heid, Genome Res 6:986-94, 1 996). 

In a preferred embodiment, the angiogenesis proteins, antibodies, nucleic 
acids, modified proteins and cells containing the native or modified angiogenesis proteins are 
used in screening assays. That is, the present invention provides novel methods for screening 
for compositions which modulate the angiogenesis phenotype or an identified physiological 

10 function of an angiogenesis protein. As above, this can be done on an individual gene level 
or by evaluating the effect of drug candidates on a "gene expression profile". In a preferred 
embodiment, the expression profiles are used, preferably in conjunction with high throughput 
screening techniques to allow monitoring for expression profile genes after treatment with a 
candidate agent, see Zlokarnik, supra, 

1 5 Having identified the differentially expressed genes herein, a variety of assays 

may be executed. In a preferred embodiment, assays may be run on an individual gene or 
protein level That is, having identified a particular gene as up regulated in angiogenesis, test 
compounds can be screened for the ability to modulate gene expression or for binding to the 
angiogenic protein. "Modulation" thus includes both an increase and a decrease in gene 

20 expression. The preferred amount of modulation will depend on the original change of the 
gene expression in normal versus tissue undergoing angiogenesis, with changes of at least 
10%, preferably 50%, more preferably 100-300%, and in some embodiments 300-1000% or 
greater. Thus, if a gene exhibits a 4-fold increase in angiogenic tissue compared to normal 
tissue, a decrease of about four-fold is often desired; similarly, a 10-fold decrease in 

25 angiogenic tissue compared to normal tissue often provides a target value of a 10- fold 
increase in expression to be induced by the test compound. 

The amount of gene expression may be monitored using nucleic acid probes 
and the quantification of gene expression levels, or, alternatively, the gene product itself can 
be monitored, e.g., through the use of antibodies to the angiogenesis protein and standard 

30 immunoassays. Proteomics and separation techniques may also allow quantification of 
expression. 

In a preferred embodiment, gene expression or protein monitoring of a number 
of entitites, i.e., an expression profile, is monitored simultaneously. Such profiles will 
typically invove a plurality of those entitites described herein.. 
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In this embodiment, the angiogenesis nucleic acid probes are attached to 
biochips as outlined herein for the detection and quantification of angiogenesis sequences in a 
particular cell. Alternatively, PCR may be used. Thus, a series, e.g., of microtiter plate, may 
be used with dispensed primers in desired wells. A PCR reaction can then be performed and 
5 analyzed for each well. 



Modulators of angiogenesis 

Expression monitoring can be performed to identify compounds that modify 
the expression of one or more angiogenesis-associated sequences, e.g., a polynucleotide 
10 sequence set out in Tables 1-8 . Generally, in a preferred embodiment, a test modulator is 
added to the cells prior to analysis. Moreover, screens are also provided to identify agents 
that modulate angiogenesis, modulate angiogenesis proteins, bind to an angiogenesis protein, 
or interfere with the binding of an angiogenesis protein and an antibody or other binding 
partner. 

15 The term "test compound" or "drug candidate" or "modulator" or grammatical 

equivalents as used herein describes any molecule, e.g., protein, oligopeptide, small organic 
molecule, polysaccharide, polynucleotide, etc., to be tested for the capacity to directly or 
indirectly alter the angiogenesis phenotype or the expression of an angiogenesis sequence, 
e.g. , a nucleic acid or protein sequence. In preferred embodiments, modulators alter 

20 expression profiles, or expression profile nucleic acids or proteins provided herein. In one 
embodiment, the modulator suppresses an angiogenesis phenotype, for example to a normal 
tissue fingerprint. In another embodiment, a modulator induced an angiogenesis phenotype. 
Generally, a plurality of assay mixtures are run in parallel with different agent concentrations 
to obtain a differential response to the various concentrations. Typically, one of these 

25 concentrations serves as a negative control, i.e., at zero concentration or below the level of 
detection. 

In one aspect, a modulator will neutralize the effect of an angiogenesis protein. 
By "neutralize" is meant that activity of a protein is inhibited or blocked and thereby has 
substantially no effect on a cell. 
30 In certain embodiments, combinatorial libraries of potential modulators will be 

screened for an ability to bind to an angiogenesis polypeptide or to modulate activity. 
Conventionally, new chemical entities with useful properties are generated by identifying a 
chemical compound (called a "lead compound") with some desirable property or activity, 
e.g., inhibiting activity, creating variants of the lead compound, and evaluating the property 
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and activity of those variant compounds. Often, high throughput screening (HTS) methods 
are employed for such an analysis. 

In one preferred embodiment, high throughput screening methods involve 
providing a library containing a large number of potential therapeutic compounds (candidate 
5 compounds). Such "combinatorial chemical libraries" are then screened in one or more 
assays to identify those library members (particular chemical species or subclasses) that 
display a desired characteristic activity. The compounds thus identified can serve as 
conventional "lead compounds" or can themselves be used as potential or actual therapeutics. 

A combinatorial chemical library is a collection of diverse chemical 

10 compounds generated by either chemical synthesis or biological synthesis by combining a 
number of chemical "building blocks" such as reagents. For example, a linear combinatorial 
chemical library, such as a polypeptide (e.g., mutein) library, is formed by combining a set of 
chemical building blocks called amino acids in every possible way for a given compound 
length (i.e., the number of amino acids in a polypeptide compound). Millions of chemical 

1 5 compounds can be synthesized through such combinatorial mixing of chemical building 
blocks (Gallop etal (1994)/. Med. Chem. 37(9): 1233-1251). 

Preparation and screening of combinatorial chemical libraries is well known to 
those of skill in the art. Such combinatorial chemical libraries include, but are not limited to, 
peptide libraries (see, e.g., U.S. Patent No. 5,010,175, Furka (1991) Int. J. Pept. Prot. Res., 

20 37: 487-493, Houghton et al (1991) Nature, 354: 84-88), peptoids (PCT Publication No WO 
91/19735, 26 Dec. 1991), encoded peptides (PCT Publication WO 93/20242, 14 Oct. 1993), 
random bio-oligomers (PCT Publication WO 92/00091, 9 Jan. 1992), benzodiazepines (U.S. 
Pat. No. 5,288,514), diversomers such as hydantoins, benzodiazepines and dipeptides (Hobbs 
etal, (1993) Proc. Nat. Acad. Sci. USA 90: 6909-6913), vinylogous polypeptides (Hagihara 

25 et al. (1992) J. Amer. Chem. Soc. 1 14: 6568), nonpeptidal peptidomimetics with a Beta-D- 
Glucose scaffolding (Hirschmann et al., (1992) J. Amer. Chem. Soc. 1 14: 9217-9218), 
analogous organic syntheses of small compound libraries (Chen et al. (1994) J. Amer. Chem. 
Soc. 116: 2661), oligocarbamates (Cho, et al, (1993) Science 261:1303), and/or peptidyl 
phosphonates (Campbell et al., (1994) J. Org. Chem. 59: 658). See, generally, Gordon et al, 

30 (1994) J. Med. Chem. 37:1385, nucleic acid libraries (see, e.g., Strategene, Corp.), peptide 
nucleic acid libraries (see, e.g., U.S. Patent 5,539,083), antibody libraries (see, e.g., Vaughn 
etal (1996) Nature Biotechnology, 14(3): 309-314), and PCT/US96/1 0287), carbohydrate 
libraries (see, e.g., Liang et al, (1996) Science, 21 A: 1520-1522, and U.S. Patent No. 
5,593,853), and small organic molecule libraries (see, e.g., benzodiazepines, Baum (1993) 



55 



WO 02/079492 PCT/US02/04915 

C&EN, Jan 18, page 33; isoprenoids, U.S. Patent No. 5,569,588; thiazolidinones and 
metathiazanones, U.S. Patent No. 5,549,974; pyrrolidines, U.S. Patent Nos. 5,525,735 and 
5,519,134; morpholino compounds, U.S. Patent No. 5,506,337; benzodiazepines, U.S. Patent 
No. 5,288,514; and the like). 
5 Devices for the preparation of combinatorial libraries are commercially 

available {see, e.g., 357 MPS, 390 MPS, Advanced Chem Tech, Louisville KY, Symphony, 
Rainin, Woburn, MA, 433A Applied Biosystems, Foster City, CA, 9050 Plus, Millipore, 
Bedford, MA). 

A number of well known robotic systems have also been developed for 
1 0 solution phase chemistries. These systems include automated workstations like the 

automated synthesis apparatus developed by Takeda Chemical Industries, LTD. (Osaka, 
Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark Corporation, 
Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.), which mimic the manual 
synthetic operations performed by a chemist. Any of the above devices are suitable for use 
1 5 with the present invention. The nature and implementation of modifications to these devices 
(if any) so that they can operate as discussed herein will be apparent to persons skilled in the 
relevant art. In addition, numerous combinatorial libraries are themselves commercially 
available (see, e.g., ComGenex, Princeton, NJ., Asinex, Moscow, Ru, Tripos, Inc., St. Louis, 
MO, ChemStar, Ltd, Moscow, RU, 3D Pharmaceuticals, Exton, PA, Martek Biosciences, 
20 Columbia, MD, etc.). 

The assays to identify modulators are amenable to high throughput screening. 
Preferred assays thus detect enhancement or inhibition of angiogenesis gene transcription, 
inhibition or enhancement of polypeptide expression, and inhibition or enhancement of 
polypeptide activity. 

25 High throughput assays for the presence, absence, quantification, or other 

properties of particular nucleic acids or protein products are well known to those of skill in 
the art. Similarly, binding assays and reporter gene assays are similarly well known. Thus, 
for example, U.S. Patent No. 5,559,410 discloses high throughput screening methods for 
proteins, U.S. Patent No. 5,585,639 discloses high throughput screening methods for nucleic 

30 acid binding (i.e., in arrays), while U.S. Patent Nos. 5,576,220 and 5,541,061 disclose high 
throughput methods of screening for ligand/antibody binding. 

In addition, high throughput screening systems are commercially available 
(see, e.g., Zymark Corp., Hopkinton, MA; Air Technical Industries, Mentor, OH; Beckman 
Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). These systems 
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typically automate entire procedures, including all sample and reagent pipetting, liquid 
dispensing, timed incubations, and final readings of the microplate in detector(s) appropriate 
for the assay. These configurable systems provide high throughput and rapid start up as well 
as a high degree of flexibility and customization. The manufacturers of such systems provide 
5 detailed protocols for various high throughput systems. Thus, for example, Zymark Corp. 
provides technical bulletins describing screening systems for detecting the modulation of 
gene transcription, ligand binding, and the like. 

In one embodiment, modulators are proteins, often naturally occurring 
proteins or fragments of naturally occurring proteins. Thus, e.g., cellular extracts containing 
10 proteins, or random or directed digests of proteinaceous cellular extracts, may be used. In 

■ 

this way libraries of proteins may be made for screening in the methods of the invention. 
Particularly preferred in this embodiment are libraries of bacterial, fungal, viral, and 
mammalian proteins, with the latter being preferred, and human proteins being especially 
preferred. Paticularly useful test compound will be directed to the class of proteins to which 
15 the target belongs, e.g., substrates for enzymes or ligands and receptors. 

In a preferred embodiment, modulators are peptides of from about 5 to about 

> 

30 amino acids, with from about 5 to about 20 amino acids being preferred, and from about 7 
to about 15 being particularly preferred. The peptides may be digests of naturally occurring 
proteins as is outlined above, random peptides, or "biased" random peptides. By 

20 "randomized" or grammatical equivalents herein is meant that each nucleic acid and peptide 
consists of essentially random nucleotides and amino acids, respectively. Since generally 
these random peptides (or nucleic acids, discussed below) are chemically synthesized, they 
may incorporate any nucleotide or amino acid at any position. The synthetic process can be 
designed to generate randomized proteins or nucleic acids, to allow the formation of all or 

25 most of the possible combinations over the length of the sequence, thus forming a library of 
randomized candidate bioactive proteinaceous agents. 

In one embodiment, the library is fully randomized, with no sequence 
preferences or constants at any position. In a preferred embodiment, the library is biased. 
That is, some positions within the sequence are either held constant, or are selected from a 

30 limited number of possibilities. For example, in a preferred embodiment, the nucleotides or 
amino acid residues are randomized within a defined class, for example, of hydrophobic 
amino acids, hydrophilic residues, sterically biased (either small or large) residues, towards 
the creation of nucleic acid binding domains, the creation of cysteines, for cross-linking, 
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prolines for SH-3 domains, serines, threonines, tyrosines or histidines for phosphorylation 
sites, etc., or to purines, etc. 

Modulators of angiogenesis can also be nucleic acids, as defined above. 

As described above generally for proteins, nucleic acid modulating agents may 
5 be naturally occurring nucleic acids, random nucleic acids, or "biased" random nucleic acids. 
For example, digests of procaryotic or eucaryotic genomes may be used as is outlined above 
for proteins. 

In a preferred embodiment, the candidate compounds are organic chemical 
moieties, a wide variety of which are available in the literature. 

10 After the candidate agent has been added and the cells allowed to incubate for 

some period of time, the sample containing a target sequence to be analyzed is added to the 
biochip. If required, the target sequence is prepared using known techniques. For example, 
the sample may be treated to lyse the cells, using known lysis buffers, electroporation, etc., 
with purification and/or amplification such as PCR performed as appropriate. For example, 

15 an in viti-o transcription with labels covalently attached to the nucleotides is performed. 
Generally, the nucleic acids are labeled with biotin-FITC or PE, or with cy3 or cy5. 

In a preferred embodiment, the target sequence is labeled with, for example, a 
fluorescent, a chemiluminescent, a chemical, or a radioactive signal, to provide a means of 
detecting the target sequence's specific binding to a probe. The label also can be an enzyme, 

20 such as, alkaline phosphatase or horseradish peroxidase, which when provided with an 

appropriate substrate produces a product that can be detected. Alternatively, the label can be 
a labeled compound or small molecule, such as an enzyme inhibitor, that binds but is not 
catalyzed or altered by the enzyme. The label also can be a moiety or compound, such as, an 
epitope tag or bio tin which specifically binds to strep tavidin. For the example of biotin, the 

25 streptavidin is labeled as described above, thereby, providing a detectable signal for the 

bound target sequence. Unbound labeled streptavidin is typically removed prior to analysis. 

As will be appreciated by those in the art, these assays can be direct 
hybridization assays or can comprise "sandwich assays", which include the use of multiple 
probes, as is generally outlined in U.S. Patent Nos. 5,681,702, 5,597,909, 5,545,730, 

30 5,594,117, 5,591,584, 5,571,670, 5,580,731, 5,571,670, 5,591,584, 5,624,802, 5,635,352, 
5,594,1 18, 5,359,100, 5,124,246 and 5,681,697, all of which are hereby incorporated by 
reference. In this embodiment, in general, the target nucleic acid is prepared as outlined 
above, and then added to the biochip comprising a plurality of nucleic acid probes, under 
conditions that allow the formation of a hybridization complex. 
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A variety of hybridization conditions may be used in the present invention, 
including high, moderate and low stringency conditions as outlined above. The assays are 
generally run under stringency conditions which allows formation of the label probe 
hybridization complex only in the presence of target. Stringency can be controlled by 
5 altering a step parameter that is a thermodynamic variable, including, but not limited to, 
temperature, formamide concentration, salt concentration, chaotropic salt concentration pH, 
organic solvent concentration, etc. 

These parameters may also be used to control non-specific binding, as is 
generally outlined in U.S. Patent No. 5,681,697. Thus it may be desirable to perform certain 
10 steps at higher stringency conditions to reduce non-specific binding. 

The reactions outlined herein may be accomplished in a variety of ways. 
Components of the reaction may be added simultaneously, or sequentially, in different orders, 
with preferred embodiments outlined below. In addition, the reaction may include a variety 
of other reagents. These include salts, buffers, neutral proteins, e.g. albumin, detergents, etc. 
1 5 which may be used to facilitate optimal hybridization and detection, and/or reduce non- 
specific or background interactions. Reagents that otherwise improve the efficiency of the 
assay, such as protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may also be 
used as appropriate, depending on the sample preparation methods and purity of the target. 

The assay data are analyzed to determine the expression levels, and changes in 
20 expression levels as between states, of individual genes, forming a gene expression profile. 

Screens are performed to identify modulators of the angiogenesis phenotype. 
In one embodiment, screening is performed to identify modulators that can induce or 
suppress a particular expression profile, thus preferably generating the associated phenotype. 
In another embodiment, e.g., for diagnostic applications, having identified differentially 
25 expressed genes important in a particular state, screens can be performed to identify 

modulators that alter expression of individual genes. In an another embodiment, screening is 
performed to identify modulators that alter a biological function of the expression product of 
a differentially expressed gene. Again, having identified the importance of a gene in a 
particular state, screens are performed to identify agents that bind and/or modulate the 
30 biological activity of the gene product. 

In addition screens can be done for genes that are induced in response to a 
candidate agent. After identifying a modulator based upon its ability to suppress an 
angiogenesis expression pattern leading to a normal expression pattern, or to modulate a 
single angiogenesis gene expression profile so as to mimic the expression of the gene from 

* 
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normal tissue, a screen as described above can be performed to identify genes that are 
specifically modulated in response to the agent. Comparing expression profiles between 
normal tissue and agent treated angiogenesis tissue reveals genes that are not expressed in 
normal tissue or angiogenesis tissue, but are expressed in agent treated tissue. These agent- 
5 specific sequences can be identified and used by methods described herein for angiogenesis 
genes or proteins. In particular these sequences and the proteins they encode find use in 
marking or identifying agent treated cells. In addition, antibodies can be raised against the 
agent induced proteins and used to target novel therapeutics to the treated angiogenesis tissue 
sample. 

10 Thus, in one embodiment, a test compound is administered to a population of 

angiogenic cells, that have an associated angiogenesis expression profile. By 
"administration" or "contacting" herein is meant that the candidate agent is added to the cells 
in such a manner as to allow the agent to act upon the cell, whether by uptake and 
intracellular action, or by action at the cell surface. In some embodiments, nucleic acid 

15 encoding a proteinaceous candidate agent (i.e., a peptide) may be put into a viral construct 
such as an adenoviral or retroviral construct, and added to the cell, such that expression of 
the peptide agent is accomplished, e.g., PCT US97/01019. Regulatable gene therapy systems 
can also be used. 

Once the test compound has been administered to the cells, the cells can be 
20 washed if desired and are allowed to incubate under preferably physiological conditions for 
some period of time. The cells are then harvested and a new gene expression profile is 
generated, as outlined herein. 

Thus, for example, angiogenesis tissue may be screened for agents that 
modulate, e.g., induce or suppress the angiogenesis phenotype. A change in at least one 
25 gene, preferably many, of the expression profile indicates that the agent has an effect on 

angiogenesis activity. By defining such a signature for the angiogenesis phenotype, screens 
for new drugs that alter the phenotype can be devised. With this approach, the drug target 
need not be known and need not be represented in the original expression screening platform, 
nor does the level of transcript for the target protein need to change. 
30 Measure of angiogenesis polypeptide activity, or of angiogenesis or the 

angiogenic phenotype can be performed using a variety of assays. For example, the effects of 
the test compounds upon the function of the anagiogenesis polypeptides can be measured by 
examining parameters described above. A suitable physiological change that affects activity 
can be used to assess the influence of a test compound on the polypeptides of this invention. 
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When the functional consequences are determined using intact cells or animals, one can also 
measure a variety of effects such as, in the case of angiogenesis associated with tumors, 
tumor growth, neovascularization, hormone release, transcriptional changes to both known 
and uncharacterized genetic markers (e.g., northern blots), changes in cell metabolism such as 
5 cell growth or pH changes, and changes in intracellular second messengers such as cGMP. In 
the assays of the invention, mammalian angiogenesis polypeptide is typically used, e.g., 
mouse, preferably human. 

A variety of angiogenesis assays are known to those of skill in the art. Various 
models have been employed to evaluate angiogenesis (e.g., Croix et al., Science 289:1 197- 

1 0 1202, 2000 and Kahn et al. , Amer. J. Pathol. 156: 1 887-1900). Assessement of angiogenesis 
in the presence of a potential modulator of angiogenesis can be performed using cell-cultre- 
based angiogenesis assays, e.g., endothelial cell tube formation assays, as well as other 
bioassays such as the chick CAM assay, the mouse corneal assay, and assays measuring the 
effect of administering potential modulators on implanted tumors. The chick CAM assay is 

15 described by O'Reilly, et al. Cell 79: 315-328, 1994. Briefly, 3 day old chicken embryos with 
intact yolks are separated from the egg and placed in a petri dish. After 3 days of incubation, 
a methylcellulose disc containing the protein to be tested is applied to the CAM of individual 
embryos. After about 48 hours of incubation, the embryos and CAMs are observed to 
determine whether endothelial growth has been inhibited. The mouse corneal assay involves 

20 implanting a growth factor- containing pellet, along with another pellet containing the 

suspected endothelial growth inhibitor, in the cornea of a mouse and observing the pattern of 
capillaries that are elaborated in the cornea. Angiogenesis can also be measured by 
determining the extent of neovascularization of a tumor. For example, carcinoma cells can be 
subcutaneously inoculated into athymic nude mice and tumor growth then monitored. The 

25 cancer cells are treated with an angiogenesis inhibitor, such as an antibody, or other 

compound that is exogen'ously administered, or can be transfected prior to inoculation with a 
polynucleotide inhibitor of angiogenesis. Immunoassays using endothelial cell-specific 
antibodies are typically used to stain for vascularization of tumor and the number of vessels 
in the tumor. 

30 Assays to identify compounds with modulating activity can be performed in 

vitro. For example, an angiogenesis polypeptide is first contacted with a potential modulator 
and incubated for a suitable amount of time, e.g., from 0.5 to 48 hours. In one embodiment, 
the angiogenesis polypeptide levels are determined in vitro by measuring the level of protein 
or mRNA. The level of protein is measured using immunoassays such as western blotting, 
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ELIS A and the like with an antibody that selectively binds to the angiogenesis polypeptide or 
a fragment thereof. For measurement of mRNA, amplification, e.g., using PCR, LCR, or 
hybridization assays, e.g., northern hybridization, RNAse protection, dot blotting, are 
preferred. The level of protein or mRNA is detected using directly or indirectly labeled 
5 detection agents, e.g. , fluorescently or radioactively labeled nucleic acids, radioactively or 
enzymatically labeled antibodies, and the like, as described herein. 

Alternatively, a reporter gene system can be devised using the angiogenesis 
protein promoter operably linked to a reporter gene such as luciferase, green fluorescent 
protein, CAT, or 0-gal. The reporter construct is typically transfected into a cell. After 

10 treatment with a potential modulator, the amount of reporter gene transcription, translation, or 
activity is measured according to standard techniques known to those of skill in the art. 

In a preferred embodiment, as outlined above, screens may be done on 
individual genes and gene products (proteins). That is, having identified a particular 
differentially expressed gene as important in a particular state, screening of modulators of the 

1 5 expression of the gene or the gene product itself can be done. The gene products of 

differentially expressed genes are sometimes referred to herein as "angiogenesis proteins". In 
preferred embodiments the angiogenesis protein comprises a sequence shown in Table 8. 
The angiogenesis protein may be a fragment, or alternatively, be the full length protein to a 
fragment shown herein. 

20 Preferably, the angiogenesis protein is a fragment of approximately 14 to 24 

amino acids long. More preferably the fragment is a soluble fragment. In one embodiment 
an angiogenesis protein is conjugated or fused to an immunogenic agent or BSA. 

In one embodiment, screening for modulators of expression of specific genes 
is performed. Typically, the expression of only one or a few genes are evaluated. In another 

25 embodiment, screens are designed to first find compounds that bind to differentially 
expressed proteins. These compounds are then evaluated for the ability to modulate 
differentially expressed activity. Moreover, once initial candidate compounds are identified, 
variants can be further screened to better evaluate strucutre activity relationships. 

In a preferred embodiment, binding assays are done. In general, purified or 

30 isolated gene product is used; that is, the gene products of one or more differentially 

expressed nucleic acids are made. For example, antibodies are generated to the protein gene 
products, and standard immunoassays are run to determine the amount of protein present. 
Alternatively, cells comprising the angiogenesis proteins can be used in the assays. 

* 
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Thus, in a preferred embodiment, the methods comprise combining an 
angiogenesis protein and a candidate compound, and determining the binding of the 
compound to the angiogenesis protein. Preferred embodiments utilize the human 
angiogenesis protein, although other mammalian proteins may also be used, for example for 
5 the development of animal models of human disease. In some embodiments, as outlined 
herein, variant or derivative angiogenesis proteins may be used. 

Generally, in a preferred embodiment of the methods herein, the angiogenesis 
protein or the candidate agent is non-diffusably bound to an insoluble support having isolated 
sample receiving areas (e.g. a microtiter plate, an array, etc.). The insoluble supports may be 

10 made of any composition to which the compositions can be bound, is readily separated from 
soluble material, and is otherwise compatible with the overall method of screening. The 
surface of such supports may be solid or porous and of any convenient shape. Examples of 
suitable insoluble supports include microtiter plates, arrays, membranes and beads. These are 
typically made of glass, plastic (e.g., polystyrene), polysaccharides, nylon or nitrocellulose, 

15 teflon™, etc. Microtiter plates and arrays are especially convenient because a large number 
of assays can be carried out simultaneously, using small amounts of reagents and samples. 
The particular manner of binding of the composition is not crucial so long as it is compatible 
with the reagents and overall methods of the invention, maintains the activity of the 
composition and is nondiffusable. Preferred methods of binding include the use of antibodies 

20 (which do not sterically block either the ligand binding site or activation sequence when the 
protein is bound to the support), direct binding to "sticky" or ionic supports, chemical 
crosslinking, the synthesis of the protein or agent on the surface, etc. Following binding of 
the protein or agent, excess unbound material is removed by washing. The sample receiving 
areas may then be blocked through incubation with bovine serum albumin (BSA), casein or 

25 other innocuous protein or other moiety. 

In a preferred embodiment, the angiogenesis protein is bound to the support, 
and a test compound is added to the assay. Alternatively, the candidate agent is bound to the 
support and the angiogenesis protein is added. Novel binding agents include specific 
antibodies, non-natural binding agents identified in screens of chemical libraries, peptide 

30 analogs, etc. Of particular interest are screening assays for agents that have a low toxicity for 
human cells. A wide variety of assays may be used for this purpose, including labeled in 
vitro protein-protein binding assays, electrophoretic mobility shift assays, immunoassays for 
protein binding, functional assays (phosphorylation assays, etc.) and the like. 
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The determination of the binding of the test modulating compound to the 
angiogenesis protein may be done in a number of ways. In a preferred embodiment, the 
compound is labelled, and binding determined directly, e.g., by attaching all or a portion of 
the angiogenesis protein to a solid support, adding a labelled candidate agent (e.g., a 
5 fluorescent label), washing off excess reagent, and determining whether the label is present 
on the solid support. Various blocking and washing steps may be utilized as appropriate. 

By "labeled" herein is meant that the compound is either directly or indirectly 
labeled with a label which provides a detectable signal, e.g. radioisotope, fluoresces, 
enzyme, antibodies, particles such as magnetic particles, chemiluminescers, or specific 

10 binding molecules, etc. Specific binding molecules include pairs, such as biotin and 
streptavidin, digoxin and antidigoxin, etc. For the specific binding members, the 
complementary member would normally be labeled with a molecule which provides for 
detection, in accordance with known procedures, as outlined above. The label can directly or 
indirectly provide a detectable signal. 

15 In some embodiments, only one of the components is labeled, e.g., the 

proteins (or proteinaceous candidate compounds) can be labeled. Alternatively, more than 
one component can be labeled with different labels, e.g., I for the proteinsand a fluorophor 
for the compound. Proximity reagents, e.g., quenching or energy transfer reagents are also 
useful. 

20 In one embodiment, the binding of the test compound is determined by 

competitive binding assay. The competitor is a binding moiety known to bind to the target 
molecule (i.e. an angiogenesis protein), such as an antibody, peptide, binding partner, ligand, 
etc. Under certain circumstances, there may be competitive binding between the compound 
and the binding moiety, with the binding moiety displacing the compound. In one 

25 embodiment, the test compound is labeled. Either the compound, or the competitor, or both, 
is added first to the protein for a time sufficient to allow binding, if present. Incubations may 
be performed at a temperature which facilitates optimal activity, typically between 4 and 
40°C. Incubation periods are typically optimized, e.g. , to facilitate rapid high throughput 
screening. Typically between 0.1 and 1 hour will be sufficient. Excess reagent is generally 

30 removed or washed away. The second component is then added, and the presence or absence 
of the labeled component is followed, to indicate binding. 

In a preferred embodiment, the competitor is added first, followed by the test 
compound. Displacement of the competitor is an indication that the test compound is binding 
to the angiogenesis protein and thus is capable of binding to, and potentially modulating, the 

64 



WO 02/079492 PCT/US02/04915 

activity of the angiogenesis protein. In this embodiment, either component can be labeled. 
Thus, for example, if the competitor is labeled, the presence of label in the wash solution 
indicates displacement by the agent. Alternatively, if the test compound is labeled, the 
presence of the label on the support indicates displacement. 
5 In an alternative embodiment, the test compound is added first, with 

incubation and washing, followed by the competitor. The absence of binding by the 
competitor may indicate that the test compound is bound to the angiogenesis protein with a 
higher affinity. Thus, if the test compound is labeled, the presence of the label on the 
support, coupled with a lack of competitor binding, may indicate that the test compound is 

10 capable of binding to the angiogenesis protein. 

In a preferred embodiment, the methods comprise differential screening to 
identity agents that are capable of modulating the activitity of the angiogenesis proteins. In 
this embodiment, the methods comprise combining an angiogenesis protein and a competitor 
in a first sample. A second sample comprises a test compound, an angiogenesis protein, and 

15 a competitor. The binding of the competitor is determined for both samples, and a change, or 
difference in binding between the two samples indicates the presence of an agent capable of 
binding to the angiogenesis protein and potentially modulating its activity. That is, if the 
binding of the competitor is different in the second sample relative to the first sample, the 
agent is capable of binding to the angiogenesis protein. 

20 Alternatively, differential screening is used to identify drug candidates that 

bind to the native angiogenesis protein, but cannot bind to modified angiogenesis proteins. 
The structure of the angiogenesis protein may be modeled, and used in rational drug design to 
synthesize agents that interact with that site. Drug candidates that affect the activity of an 
angiogenesis protein are also identified by screening drugs for the ability to either enhance or 

25 reduce the activity of the protein. 

Positive controls and negative controls may be used in the assays. Preferably 

i 

control and test samples are performed in at least triplicate to obtain statistically significant 
results. Incubation of all samples is for a time sufficient for the binding of the agent to the 
protein. Following incubation, samples are washed free of non-specifically bound material 
30 and the amount of bound, generally labeled agent determined. For example, where a 

radiolabel is employed, the samples may be counted in a scintillation counter to determine the 
amount of bound compound. 

A variety of other reagents may be included in the screening assays. These 
include reagents like salts, neutral proteins, e.g. albumin, detergents, etc, which may be used 
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to facilitate optimal protein-protein binding and/or reduce non-specific or background 
interactions. Also reagents that otherwise improve the efficiency of the assay, such as 
protease inhibitors, nuclease inhibitors, anti-microbial agents, etc., may be used. The mixture 
of components may be added in an order that provides for the requisite binding. 
5 In a preferred embodiment, the invention provides methods for screening for a 

compound capable of modulating the activity of an angiogenesis protein. The methods 
comprise adding a test compound, as defined above, to a cell comprising angiogenesis 
proteins. Preferred cell types include almost any cell. The cells contain a recombinant 
nucleic acid that encodes an angiogenesis protein. In a preferred embodiment, a library of 

10 candidate agents are tested on a plurality of cells. 

In one aspect, the assays are evaluated in the presence or absence or previous 
or subsequent exposure of physiological signals, for example hormones, antibodies, peptides, 
antigens, cytokines, growth factors, action potentials, pharmacological agents including 
chemotherapeutics, radiation, carcinogenics, or other cells (i.e. cell-cell contacts). In another 

15 example, the determinations are determined at different stages of the cell cycle process. 

In this way, compounds that modulate angiogenesis agents are identified. 
Compounds with pharmacological activity are able to enhance or interfere with the activity of 
the angiogenesis protein. Once identified, similar structures are evaluated to identify critical 
structural feature of the compound. 

20 In one embodiment, a method of inhibiting angiogenic cell division is 

provided. The method comprises administration of an angiogenesis inhibitor. In another 
embodiment, a method of inhibiting angiogenesis is provided. The method comprises 
administration of an angiogenesis inhibitor. In a further embodiment, methods of treating 
cells or individuals with angiogenesis are provided. The method comprises administration of 

25 an angiogenesis inhibitor. 

In one embodiment, an angiogenesis inhibitor is an antibody as discussed 
above. In another embodiment, the angiogenesis inhibitor is an antisense molecule. 

Pol ynucleotide modulators of angiogenesis 
30 Antisense Polynucleotides 

In certain embodiments, the activity of an angiogenesis-associated protein is 
downregulated, or entirely inhibited, by the use of antisense polynucleotide, i.e., a nucleic 
acid complementary to, and which can preferably hybridize specifically to, a coding mRNA 
nucleic acid sequence, e.g., an angiogenesis protein mRNA, or a subsequence thereof. 
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Binding of the antisense polynucleotide to the mRNA reduces the translation and/or stability 
of the mRNA. 

In the context of this invention, antisense polynucleotides can comprise 
naturally-occurring nucleotides, or synthetic species formed from naturally-occurring 
subunits or their close homologs. Antisense polynucleotides may also have altered sugar 
moieties or inter-sugar linkages. Exemplary among these are the phosphorothioate and other 
sulfur containing species which are known for use in the art. Analogs are comprehended by 
this invention so long as they function effectively to hybridize with the angiogenesis protein 
mRNA. See, e.g., Isis Pharmaceuticals, Carlsbad, CA; Sequitor, Inc., Natick, MA. 

Such antisense polynucleotides can readily be synthesized using recombinant 
means, or can be synthesized in vitro. Equipment for such synthesis is sold by several 
vendors, including Applied Biosystems. The preparation of other oligonucleotides such as 
phosphorothioates and alkylated derivatives is also well known to those of skill in the art. 

Antisense molecules as used herein include antisense or sense 
oligonucleotides. Sense oligonucleotides can, e.g., be employed to block trancription by 
binding to the anti-sense strand. The antisense and sense oligonucleotide comprise a single- 
stranded nucleic acid sequence (either RNA or DNA) capable of binding to target mRNA 
(sense) or DNA (antisense) sequences for angiogenesis molecules. A preferred antisense 
molecule is for an angiogenesis sequences in Tables 1-8 , or for a ligand or activator thereof. 
Antisense or sense oligonucleotides, according to the present invention, comprise a fragment 
generally at least about 14 nucleotides, preferably from about 14 to 30 nucleotides. The 
ability to derive an antisense or a sense oligonucleotide, based upon a cDNA sequence 
encoding a given protein is described in, for example, Stein and Cohen (Cancer Res. 48:2659, 
1988) and van der Krol et al. (BioTechniques 6:958, 1988). 

Ribozymes 

In addition to antisense polynucleotides, ribozymes can be used to target and 
inhibit transcription of angio genesis-associated nucleotide sequences. A ribozyme is an RNA 
molecule that catalytically cleaves other RNA molecules. Different kinds of ribozymes have 
been described, including group I ribozymes, hammerhead ribozymes, hairpin ribozymes, 
RNase P, and axhead ribozymes (see, e.g., Castanotto et al. (1994) Adv. in Pharmacology 25: 
289-3 17 for a general review of the properties of different ribozymes). 

The general features of hairpin ribozymes are described, e.g., in Hampel et al 
(1990) Nucl Acids Res. 18: 299-304; Hampel et al (1990) European Patent Publication No. 0 
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360 257; U.S. Patent No. 5,254,678. Methods of preparing are well known to those of skill in 
the art (see, e.g., Wong-Staal et al, WO 94/26877; Ojwang et al. (1993) Proc. Natl Acad. 
Set USA 90: 6340-6344; Yamada et al (1994) Human Gene Therapy 1: 39-45; Leavitt et al 
(1995) Proc. Natl Acad. Sci. USA 92: 699-703; Leavitt et al (1994) Human Gene Therapy 5: 
5 1151-120; and Yamada et al. (1994) Virology 2Q5\ 121-126). 

Polynucleotide modulators of angiogenesis may be introduced into a cell 
containing the target nucleotide sequence by formation of a conjugate with a ligand binding 
molecule, as described in WO 91/04753. Suitable ligand binding molecules include, but are 
not limited to, cell surface receptors, growth factors, other cytokines, or other ligands that 

10 bind to cell surface receptors. Preferably, conjugation of the ligand binding molecule does 
not substantially interfere with the ability of the ligand binding molecule to bind to its 
corresponding molecule or receptor, or block entry of the sense or antisense oligonucleotide 
or its conjugated version into the cell. Alternatively, a polynucleotide modulator of 
angiogenesis may be introduced into a cell containing the target nucleic acid sequence, e.g., 

1 5 by formation of an polynucleotide-lipid complex, as described in WO 90/10448. It is 

understood that the use of antisense molecules or knock out and knock in models may also be 

* 

used in screening assays as discussed above, in addition to methods of treatment. 

Thus, in one embodiment, methods of modulating angiogenesis in cells or 
organisms are provided. In one embodiment, the methods comprise administering to a cell an 

20 anti-angiogenesis antibody that reduces or eliminates the biological activity of an 

endogeneous angiogenesis protein. Alternatively, the methods comprise, administering to a 
cell or organism a recombinant nucleic acid encoding an angiogenesis protein. This may be 
accomplished in any number of ways. In a preferred embodiment, for example when the 
angiogenesis sequence is down-regulated in angiogenesis, such state may be reversed by 

25 increasing the amount of angiogenesis gene product in the cell. This can be accomplished, 
e.g., by overexpressing the endogeneous angiogenesis gene or administering a gene encoding 
the angiogenesis sequence, using known gene-therapy techniques, for example. In a 
prefeixed embodiment, the gene therapy techniques include the incorporation of the 
exogenous gene using enhanced homologous recombination (EHR), for example as described 

30 in PCT/US93/03868, hereby incorporated by reference in its entireity. Alternatively, for 

example when the angiogenesis sequence is up-regulated in angiogenesis, the activity of the 
endogeneous angiogenesis gene is decreased, for example by the administration of a 
angiogenesis antisense nucleic acid or other inhibitor, such as RNAi. 
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In one embodiment, the angiogenesis eproteins of the present invention may 
be used to generate polyclonal and monoclonal antibodies to angiogenesis proteins. 
Similarly, the angiogenesis proteins can be coupled, using standard technology, to affinity 
chromatography columns. These columns may then be used to purify angiogenesis 
5 antibodies useful for production, diagnostic, or therapeutic purposes. In a preferred 

embodiment, the antibodies are generated to epitopes unique to a angiogenesis protein; that 
is, the antibodies show little or no cross-reactivity to other proteins. The angiogenesis 
antibodies may be coupled to standard affinity chromatography columns and used to purify 
angiogenesis proteins. The antibodies may also be used as blocking polypeptides, as outlined 
10 above, since they will specifically bind to the angiogenesis protein. 

Methods of identifyiitg variant angiogenesis-associated sequences 

Without being bound by theory, expression of various angiogenesis sequences 
is correlated with angiogenesis. Accordingly, disorders based on mutant or variant 

1 5 angiogenesis genes may be determined. In one embodiment, the invention provides methods 
for identifying cells containing variant angiogenesis genes, e.g., determining all or part of the 
sequence of at least one endogeneous angiogenesis genes in a cell. This may be 
accomplished using any number of sequencing techniques. In a preferred embodiment, the 
invention provides methods of identifying the angiogenesis genotype of an individual, e.g., 

20 determining all or part of the sequence of at least one angiogenesis gene of the individual 
This is generally done in at least one tissue of the individual, and may include the evaluation 
of a number of tissues or different samples of the same tissue. The method may include 
comparing the sequence of the sequenced angiogenesis gene to a known angiogenesis gene, 
i.e., a wild- type gene. 

25 The sequence of all or part of the angiogenesis gene can then be compared to 

the sequence of a known angiogenesis gene to determine if any differences exist. This can be 
done using any number of known homology programs, such as Bestfit, etc. In a preferred 
embodiment, the presence of a a difference in the sequence between the angiogenesis gene of 
the patient and the known angiogenesis gene correlates with a disease state or a propensity 

30 for a disease state, as outlined herein. 

In a preferred embodiment, the angiogenesis genes are used as probes to 
determine the number of copies of the angiogenesis gene in the genome. 

In another preferred embodiment, the angiogenesis genes are used as probes to 
determine the chromosomal localization of the angiogenesis genes. Information such as 
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chromosomal localization finds use in providing a diagnosis or prognosis in particular when 
chromosomal abnormalities such as translocations, and the like are identified in the 
angiogenesis gene locus. 

5 Administration of pharmaceutical and vaccine compositions 

In one embodiment, a therapeutically effective dose of an angiogenesis protein 
or modulator thereof, is administered to a patient. By "therapeutically effective dose" herein 
is meant a dose that produces effects for which it is administered. The exact dose will depend 
on the purpose of the treatment, and will be ascertainable by one skilled in the art using 

* 

10 known techniques (e.g., Ansel et al y Pharmaceuitcal Dosage Forms and Drug Delivery, 
Lippincott, Williams & Wilkins Publishers, ISBN:0683305727; Lieberman (1992) 
Pharmaceutical Dosage Forms (vols. 1-3), Dekker, ISBN 0824770846, 082476918X, 
0824712692, 0824716981; Lloyd (1999) The Art, Science and Technology of Pharmaceutical 
Compounding, Amer. Pharmacutical Assn, ISBN 0917330889; and Pickar (1999) Dosage 

15 Calculations, Delmar Pub, ISBN 0766805042). As is known in the art, adjustments for 
angiogenesis degradation, systemic versus localized delivery, and rate of new protease 
synthesis, as well as the age, body weight, general health, sex, diet, time of administration, 
drug interaction and the severity of the condition may be necessary, and will be ascertainable 
with routine experimentation by those skilled in the art. 

20 A "patient" for the purposes of the present invention includes both humans and 

other animals, particularly mammals. Thus the methods are applicable to both human 
therapy and veterinary applications. In the preferred embodiment the patient is a mammal, 
preferably a primate, and in the most preferred embodiment the patient is human. 

The administration of the angiogenesis proteins and modulators thereof of the 

25 present invention can be done in a variety of ways as discussed above, including, but not 
limited to, orally, subcutaneously, intravenously, intranasally, transdermally, 
intraperitoneally, intramuscularly, intrapulmonary, vaginally, rectally, or intraocularly. In 
some instances, for example, in the treatment of wounds and inflammation, the angiogenesis 
proteins and modulators may be directly applied as a solution or spray. 

30 The pharmaceutical compositions of the present invention comprise an 

angiogenesis protein in a form suitable for administration to a patient. In the preferred 
embodiment, the pharmaceutical compositions are in a water soluble form, such as being 
present as pharmaceutically acceptable salts, which is meant to include both acid and base 
addition salts. "Pharmaceutically acceptable acid addition salt" refers to those salts that retain 

70 



WO 02/079492 



PCT/US02/049I5 



the biological effectiveness of the free bases and that are not biologically or otherwise 
undesirable, formed with inorganic acids such as hydrochloric acid, hydrobromic acid, 
sulfuric acid, nitric acid, phosphoric acid and the like, and organic acids such as acetic acid, 
propionic acid, glycolic acid, pyruvic acid, oxalic acid, maleic acid, malonic acid, succinic 
5 acid, fumaric acid, tartaric acid, citric acid, benzoic acid, cinnamic acid, mandelic acid, 

methanesulfonic acid, ethanesulfonic acid, p-toluenesulfonic acid, salicylic acid and the like. 
"Pharmaceutical^ acceptable base addition salts" include those derived from inorganic bases 
such as sodium, potassium, lithium, ammonium, calcium, magnesium, iron, zinc, copper, 
manganese, aluminum salts and the like. Particularly preferred are the ammonium, 

10 potassium, sodium, calcium, and magnesium salts. Salts derived from pharmaceutic ally 
acceptable organic non-toxic bases include salts of primary, secondary, and tertiary amines, 
substituted amines including naturally occurring substituted amines, cyclic amines and basic 
ion exchange resins, such as isopropylamine, trimethylamine, diethylamine, triethylamine, 
tripropylamine, and ethanolamine. 

15 The pharmaceutical compositions may also include one or more of the 

following: carrier proteins such as serum albumin; buffers; fillers such as microcrystalline 
cellulose, lactose, corn and other starches; binding agents; sweeteners and other flavoring 
agents; coloring agents; and polyethylene glycol 

The pharmaceutical compositions can be administered in a variety of unit 

20 dosage forms depending upon the method of administration. For example, unit dosage forms 
suitable for oral administration include, but are not limited to, powder, tablets, pills, capsules 
and lozenges. It is recognized that angiogenesis protein modulators (e.g., antibodies, 
antisense constructs, ribozymes, small organic molecules, etc.) when administered orally, 
should be protected from digestion. This is typically accomplished either by complexing the 

25 molecule(s) with a composition to render it resistant to acidic and enzymatic hydrolysis, or by 
packaging the molecule(s) in an appropriately resistant carrier, such as a liposome or a 
protection barrier. Means of protecting agents from digestion are well known in the art. 

The compositions for administration will commonly comprise an angiogenesis 
protein modulator dissolved in a pharmaceutical^ acceptable carrier, preferably an aqueous 

30 carrier. A variety of aqueous carriers can be used, e.g., buffered saline and the like. These 
solutions are sterile and generally free of undesirable matter. These compositions may be 
sterilized by conventional, well known sterilization techniques. The compositions may 
contain pharmaceutically acceptable auxiliary substances as required to approximate 
physiological conditions such as pH adjusting and buffering agents, toxicity adjusting agents 
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and the like, for example, sodium acetate, sodium chloride, potassium chloride, calcium 
chloride, sodium lactate and the like. The concentration of active agent in these formulations 
can vary widely, and will be selected primarily based on fluid volumes, viscosities, body 
weight and the like in accordance with the particular mode of administration selected and the 
5 patient's needs (e.g., Remington 's Pharmaceutical Science, 15th ed., Mack Publishing 
Company, Easton, Pennsylvania (1980) and Goodman and Gillman, The Pharmacologial 
Basis of Therapeutics ,(Hardman, J.G, Limbird, L.E, Molinoff, P.B., Ruddon, R.W, and 
Gilman, A.G.,eds) TheMcGraw-Hill Companies, Inc., 1996). 

Thus, a typical pharmaceutical composition for intravenous administration 

10 would be about 0.1 to 10 mg per patient per day. Dosages from 0.1 up to about 100 mg per 
patient per day may be used, particularly when the drug is administered to a secluded site and 
not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for 
preparing parenterally administrable compositions will be known or apparent to those skilled 

15 in the art, e.g., Remington s Pharmaceutical Science and Goodman and Gillman, The 
Pharmacologial Basis of Tlierapentics, supra. 

The compositions containing modulators of angiogenesis proteins can be 
administered for therapeutic or prophylactic treatments. In therapeutic applications, 
compositions are administered to a patient suffering from a disease (e.g., a cancer) in an 

20 amount sufficient to cure or at least partially arrest the disease and its complications. An 

amount adequate to accomplish this is defined as a "therapeutically effective dose," Amounts 
effective for this use will depend upon the severity of the disease and the general state of the 
patient's health. Single or multiple administrations of the compositions may be administered 
depending on the dosage and frequency as required and tolerated by the patient. In any event, 

25 the composition should provide a sufficient quantity of the agents of this invention to 

effectively treat the patient. An amount of modulator that is capable of preventing or slowing 
the development of cancer in a mammal is referred to as a "prophylactically effective dose." 
The particular dose required for a prophylactic treatment will depend upon the medical 
condition and history of the mammal, the particular cancer being prevented, as well as other 

30 factors such as age, weight, gender, administration route, efficiency, etc. Such prophylactic 
treatments may be used, e.g., in a mammal who has previously had cancer to prevent a 
recurrence of the cancer, or in a mammal who is suspected of having a significant likelihood 
of developing cancer. 
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It will be appreciated that the present angiogenesis protein-modulating 
compounds can be administered alone or in combination with additional angiogenesis 
modulating compounds or with other therapeutic agent, e.g., other anti-cancer agents or 
treatments. 

In numerous embodiments, one or more nucleic acids, e.g., polynucleotides 
comprising nucleic acid sequences set forth in Tables 1-8 , such as antisense polynucleotides 
or ribozymes, will be introduced into cells, in vitro or in vivo. The present invention provides 
methods, reagents, vectors, and cells useful for expression of angiogenesis-associated 
polypeptides and nucleic acids using in vitro (cell-free), ex vivo or in vivo (cell or 
organism-based) recombinant expression systems. 

The particular procedure used to introduce the nucleic acids into a host cell for 
expression of a protein or nucleic acid is application specific. Many procedures for 
introducing foreign nucleotide sequences into host cells may be used. These include the use 
of calcium phosphate transfection, spheroplasts, electroporation, liposomes, microinjection, 
plasma vectors, viral vectors and any of the other well known methods for introducing cloned 
genomic DNA, cDNA, synthetic DNA or other foreign genetic material into a host cell (see, 
e.g., Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in Enzymology 
volume 152 Academic Press, Inc., San Diego, CA (Berger), F.M. Ausubel et al., eds., Current 
Protocols, a joint venture between Greene Publishing Associates, Inc. and John Wiley & 
Sons, Inc., (supplemented through 1999), and Sambrook et al., Molecular Cloning - A 
Laboratory Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring 
Harbor, New York, 1989. 

In a preferred embodiment, angiogenesis proteins and modulators are 
administered as therapeutic agents, and can be formulated as outlined above. Similarly, 
angiogenesis genes (including both the full-length sequence, partial sequences, or regulatory 
sequences of the angiogenesis coding regions) can be administered in a gene therapy 
application. These angiogenesis genes can include antisense applications, either as gene 
therapy (i.e. for incorporation into the genome) or as antisense compositions, as will be 
appreciated by those in the art. 

Angiogenesis polypeptides and polynucleotides can also be administered as 
vaccine compositions to stimulate HTL, CTL and antibody responses.. Such vaccine 
compositions can include, for example, lipidated peptides (e.g. 5 Vitiello, A. et al., J. Clin. 
Invest. 95:341, 1995), peptide compositions encapsulated in poly(DL-lactide-co-glycolide) 
("PLG") microspheres (see, e.g., Eldridge, etal, Molec. Immunol 28:287-294, 1991: Alonso 
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etal, Vaccine 12:299-306, 1 994; Jones et al. , Vaccine 13:675-681, 1995), peptide 
compositions contained in immune stimulating complexes (ISCOMS) (see, e.g., Takahashi et 
al, Nature 344:873-875, 1990; Hu a/., Clin Exp Immunol 113:235-243, 1998), multiple 
antigen peptide systems (MAPs) (see e.g., Tarn, J. P., Proc. Natl. Acad. Set U.S.A. 85:5409- 
5 5413, 1988; Tarn, J.P., X Immunol Methods 196:17-32, 1996), peptides formulated as 
multivalent peptides; peptides for use in ballistic delivery systems, typically crystallized 
peptides, viral delivery vectors (Perkus, M. E. et al, In: Concepts in vaccine development, 
Kaufinann, S. H. E„ ed., p. 379, 1996; Chakrabarti, S. et al, Nature 320:535, 1986; Hu, S. L. 
et al, Nature 320:537, 1986; Kieny, M.-P. et al, AIDS Bio/Technology 4:790, 1986; Top, F. 

10 H. et al, J. Infect. Dis. 124:148, 1971; Chanda, P. K. et al, Virology 175:535, 1990), 
particles of viral or synthetic origin (e.g., Kofler, N. et al, J. Immunol Methods. 192:25, 
1996; Eldridge, J. H. etal, Sem. Hematol 30:16, 1993; Falo, L. D., Jr. etal, Nature Med. 
7:649, 1995), adjuvants (Warren, H. S., Vogel, F. R., and Chedid, L. A. Annu. Rev. Immunol 
4:369, 1986; Gupta, R. K. et al, Vaccine 11:293, 1993), liposomes (Reddy, R. et al.,J. 

1 5 Immunol 148: 1 585, 1992; Rock, K. L., Immunol Today 17:131,1 996), or, naked or particle 
absorbed cDNA (Ulmer, J. B. et al, Science 259: 1745, 1993; Robinson, H. L., Hunt, L. A., 
and Webster, R. G., Vaccine 11:957, 1993; Shiver, J. W. etal, In: Concepts in vaccine 
development, Kaufinann, S. H. E., ed., p. 423, 1996; Cease, K. B., and Berzofsky, J. A., 
Anna. Rev. Immunol 12:923, 1994 and Eldridge, J. H. et al, Sem. Hematol. 30:16, 1993). 

20 Toxin-targeted delivery technologies, also known as receptor mediated targeting, such as 
those of Avant Immunotherapeutics, Inc. (Needham, Massachusetts) may also be used. 

Vaccine compositions often include adjuvants. Many adjuvants contain a 
substance designed to protect the antigen from rapid catabolism, such as aluminum hydroxide 
or mineral oil, and a stimulator of immune responses, such as lipid A, Bortadella pertussis or 

25 Mycobacterium tuberculosis derived proteins. Certain adjuvants are commercially available 
as, for example, Freund's Incomplete Adjuvant and Complete Adjuvant (Difco Laboratories, 

■ 

Detroit, MI); Merck Adjuvant 65 (Merck and Company, Inc., Railway, NJ); AS-2 
(SmithKline Beecham, Philadelphia, PA); aluminum salts such as aluminum hydroxide gel 
(alum) or aluminum phosphate; salts of calcium, iron or zinc; an insoluble suspension of 
30 acylated tyrosine; acylated sugars; cationically or anionically derivatized polysaccharides; 
polyphosphazenes; biodegradable microspheres; monophosphoryl lipid A and quil A. 
Cytokines, such as GM-CSF, interleukin-2, -7, -12, and other like growth factors, may also be 
used as adjuvants. 
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Vaccines can be administered as nucleic acid compositions wherein DNA or 
RNA encoding one or more of the polypeptides, or a fragment thereof, is administered to a 
patient. This approach is described, for instance, in Wolff et. al, Science 247:1465 (1990) as 
well as U.S. Patent Nos. 5,580,859; 5,589,466; 5,804,566; 5,739,118; 5,736,524; 5,679,647; 
5 WO 98/04720; and in more detail below. Examples of DNA-based delivery technologies 
include "naked DNA", facilitated (bupivicaine, polymers, peptide-mediated) delivery, 
cationic lipid complexes, and particle-mediated ("gene gun") or pressure-mediated delivery 
{see, e.g., U.S. Patent No. 5,922,687). 

For therapeutic or prophylactic immunization purposes, the peptides of the 

1 0 invention can be expressed by viral or bacterial vectors. Examples of expression vectors 

include attenuated viral hosts, such as vaccinia or fowlpox. This approach involves the use of 
vaccinia virus, for example, as a vector to express nucleotide sequences that encode 
angiogenic polypeptides or polypeptide fragments. Upon introduction into a host, the 
recombinant vaccinia virus expresses the immunogenic peptide, and thereby elicits an 

1 5 immune response. Vaccinia vectors and methods useful in immunization protocols are 
described in, e.g., U.S. Patent No. 4,722,848. Another vector is BCG (Bacille Calmette 
Guerin). BCG vectors are described in Stover et al, Nature 351 :456-460 (1991). A wide 
variety of other vectors useful for therapeutic administration or immunization e.g. adeno and 
adeno-associated virus vectors, retroviral vectors, Salmonella typhi vectors, detoxified 

20 anthrax toxin vectors, and the like, will be apparent to those skilled in the art from the 

description herein (see, e.g., Shata et al. (2000) Mol Med Today, 6: 66-71; Shedlock et al, J 
LeukocBiol 68,:793-806, 2000; Hipp et al., In Vivo 14:571-85, 2000). 

Methods for the use of genes as DNA vaccines are well known, and include 
placing an angiogenesis gene or portion of an angiogenesis gene under the control of a 

25 regulatable promoter or a tissue-specific promoter for expression in an angiogenesis patient. 
The angiogenesis gene used for DNA vaccines can encode fiill-Iength angiogenesis proteins, 
but more preferably encodes portions of the angiogenesis proteins including peptides derived 
from the angiogenesis protein. In one embodiment, a patient is immunized with a DNA 
vaccine comprising a plurality of nucleotide sequences derived from an angiogenesis gene. 

30 For example, angiogenesis-associated genes or sequence encoding subfragments of an 
angiogenesis protein are introduced into expression vectors and tested for their 
immunogenicity in the context of Class I MHC and an ability to generate cytotoxic T cell 
responses. This procedure provides for production of cytotoxic T cell responses against cells 
which present antigen, including intracellular epitopes. 
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In a preferred embodiment, the DNA vaccines include a gene encoding an 
adjuvant molecule with the DNA vaccine. Such adjuvant molecules include cytokines that 
increase the immunogenic response to the angiogenesis polypeptide encoded by the DNA 
vaccine. Additional or alternative adjuvants are available. 
5 In another preferred embodiment angiogenesis genes find use in generating 

animal models of angiogenesis. When the angiogenesis gene identified is repressed or 
diminished in angiogenesic tissue, gene therapy technology, e.g., wherein antisense RNA 
directed to the angiogenesis gene will also diminish or repress expression of the gene. 
Animal models of angiogenesis find use in screening for modulators of an angiogenesis- 

10 associated sequence or modulators of angiogenesis. Similarly, transgenic animal technology 
including gene knockout technology, for example as a result of homologous recombination 
with an appropriate gene targeting vector, will result in the absence or increased expression 
of the angiogenesis protein. When desired, tissue-specific expression or knockout of the 
angiogenesis protein may be necessary, 

15 It is also possible that the angiogenesis protein is overexpressed in 

angiogenesis. As such, transgenic animals can be generated that overexpress the 
angiogenesis protein. Depending on the desired expression level, promoters of various 
strengths can be employed to express the transgene. Also, the number of copies of the 
integrated transgene can be determined and compared for a determination of the expression 

20 level of the transgene. Animals generated by such methods find use as animal models of 
angiogenesis and are additionally useful in screening for modulators to treat angiogenesis or 
to evaluate a therapeutic entity. 

Kits for Use in Diagnostic and/or Prognostic Applications 

25 For use in diagnostic, research, and therapeutic applications suggested above, 

kits are also provided by the invention. In the diagnostic and research applications such kits 
may include any or all of the following: assay reagents, buffers, angiogenesis-specific nucleic 
acids or antibodies, hybridization probes and/or primers, antisense polynucleotides, 
ribozymes, dominant negative angiogenesis polypeptides or polynucleotides, small molecules 

30 inhibitors of angiogenesis-associated sequences etc. A therapeutic product may include 
sterile saline or another pharmaceutically acceptable emulsion and suspension base. 

In addition, the kits may include instructional materials containing directions 
(i.e., protocols) for the practice of the methods of this invention. While the instructional 
materials typically comprise written or printed materials they are not limited to such. Any 

76 



WO 02/079492 



PCT/US02/04915 



medium capable of storing such instructions and communicating them to an end user is 
contemplated by this invention. Such media include, but are not limited to electronic storage 
media (e.g., magnetic discs, tapes, cartridges, chips), optical media (e.g., CD ROM), and the 
like. Such media may include addresses to internet sites that provide such instructional 
5 materials. 

The present invention also provides for kits for screening for modulators of 
angiogenesis-associated sequences. Such kits can be prepared from readily available 
materials and reagents. For example, such kits can comprise one or more of the following 
materials: an angiogenesis-associated polypeptide or polynucleotide, reaction tubes, and 

10 instructions for testing angiogenic-associated activity. Optionally, the kit contains 

biologically active angiogenesis protein. A wide variety of kits and components can be 
prepared according to the present invention, depending upon the intended user of the kit and 
the particular needs of the user. Diagnosis would typically involve evaluation of a plurality 
of genes or products. The genes will be selected based on correlations with important 

15 parameters in disease which may be identified in historical or outcome data. 

It is understood that the examples described above in no way serve to limit 
the true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
20 specification are herein incorporated by reference as if each individual publication or patent 
application were specifically and individually indicated to be incorporated by reference. 

EXAMPLES 

25 Example 1: Tissue Preparation. Labeling Chips, and Fingerprints 
Purify total RNA from tissue using TRIzol Reagent 

Homogenize tissue samples in 1ml of TRIzol per 50mg of tissue using a 

* 

Polytron 3100 homogenizer. The generator/probe used depends upon the tissue size. A 
generator that is too large for the amount of tissue to be homogenized will cause a loss of 
30 sample and lower RNA yield. TRIzol is added directly to frozen tissue, which is then 

homogenize. Following homogenization, insoluble material is removed by centrifiigation at 
7500 x g for 15 min in a Sorvall superspeed or 12,000 x g for 10 min. in an Eppendorf 
centrifuge at 4°C. The clear homogenate is transferred to a new tube for use. The samples 
may be frozen now at -60° to -70°C (and kept for at least one month). The homogenate is 
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mixed with 0.2ml of chloroform per 1ml of TRIzol reagent used in the original 
homogenization and incubated at room temp, for 2-3 minutes. The aqueous phase is then 
separated by centrifugation and transferred to a fresh tube and the RNA precipitated using 
isopropyl alcohol. The pellet is isolated by centrifugation, washed, air-dried, resuspended in 
5 an appropriate volume of DEPC H2O, and the absorbance measured. 

Purification of poly A+ mRNA from total RNA is performed as follows. Heat 
an oligotex suspension to 37°C and mixing immediately before adding to RNA. The 
Elution Buffer is heated at 70°C. Warm up 2 x Binding Buffer at 65°C if there is precipitate 
in the buffer. Mix total RNA with DEPC-treated water, 2 x Binding Buffer, and Oligotex 

10 according to Table 2 on page 16 of the Oligotex Handbook. Incubate for 3 minutes at 65°C. 
Incubate for 10 minutes at room temperature. Centrifuge for 2 minutes at 14,000 to 18,000 g. 
Remove supernatant without disturbing Oligotex pellet. A little bit of solution can be left 
behind to reduce the loss of Oligotex. Gently resuspend in Wash Buffer OW2 and pipet onto 
spin column. Centrifuge the spin column at full speed for 1 minute. Transfer spin column to 

1 5 a new collection tube and gently resuspend in Wash Buffer OW2 and centrifuge as describe 
herein. Transfer spin column to a new tube and elute with 20 to 100 ul of preheated (70oC) 
Elution Buffer. Gently resuspend Oligotex resin by pipetting up and down. Centrifuge as 
above. Repeat elution with fresh elution buffer or use first eluate to keep the elution volume 
low. Read absorbance, using diluted Elution Buffer as the blank. Before proceeding with 

20 cDNA synthesis, precipitate the mRNA as follows: add 0.4 vol. of 7.5 M NH40Ac + 2.5 vol. 
of cold 100% ethanol. Precipitate at -20oC 1 hour to overnight (or 20-30 min. at -70oC). 
Centrifuge at 14,000-16,000 x g for 30 minutes at 4oC. Wash pellet with 0.5ml of 
80%ethanol (-20oC) then centrifuge at 14,000-16,000 x g for 5 minutes at room temperature.. 
Repeat 80% ethanol wash. Air dry the ethanol from the pellet in the hood.. Suspend pellet in 

25 DEPC H 2 0 at lug/ul concentration. 

To further Clean up total RNA using Qiagen's RNeasy kit, add no more than 
lOOug to an RNeasy column. Adjust sample to a volume of lOOul with RNase-free water. 
Add 350ul Buffer RLT then 250ul ethanol (100%) to the sample. Mix by pipetting (do not 
centrifuge) then apply sample to an RNeasy mini spin column. Centrifuge for 15 sec at 

30 >l0,000rpm. Transfer column to a new 2-ml collection tube. Add 500ul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough. Add 500ul Buffer RPE and 
centrifuge for 15 sec at >10,000rpm. Discard flowthrough then centrifuge for 2 min at 
maximum speed to dry column membrane. Transfer column to a new 1.5-ml collection tube 
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and apply 30-5 Oul of RNase-free water directly onto column membrane. Centrifuge 1 min at 
>10,000rpm. Repeat elution. and read absorbance. 

cDNA synthesis using Gibco's "Superscript Choice System for cDNA Synthesis" kit 
5 First Strand cDNA synthesis is performed as follows. Use 5ug of total RNA 

or lug of polyA+ mRNA as starting material. For total RNA, use 2ul of Superscript RT. For 
polyA+ mRNA, use lul of Superscript RT. Final volume of first strand synthesis mix is 
20ul. RNA must be in a volume no greater than lOul. Incubate RNA with lul of lOOpmol 
T7-T24 oligo for 10minat70C. On ice, add 7 ul of: 4ul5Xlst Strand Buffer, 2ul of 0.1 M 

10 DTT, and 1 ul of lOmM dNTP mix. Incubate at 37C for 2 min then add Superscript RT. 
Incubate at 37C for 1 hour. 

For the second strand synthesis, place 1st strand reactions on ice and add: 91ul 
DEPC H 2 0; 30ul 5X 2nd Strand Buffer; 3ul lOmM dNTP mix; lul lOU/ul E.coli DNA 
Ligase; 4ul lOU/ul E.coli DNA Polymerase; and lul 2U/ul RNase H. Mix and incubate 2 

15 hours at 16C. Add 2ul T4 DNA Polymerase. Incubate 5 min at 16C. Add lOul of 0.5M 
EDTA. A further clean-up of DNA is performed using phenol:chloroform:isoamyl Alcohol 
(25:24:1) purification. 

In vitro Transcription (IVT) and labeling with biotin is performed as follows: 
Pipet 1 ,5ul of cDNA into a thin-wall PCR tube. Make NTP labeling mix by combining 2ul T7 

20 lOxATP (75mM) (Ambion); 2ul T7 lOxGTP (75mM) (Ambion); 1.5ul T7 lOxCTP (75mM) 
(Ambion); 1.5ul T7 lOxUTP (75mM) (Ambion); 3.75ul lOmM Bio-ll-UTP (Boehringer- 
Mannheim/Roche or Enzo); 3.75ul 10mMBio-16-CTP (Enzo); 2ul lOx T7 transcription 
buffer (Ambion); and 2ul lOx T7 enzyme mix (Ambion). The final volume is 20ul. Incubate 
6 hours at 37°C in a PCR machine. The RNA can be furthered cleaned. 

25 Fragmentation is performed as follows. 15 ug of labeled RNA is usually 

fragmented. Try to minimize the fragmentation reaction volume; a 10 ul volume is 
recommended but 20 ul is all right. Do not go higher than 20 ul because the magnesium in 
the fragmentation buffer contributes to precipitation in the hybridization buffer. Fragment 
RNA by incubation at 94 C for 35 minutes in 1 x Fragmentation buffer (5 x Fragmentation 

30 buffer is 200 mM Tris-acetate, pH 8.1; 500 mM KOAc; 150 mM MgOAc). The labeled 
RNA transcript can be analyzed before and after fragmentation. Samples can be heated to 
65°C for 15 minutes and electrophoresed on 1% agarose/TBE gels to get an approximate idea 
of the transcript size range 
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For hybridization, 200 ul (10ug cRNA) of a hybridization mix is put on the 
chip. If multiple hybridizations are to be done (such as cycling through a 5 chip set), then it 
is recommended that an initial hybridization mix of 300 ul or more be made. The 
hybridization mix is: fragment labeled RNA (50ng/ul final cone); 50 pM 948-b control 
5 oligo; 1.5 pM BioB; 5 pM BioC; 25 pM BioD; 100 pM CRE; O.lmg/ml herring sperm DNA; 
0.5mg/ml acetylated BSA; and 300 ul with lxMES hyb buffer. 

Labeling is performed as follows: The hybridization reaction includes non- 
biotinylated IVT (purified by RNeasy columns); IVT antisense RNA 4 (J.g:jal; random 
Hexamers (1 |ig/|il) 4 |il and water to 14 ul. The reaciton is incubated at 70°C, 10 min. 

10 Reverse transcriptionis performed in the following reaction: 5X First Strand (BRL) buffer, 6 
Hi; 0.1 M DTT, 3 ul; 50X dNTP mix, 0.6 jliI; H 2 0, 2.4 fal; Cy3 or Cy5 dUTP (ImM), 3 |il; SS 
RT II (BRL), 1 jliI in a final volume of 16 \xl. Add to hybridization reaction. Incubate 30 
min., 42°C. Add 1 jliI SSII and incubate another hour. Put on ice. SOX dNTP mix (25mM of 
cold dATP, dCTP, and dGTP, lOmM of dTTP: 25 \il each of lOOmM dATP, dCTP, and 

15 dGTP; 10 yl of lOOmM dTTP to 15 \il H20. dNTPs from Pharmacia) 

RNA degradation is performed as follows. Add 86 jil H20, 1.5 jil 1M NaOH/ 
2mM EDTA and incubate at 65°C, 10 min.. For U-Con 30, 500 jlxI TE/sample spin at 7000g 
for 10 min, save flow through for purification. For Qiagen purification, suspend u-con 
recovered material in 500^1 buffer PB and proceed using Qiagen protocol. For DNAse 

20 digestion, add 1 ul of 1/100 dil of DNAse/30ul Rx and incubate at 37°C for 15 min. Incubate 
at 5 min 95 °C to denature the DNAse/ 

For sample preparation, add Cot-1 DNA, 10 |al; 50X dNTPs, 1 jal; 20X SSC, 
2.3 Na pyro phosphate, 7.5 \xl; lOmg/ml Herring sperm DNA; lul of 1/10 dilution to 21.8 
final vol. Dry in speed vac. Resuspend in 15 \xl H20. Add 0.38 \xl 10% SDS. Heat 95°C, 2 

25 min and slow cool at room temp, for 20 min. Put on slide and hybridize overnight at 64°C. 
Washing after the hybridization: 3X SSC/0.03% SDS: 2 min., 37.5 mis 20X SSC+0.75mls 
10% SDS in 250mls H20; IX SSC: 5 min., 12.5 mis 20X SSC in 250mls H20; 0.2X SSC: 5 
min., 2.5 mis 20X SSC in 250mls H20. Dry slides and scan at appropiate PMT's and 
channels. 

30 

Example 2. A model of angiogenesis is used to determine expression in angiogenesis 

In the model of angiogenesis used to determine expression of angiogenesis- 
associated sequences, human umbilical vein endothelial cells (HUVEC) were obtained, e.g. , 
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as passage 1 (pi) frozen cells from Cascade Biologies (Oregon) and grown in maintenance 
medium: Medium 199 (Life Technologies) supplemented with 20% pooled human serum, 
1 00 mg/ml heparin and 75 mg/ml endothelial cell growth supplements (Sigma) and 
gentamicin (Life Technologies). An in vitro cell system model was used in which 2xl0 5 

5 HUVECs were cultured in 0.5 ml 3 mgs/ml plasminogen-depleted fibrinogen (Calbiochem, 
San Diego, CA) that was polymerized by the addition of 1 unit of maintenance medium 
supplemented with 100 ng/ml VEGF and HGF and 10 ng/ml TGF-a (R&D Systems, 
Minneapolis,MN) added (growth medium). The growth medium was replaced every 2 days. 
Samples for RNA were collected, e.g., at 0, 2, 6, 15, 24, 48, and 96 hours of culture. The 

10 fibrin clots were placed in Trizol (Life Technologies) and disrupted using a Tissuemizer. 
Thereafter standard procedures were used for extracting the RNA (e.g., Example 1). 

Angiogenesis associated sequences thus identified are shown in Tables 1-8 . 

■ 

As indicated, some of the Accession numbers include expression sequence tags (ESTs). 
Thus, in one embodiment herein, genes within an expression profile, also termed expression 
15 profile genes, include ESTs and are not necessarily full length. 
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TABLE 1 : 



Pkey: 
Accession: 
ExAccn: 
UnigenelD: 
Unigene Title: 



Unique Eos probeset identifier number 
Accession number used for previous patent filings 
Exemplar Accession number, Genbank accession number 
Unigene number 
Unigene gene title 



Pkey 


Accession 


ExAccn 


UnigenelD 


134404 


AB000450 


AB000450 


Hs.82771 


121443 ABOO2380 


AF180681 


Hs.6582 


100082 AB003103 


AA1 30080 


Hs.4295 


132817 


AB004884 


N27852 


Hs.57553 


130150 


AF000573_ma1 


BE094848 


Hs.15113 


100104 AF008937 


AF008937 


Hs.102178 


130839 AF009301 


AB011169 


Hs.20141 


427064 


AF009368 


AF029674 


Hs.173422 


100113 


D00591 


NMJ301269 


Hs.84746 


133980 D00760 


AA294921 


Hs.250811 


100129 


D11139 


AA469369 


Hs.5831 


100154 


D14657 


H60720 


Hs.81892 


100169 


D14878 


AL037228 


Hs.82043 


101956 


D17716 


NM_002410 


Hs.121502 


100190 


D21090 


M91401 


Hs.178658 


134742 


D26135 


NNL001346 


Hs.89462 


100211 


D26528 


D26528 


Hs.123058 


100238 


D30742 


L24959 


Hs.348 


130283 


D31762 


NM 012288 


Hs.153954 


. 134237 


D31765 


D31765 


Hs.170114 


100248 


D31888 


NM.015156 


Hs.78398 


100256 


D38128 


D25418 


Hs.393 


100262 


D38500 


D38500 


Hs.278468 


134329 


D38551 


N92036 


Hs.81848 


100281 


D42087 


AF091035 


Hs. 184627 


100294 


D49396 


AA331881 


Hs.75454 


100327 


D55640 


D55640 




100335 


D63391 


AW247529 


Hs.6793 


134495 


D63477 


D63477 


Hs.84087 


100338 


D63483 


D86864 


Hs.57735 


135152 


D64015 


M96954 


Hs.182741 


134269 


D79990 


NM 014737 


Hs.80905 


100372 


D79997 


NM_014791 


Hs.184339 


134304 


D80010 


BE6 13486 


Hs.81412 


100394 


D84276 


D84284 


Hs.66052 


100405 


D86425 


AW291587 


Hs.82733 


100418 


D86978 


D86978 


Hs.84790 


133154 


D87012 


D87012 


Hs.194685 


134347 


D87075 


AF164142 


Hs.82042 


128653 


D87432 


D87432 


Hs.10315 


100438 


D87448 


AA013051 


Hs.91417 


134593 


D87845 


NM 000437 


Hs.234392 


100481 


HG1098-HT1098 


X70377 


HS.121489 


100552 HG2167-HT2237 AA019521 


Hs.301946 


100591 


HG2415-HT2511 NM.004091 


Hs.231444 


cds 








100652 


HG2825-HT2949 BE613608 


Hs.142653 


100662 


HG2887-HT3031. 


_r 


A1368680 


100899 HG4660-HT5073 AL039123 


Hs.103042 


100905 HG4704-HT5146 L12260 


Hs.172816 


100945 


HG884-HT8B4 


AF002225 


Hs.180686 


100950 


HG919-HT919 


AF128542 


Hs.166846 


100964 


J00212J 


J00212 




135407 J04029 


J04029 


Hs.99936 


130149 J04031 


AW067805 


Hs.172665 


131877 


J04088 


J04088 


Hs.156346 


101016 J04543 


J04543 


Hs.78637 


134786 


L06139 


T29618 


Hs.89640 


134100 L07540 


AA460085 


Hs.171075 


134078 L08895 


L08895 


Hs.78985 


101132 L11239 


L11239 


Hs.36993 


134849 


L11353 


BE409525 


Hs.902 


106432 L13773 


AK000310 


Hs.17138 



UnigeneTitie 

vaccinia related kinase 2 

Rho guanine exchange factor (GEF) 12 

proteasome (prosome, macropain) 26S subunit, non-ATPase, 12 

tousled-like kinase 2 

homogentisate 1,2-dioxygenase (homogentisate oxidase) 
syntaxin 16 

similar to S. cerevisiae SSM4 
KIAA1605 protein 
chromosome condensation 1 

v-ral simian leukemia viral oncogene homolog B (ras related: GTP binding protein) 
tissue inhibitor of metalloproteinase 1 (erythroid potentiating activity, coilagenase inhibitor) 
KIAA0101 gene product 
D1 23 gene product 

mannosyl (alpha-1,6-)-glycoprotein beta-1,6-N-acetyl-glucosaminyItransferase 
RAD23 (S. cerevisiae) homolog B 
diacylglycerol kinase, gamma (90kD) 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 7 (RNA helicase, 52kD) 
calclum/calmodulin-dependent protein kinase IV 
TRAM-Gke protein 
KIAA0061 protein 
KIAA0071 protein 

prostaglandin 12 (prostacyclin) receptor (IP) 
postmeiotic segregation increased 2-like 4 
RAD21 (S. pombe) homolog 
KIAA01 18 protein 
peroxiredoxin 3 

gb:Human monocyte PABL (pseudoautosomal boundary-like sequence) mRNA, clone Mo2. 
platelet-activating factor acetylhydrolase, isoform lb, gamma subunit (29kD) 
KIAA0143 protein 
acetyl LDL receptor; SREC 

TIA1 cytotoxic granule-associated RNA-binding protein-like 1 
Ras association (RalGDS/AF-6) domain family 2 
KIAA0175 gene product 
fipin 1 

CD38 antigen (p45) 
nidogen 2 
KIAA0225 protein 
topoisomerase (DNA) 111 beta 

solute carrier family 23 (nucleobase transporters), member 1 

solute carrier family 7 (cationic amino acid transporter, y+ system), member 6 

topoisomerase (DNA) II binding protein 

platelet-activating factor acetylhydrolase 2 (40kD) 

cystatin D 

lysosomal 

Homo sapiens, Similar to hypothetical protein PR01722, clone MGC:15692, mRNA, complete 
ret finger protein 

Hs.816 SRY (sex determining region Y)-box 2 
microtubule-associated protein 1B 
neuregulin 1 

ubiquitin protein ligase E3A (human papilloma virus E6-associated protein, Angelman syndrome) 

polymerase (DNA directed), epsilon 

Empirically selected from AFFX single probeset 

keratin 10 (epidermolytic hyperkeratosis; keratosis palmaris et plantaris) 

methylenetetrahydrofolate dehydrogenase (NADP+ dependent), methenyltetrahydrofolate 

topoisomerase (DNA) ll alpha (170kD) 

annexin A7 

TEK tyrosine kinase, endothelial (venous malformations, multiple cutaneous and mucosal) 
replication factor C (activator 1) 5 (36.5kD) 

MADS box transcription enhancer factor 2, polypeptide C (myocyte enhancer factor 2C) 
gastrulation brain homeo box 1 
neurofibromin 2 (bilateral acoustic neuroma) 
hypothetical protein FLJ20303 
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101152 


L13800 


AI984625 


Hs.9884 


spindle pole body protein 


135397 


L14922 


L14922 


Hs.166563 


replication factor C (activator 1) 1 (145kD) 


131687 


L15189 


BE297635 


Hs.3069 


heat shock 70kD protein 9B (mortalin-2) 


101168 


L15388 


NfvU)05308 


Hs.211569 


G protein-coupled receptor kinase 5 


421155 


L16895 


H87879 


Hs.102267 


i/syl oxidase 


101226 


L27476 


AF083892 


Hs.75608 


tight junction protein 2 (zona occludens 2) 


133975 


L27624 


C18356 


Hs.295944 


tissue factor pathway inhibitor 2 


134739 


L32976 


NMJ02419 


Hs.89449 


mitogen-activated protein kinase kinase kinase 1 1 


130155 


L33404 


AA101043 


Hs.151254 


kallikrein 7 (chymotryptic, stratum comeum) 


440538 


L35263 


W76332 


Hs.79107 


mitogen-activated protein kinase 14 


132813 


L37347 


BE313625 


Hs.57435 


solute carrier family 11 (proton-coupled divalent metal ion transporters), member 2 


101294 


U0371 


AF168418 


Hs.1 16784 


thyroid hormone receptor interactor 4 


101300 


L40391 


BE535511 


Hs.74137 


transmembrane trafficking protein 


101310 


L416Q7 


L41607 


Hs.934 


glucosamine (N-acetyl) transferase 2, l-branching enzyme 


130344 


L77566 


AW250122 


Hs.154879 


DiGeorge syndrome critical region gene DGSI; likely ortholog of mouse expressed sequence 2 


embryonic lethal 








101381 


M13928 


AW675039 


Hs.1227 


aminolevulinate, delta-, dehydratase 


101668 


M14016 


AW005903 


Hs.78601 


uroporphyrinogen decarboxylase 


133780 


M14219 


AA557660 


Hs.76152 


decorin 


101396 


M15796 

1 » f 1 W » v w 


BE267931 


Hs.78996 


proliferating cell nuclear antigen 


101447 


M21305 

1 • I A— 9 WWW 


M21305 




gb:Human alpha satellite and satellite 3 junction DNA sequence. 


101458 


M22092 

■ V 9 mm mm W W mm 


M22092 




gb:Human neural eel) adhesion molecule (N-CAM) gene, exon SEC and partial cds. 


101470 

1 w 1 » 1 V 


M22898 

1 ¥ < WWW 


NM 000546 


Hs.1846 


tumor protein p53 (Li-Fraumeni syndrome) 


134604 


M22995 


NM 002884 


Hs.865 


RAP1A, member of RAS oncogene family 


101478 


M23379 


NMJ)02890 


Hs.758 


RAS p21 protein activator (GTPase activating protein) 1 


406698 

I WW W W W 


M24364 

1 » Um * WW I 


X03068 


Hs.73931 


major histocompatibility complex, class II, DQ beta 1 


133519 


M24400 


AW583062 


Hs.74502 


chymotrypsinogen B1 


131185 

J W t ■ ww 


M25753 

If li W f WW 


BE280074 


Hs.23960 


cyclinB! 


134116 


M27691 

><lfel WW • 


RB4694 


Hs.79194 


cAMP responsive element binding protein 1 


133999 

i w w- www 


M28213 

T mm m" % * W 


AA535244 


Hs,78305 


RAB2, member RAS oncogene family 


130174 

1 WW 1 ■ i 


M29550 

• V {£■ W WW W 


M29551 


Hs.151531 


protein phosphatase 3 (formerly 2B), catalytic subunit, beta isoform (calcineurin A beta) 


129963 


M29971 


M29971 


Hs.1384 


O-6-methylguanine-DNA methyitransferase 


132983 


M30269 


M30269 


Hs.62041 


nidogen (enactin) 


133900 


M31158 


M31158 


Hs.77439 


protein kinase, cAMP-dependent, regulatory, type II, beta 


101543 


M31166 


M31 166 


Hs.2050 


pentaxin-related gene, rapidly induced by IL-1 beta 


101545 


M31210 


BE246154 


. Hs.154210 


endothelial differentiation, sphingolipid G-protein-coupled receptor, 1 


101620 


M55420 


S55271 


Hs.247930 


Epsifon , IgE 


134691 


M59979 


AW382987 


Hs.88474 


prostaglandin-endoperoxide synthase 1 (prostaglandin G/H synthase and cyclooxygenase) 


133595 


M62810 


AA393273 


Hs.75133 


transcription factor 6-like 1 (mitochondrial transcription factor Mike) 


130425 


M63838 


AA243383 


Hs. 155530 


interferon, gamma-inducible protein 16 


101700 


M64710 


D90337 


Hs.247916 


natriuretic peptide precursor C 


101714 


M68874 


M68874 


Hs.211587 


phosphoiipase A2, group IVA (cytosolic, calcium-dependent) 


134246 


M74524 


D28459 


Hs.80612 


ubiquitin-conjugating enzyme E2A (RAD6 homolog) 


101760 


M80254 


M80254 


Hs.173125 


peptidylprolyi isomerase F (cyclophilin F) 


133948 


M31780 cds3 


X59960 


Hs.77813 


sphingomyelin phosphodiesterase 1, acid lysosomal (acid sphingomyelinase) 


101791 


M83822 


M83822 


Hs.62354 


cell division cycle 4-iike 


101812 


W86934 


BE439894 


Hs.78991 


DNA segment, numerous copies, expressed probes (GS1 gene) 


101813 


M87338 


NM 002914 


Hs.139226 


replication factor C (activator 1) 2 (40kD) 


133396 


M96326_rna1 


M96326 


Hs.72885 


azurocidin 1 (cationic antimicrobial protein 37) 


135152 


M96954 


M96954 


Hs.182741 


TIA1 cytotoxic granule-associated RNA-binding protein-like 1 


129026 


M98833 


AL1 20297 • 


Hs.108043 


Friend leukemia virus integration 1 


101901 


S66793 


H38026 


Hs.308 


anestin 3, retinal (X-arrestin) 


134831 


S72370 


AA853479 


Hs.89890 


pyruvate carboxylase 


134039 


S78569 


NM 002290 


Hs.78672 


laminin, alpha 4 


134395 


S79873 


AA4~56539 


Hs.8262 


lysosomal 


101975 


S83325 


AA079717 


Hs.283664 


aspartate beta-hydroxylase 


101977 


S83364 


AF112213 


Hs.184062 


putative Rab5-interacting protein 


101978 


S83365 


BE561610 


Hs.5809 


putative transmembrane protein; homolog of yeast Golgi membrane protein Yiflp (Yiplp- 


interacting factor) 






olfactory marker protein 


101998 


U01212 


U01212 


Hs.248153 


102003 


U01922 


U01922 


Hs.125565 


translocase of inner mitochondrial membrane 8 (yeast) homolog A 


102007 


U02556 


U02556 


Hs.75307 


t-complex-associated-testis-expressed Mike 


102009 


U02680 


BE245149 


Hs.82643 


protein tyrosine kinase 9 


416658 


U03272 


U03272 


Hs.79432 


fibrillin 2 (congenital contractural arachnodactyly) 


132951 


U04209 


AW821182 


Hs.61418 


microfibrillar-associated protein 1 


135389 


U05237 


U05237 


Hs.99872 


fetal Alzheimer antigen 


102048 


U07225 


U07225 


Hs.339 


purinergic receptor P2Y, G-protein coupled, 2 


130145 


U07620 


U34820 


Hs.151051 


mitogen-activated protein kinase 10 


303153 


U09759 


U09759 


Hs.246857 


mitogen-activated protein kinase 9 


420269 


U09820 


U72937 


Hs.96264 


alpha thalassemia/menta! retardation syndrome X-linked (RAD54 (S. cerevisiae) homolog)- 


102095 


U11313 


U11313 


Hs,75760 


sterol carrier protein 2 


102123 


U14518 


NfVL001809 


Hs.1594 


centromere protein A (17kO) 


102126 


U14575 


AW950870 


Hs.78961 


protein phosphatase 1, regulatory (inhibitor) subunit 8 


102133 


U15173 


AU076845 


Hs.155596 


BCL2/adenovirus E1B 19kD-interacting protein 2 


102139 


U15932 


NM 004419 


Hs.2128 


dual specificity phosphatase 5 


102162 


U18291 


AA450274 


Hs.1592 


CDC16 (cell division cycle 16, S. cerevisiae, homolog) 
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102164 


U18300 


NMJ0OO1O7 


Hs.77602 


427653 


U18383 


AA159001 


Hs.180069 


131817 


U20536 


U20536 


Hs.3280 


102200 


U21551 


AA232362 


Hs.157205 


102210 


U23028 


BE619413 


Hs.2437 


102214 


U23752 


U23752 


Hs.32964 


132811 


U25435 


U25435 


Hs.57419 


131319 


U25997 


NM.003155 


Hs.25590 


102256 


U28251jxis2 


U28251 


Hs.53237 


132316 


U28831 


U28B31 


Hs.44566 


102269 


U30245 


U30245 




exon 1. 








134365 


U32315 


AA568906 


Hs.82240 


102293 


U32439 


AF090116 


Hs.79348 


102298 


U32849 


AA382169 


Hs.54483 


102325 


U35139 


AI815867 


Hs.50130 


302344 


U36764 


BE303044 


Hs.192023 


102361 


U39400 


AA223616 


Hs.75859 


102367 


U39657 


U39656 


Hs.1 18825 


102388 


U41344 


AA362907 


Hs.76494 


102394 


U41766 


NM 003816 


Hs.2442 


129829 


U41813 


AF010258 


Hs.127428 


102251 


U41815 


NM 004398 


Hs.41706 


102409 


U43286 


BE300330 


Hs.1 18725 


133746 


U44378 


AW410035 


Hs.75862 


102423 


U44754 


Z47542 


Hs.179312 


132828 


U47011 cdsl 


AB014615 


Hs.57710 


130441 


U47077 


U63630 


Hs.155637 


102450 


U48251 


U48251 


Hs.75871 


129350 


U50535 


U50535 


Hs.1 10630 


102534 


U56833 


U96759 


Hs.198307 


130457 


U58091 


AB014595 


Hs.155976 


135065 


U58837 


AA019401 


Hs.93909 


102560 


U59289 


R97457 


Hs.63984 


102567 


U59863 


U63830 


Hs.146847 


134305 


U67122 


U61397 


Hs.81424 


102638 


U67319 


U67319 


Hs.9216 


132736 


U68019 


AW081883 


Hs.288261 


sapiens 


mad protein homolog (hMAD-3) mRNA 


133070 


U69611 


U92649 


Hs.64311 


102663 


U70322 


NMJ02270 


Hs.1 68075 


134660 


U73524 


U73524 


Hs.87465 


102735 


U79267 


AF1 11106 


Hs.3382 


102741 


1)79291 


AW959829 


Hs.83572 


101175 


U82671 cds2 


U82671 


Hs.36980 


132164 


U84573 


AI752235 


Hs.41270 


102823 


U90914 


D85390 


Hs.5057 


102826 


U91316 


NM_007274 


Hs.8679 


102831 


U91932 


AA262170 


Hs.80917 


102846 


U96131 


BE264974 


Hs.6566 


129777 


U97018 


U97018 


Hs.12451 


134161 


U97188 


AA634543 


Hs.79440 


134854 


V00503 


J03464 


Hs.1 79573 


302363 


X04327 


AW163799 


Hs.198365 


133708 


X06389 


AI018666 


Hs.75667 


125701 


X07496 


T72104 


Hs.93194 


102915 


X07820 


X07820 


Hs.2258 


134656 


X14787 


AI750878 


Hs.87409 


413858 


X15525_rna1 


NMJ01610 


Hs.75589 


102968 


X16396 


AU076611 


Hs.1 54672 


cyciohydrolase 






102971 


X16609 


X16609 


Hs.183805 


134037 


X53586 rnal 


AI808780 


Hs.227730 


103023 


X53793 


AW500470 


Hs.1 17950 


103037 


X54936 


BE018302 


Hs.2894 


130282 


X55740 


BE245380 


Hs.1 53952 


134542 


X57025 


M14156 


Hs.85112 


128568 


X60673 rnal 


H12912 


Hs.274691 


103093 


X60708 


S79876 


Hs.44926 


133606 


X62048 


U10564 


Hs.75188 


129063 


X63097 


X63094 


Hs.283822 


424460 


X63563 


BE275979 


Hs.296014 


133227 


X64037 


AW977263 


Hs.68257 


103181 


X69636 


X69636 


Hs.334731 


103184 


X69878 


U43143 


Hs.74049 


103194 


X70649 


NMJJ04939 


Hs.78580 

• 



damage-specific DNA binding protein 2 (48kD) 

nuclear respiratory factor 1 

caspase 6, apoptosis-related cysteine protease 

branched chain aminotransferase 1, cytosolic 

eukaryotic translation initiation factor 2B, subunit 5 (epsilon, 82kD) 

SRY (sex determining region Y)-box 1 1 

CCCTC-binding factor (zinc finger protein) 

stanniocalcin 1 

ESTs, Highly similar to Z169_HUMAN ZINC FINGER PROTEIN 169 [H.sapiens] 
KIAA1641 protein 

gb:Human myelomonocytic specific protein (MNDA) gene, 5' flanking sequence and complete 
syntaxin 3A 

regulator of G-protein signalling 7 
N-myc (and STAT) interactor 
necdin (mouse) homolog 

eukaryotic translation initiation factor 3, subunit 2 (beta, 36kD) 

chromosome 11 open reading frame 4 

mitogen-activated protein kinase kinase 6 

proline arginine-rich end leucine-rich repeat protein 

a disintegrin and metalloproteinase domain 9 (meltrin gamma) 

homeo box A9 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 10 (RNA helicase) 
seienophosphate synthetase 2 

MAD (mothers against decapentaplegic, Drosophila) homolog 4 

small nuclear RNA activating complex, polypeptide 1, 43kD 

fibroblast growth factor 8 (androgen-induced) 

protein kinase, DNA-activated, catalytic polypeptide 

protein kinase C binding protein 1 

Human BRCA2 region, mRNA sequence CG006 

von HippeHindau binding protein 1 

cullin 4B 

cyclic nucleotide gated channel beta 1 
cadherin 13, H-cadherin (heart) 
TRAF family member-associated NFKB activator 
ubiquitin-like 1 (sentrin) 

caspase 7, apoptosis-related cysteine protease 

Homo sapiens cDNA: FLJ23037 fis, clone LNG02036, highly similar to HSU68019 Homo 

a disintegrin and metalloproteinase domain 17 (tumor necrosis factor, alpha, converting enzyme) 

karyopherin (importin) beta 2 

ATP/GTP-binding protein 

protein phosphatase 4, regulatory subunit 1 

hypothetical protein MGC14433 

melanoma antigen, family A, 2 

procollagen-lysine, 2-oxoglutarate 5-dioxygenase (lysine hydroxylase) 2 
carboxypeptidase D 

cytosolic acyl coenzyme A thioester hydrolase 
adaptor-related protein complex 3, sigma 1 subunit 
thyroid hormone receptor interactor 13 
echinoderm microtubule-associated protein-iike 
IGF-II mRNA-binding protein 3 
collagen, type I, alpha 2 
2,3-bisphosphoglycerate mutase 
synaptophysin 
apolipoprotein A-l 

matrix metalloproteinase 10 (stromelysin 2) 

thrombospondin 1 

acid phosphatase 2, lysosomal 

methylene tetrahydrofolate dehydrogenase (NAD+ dependent), methenyltetrahydrofolate 

ankyrin 1, erythrocytic 
integrin, alpha 6 

multifunctional polypeptide similar to SAICAR synthetase and AIR carboxylase 

placental growth factor, vascular endothelial growth factor-related protein 

5' nucleotidase (CD73) 

insulin-like growth factor 1 (somatomedin C) 

adenylate kinase 3 

dipeptidylpeptidase IV (CD26, adenosine deaminase comptexing protein 2) 

wee1+ (S. pombe) homolog 

Rhesus blood group, D antigen 

polymerase (RNA) II (DNA directed) polypeptide B (140kD) 

general transcription factor IIF, polypeptide 1 (74kD subunit) 

Homo sapiens, clone IMAGE:3448306, mRNA, partial cds 

fms-re)ated tyrosine kinase 4 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 1 
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10320B 


V710 il 4 

X72841 


AW41134U 


HSJ13J4 


129698 


X749B7 


dc242144 




4 OA AGO 

131486 


X83107 


FQ6972 


HS.27o7z 


130729 


X84194 


A1963747 


u<« -lock's 


A MO^ J 

103334 


X85753 
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NMLG012bU 




132645 


X87870 


A l<?c ATA 1 

AI654712 


It. CAA<\A 

Hs.54424 


135094 


\/ A AO Aft 

X89066 


NM_QQ3304 


Hs.25Ubo/ 


103352 


X89398_cds2 


1 1 ftft ft ft o 

H09366 


Hs.78853 


103353 


v /n no no 

X89399 


V/ft ftftft ft 

X89399 


Hs.1 19274 


132173 


X89426 


O A O ft 

X89426 


Hs.41716 


j Oft n*Ti 

103371 


V/O. jI O J T 

X91247 


X91247 


Hs. 13046 


131584 


v/n j ft j n 

X91648 


AA598509 


Hs.29117 


J O A A — ■* A 

103376 


\ A AAA 

X92098 


AL036166 


Hs.323378 


103378 


4 /AO ^ ^ ft. 

X92110 


AL1 19690 


Hs.153618 


128510 


V AA, A ^ A ft 

X94703 


X94703 


Hs.296371 


j On At A ft 

103410 


^ /ft ft ft a a 

X96506 


AA1 58294 


Hs.334879 


A no A Aft 

133490 


V^O^OOO f 

X97230J 


AF022044 - 


Hs.274601 


d it** J A ft 

103438 


\ /O ft Oft o 

X98263 


AW175781 


Hs.152720 


103440 


* /ft ft o ft ft 

X98296 


X98296 


Hs.77578 


103452 


v son r" ft A 

X99584 


NMJJ06936 


Hs.85119 


133536 


Y00264 


W25797.comp Hs.177486 


135185 


Y07566 


AW404908 


Hs.96038 


118523 


Y07759 


Y07759 


Hs.170157 


134662 


Y07827 


NM 007048 


Hs.284283 


132083 


Y07867 


BE386490 


HS.279663 


103500 


Y09443 


AW408009 


Hs.22580 


134389 


Y09858 


Y09858 


Hs.82577 


132084 


Y12394 


NM.002267 


Hs.3886 


103540 


211559 


NM-002197 


Hs.154721 


133152 


Z11695 


Z11695 


Hs.324473 


103548 


Z15O05 


Z15005 


Hs.75573 


103612 


Z46261 


BE336654 


Hs.70937 


129092 


AA011243.S 


D56365 


Hs.63525 


103692 


AA018418 


AW137912 


Hs.227583 


(CACNA1F) gene, complete cds; HSP27 pseudogene, 


103695 


AA018758 


AW207152 


Hs.186600 


129796 


AA018804 


BE216319 


Hs.5807 


132258 


AA031993 


AA306325 


Hs.4311 


132683 


AA044217 


BE264633 


Hs.143638 


131887 


AA046548 


W17064 


Hs.332848 


member 1 






103723 AA057447_s 


BE274312 


Hs.214783 


453368 


AA058376 


W20296 


Hs.288178 


133260 


AA083572 


AA403045 


Hs.6906 


103765 


AA085696 


AA085696 


Hs.1 69600 


103766 


AA088744 


AI920783 


Hs.191435 


103767 


AA089688 


BE244667 


Hs.296155 


132051 


AA091284 


AA393968 


Hs.180145 


103773 


AA092700 


AI219323 


Hs.101077 


[C.elegans] 






135289 


AA092968 


AW372569 


Hs.9788 


132729 


AA094800 


AW970843 


Hs.55682 


103794 


AA100219 


AF244135 


Hs.30670 


131471 


AA1 14885 


AA164842 


Hs.192619 


134319 


AA1 29547 


BE304999 


Hs.75653 


103807 


AA133016 


AW958264 


Hs.103832 


119159 


AA149507 


AF142419 


Hs.15020 


129863 


AA151005 


BE379765 


Hs.1 29872 


103850 


AA1 87101 


AA1 87101 


Hs.213194 


103855 


AA195179.S 


W02363 


Hs.302267 


322026 


AA203138 


AW024973 


Hs.283675 


135300 


AA203645 


AA142922 


Hs.278626 


103861 


AA206236 


AA206236 


Hs.4944 


130634 


AA227621 


AI769067 


Hs.127824 


[C.elegans] 




Hs.6127 


447735 


AA248283 


AA775268 


103909 AA249611 


AA249611 


Hs.47438 


131236 AA282640 


AF043117 


Hs.24594 


134060 AA287199 


D42039 


Hs.78871 


129013 AA313990 


AA371156 


Hs.107942 


129435 AA314256 


AF151852 


Hs.111449 


103988 AA314389 


' AA314389 


Hs.42500 


104000 AA324364 


AI146527 


Hs.80475 


425284 


AA329211_s 


AF1 55568 


Hs.155489 


128629 


AA399187 


AL096748 


Hs.102708 


133281 


AA421079 


AK001601 


Hs.69594 



retinoblastoma-binding protein 7 

ATP-binding cassette, sub-family E (OABP), member 1 

BMX non-receptor tyrosine kinase 

acylphosphatase 1 > erythrocyte (common) type 

cyclin-dependent kinase 8 

hepatocyte nuclear factor 4, alpha 

transient receptor potential channel 1 

uracil-DNA glycosylase 

RAS p21 protein activator (GTPase activating protein) 3 (lns(1,3,4,5)P4-binding protein) 

endothelial cell-specific molecule 1 

thioredoxin reductase 1 

purine-rich element binding protein A 

coated vesicle membrane protein 

HCGVIil-1 protein 

RAB28, member RAS oncogene family 

DR1-associated protein 1 (negative cofactor 2 alpha) 

killer cell immunoglobulin-iike receptor, three domains, long cytoplasmic tail, 1 

M-phase phosphoproteln 6 

ubiquitin specific protease 9, X chromosome (Drosophila fat facets related) 

SMT3 (suppnsssor of mif two 3, yeast) homolog 1 

amyloid beta (A4) precursor protein (protease nexin-ll, Alzheimer disease) 

Ric (Drosophila)-like, expressed In many tissues 

myosin VA (heavy polypeptide 12, myoxin) 

butyrophilin, subfamily 3, member A1 . 

Pirin 

alkylglycerone phosphate synthase 
spindltn-like 

karyopherin alpha 3 (importin alpha 4) 
aconitasel, soluble 
mitogen-activated protein kinase 1 
centromere protein E (312kD) 
H3 histone family, member A 
poly(rC)-binding protein 2 

Homo sapiens chromosome X map Xp1 1 .23 L-type calcium channel alpha-1 subunit 
complete sequence; and JM1 protein, JM2 protein, and Hb2E genes, complete cds 
ESTs 

GTPase Rab14 

SUMO-1 activating enzyme subunit 2 
WD repeat domain 4 

SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, 

Homo sapiens cDNA FLJ14041 fis, clone HEMBA1005780 
Homo sapiens cDNA FLJ11968 fis, clone HEMBB1001133 
Homo sapiens cDNA: FLJ23197 fis, clone REC00917 
KIAA0826 protein 
ESTs 

CG1-100 protein 
HSPC030 protein 

ESTs, Weakly similar to T22363 hypothetical protein F47G9.4 - Caenorhabditis elegans 

hypothetical protein MGC10924 similar to Nedd4 WW-binding protein 5 

eukaryotic translation initiation factor 3, subunit 7 (zeta, 66/67kD) 

hepatocellular carcinoma-associated antigen 66 

KIAA1600 protein 

fumarate hydratase 

similar to yeast Upf3, variant B 

homolog of mouse quaking QKI (KH domain RNA binding protein) 
sperm associated antigen 9 
hypothetical protein MGC10895 
hypothetical protein FU10330 
NPD009 protein 

Arg/AbMnteracting protein ArgBP2 
hypothetical protein FLJ12783 

ESTs, Weakly similar to T28770 hypothetical protein W03D2.1 - Caenorhabditis elegans 

Homo sapiens cDNA: FLJ23020 fis, clone LNG00943 
SH3 domain binding glutamic acid-rich protein 
ubiquitination factor E4B (homologous to yeast UFD2) 
mesoderm development candidate 2 
DKFZP564M1 12 protein 
CGI-94 protein 
ADP-ribosylation factor-like 5 

polymerase (RNA) II (DNA directed) polypeptide J (13.3kD) 
NS1-associated protein 1 
DKFZP434A043 protein 
high-mobility group 20A 
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104104 M422029 
[H.sapiens] 
108154 AA425230 
132091 AA447052 
135073 AA452000 
131367 AA456687 
129593 AA4B7015_s 
135266 AB002326 
133505 C01527 
132064 C01714 
134393 C01811J 
131427 C02352.S 
133435 C02375 
104282 C14448 
134827 D16611_s 
130443 D25216 
131742 D31352 
132837 D58024_s 
130377 D80897 
104334 D82614 
134593 D87845 
134731 D89377 i 
129913 H06583 
131670 H40732 
104394 H46617 
104402 H56731 
129781 H75570 
129077 H78886 
104417 H81241 
134927 L36531 
129280 M63154 
134498 M63180 
104460 M91504 
104488 N56191 
131248 N78483 

129214 N79268 
130017 R14652 
104530 R20459 
104534 R22303 
sequence. 
104544 R33779 
133328 R36553 
104567 R64534 
128562 R66475 
129575 R70621 
130776 R79356 
104599 R84933 
104660 RC.AA007160 
104667 RC_AA007234_s AI239923 
104718 RC_AA018409 AI143020 
104764 RC_AA025351 AI039243 

104786 RC.AA027168 AA027167 

104787 RC.AA027317 AA027317 
similar to contains Alu repetitive element;, 
134079 RC_AA029423 AK001751 
104804 RCLAA031357 
104865 RCLAA045136 
130828 RC_AA053400 
104907 RC_AA055829 
WARNING ENTRY [H.sapiens] 
104943 RC_AA065217 AF072873 
105013 RC_AA116054 
105024 RCJ\A126311 
132592 RC_AA129390 
105038 RC.AA130273 
105077 RC_AA142919 
105096 RC.AA1 50205 

129215 RC AA176867 
105169 RC AA180321 
132796 RCLAA180487 
130401 RCAA187634 
105200 RC_AA195399 
130114 RC.AA234717 
105330 RC_AA234743 
105337 RCAA234957 
129385 RC_AA235604 



AA422029 Hs.143640 ESTs, Weakly similar to hyperpolarization-activated cyclic nucleotide-gated channel hHCN2 



NMJ305754 

AW954243 

W55956 

AI750575 

AI338247 

R41179 

AI630124 

AA121098 

W52642 

AF151879 

AI929357 

C14448 

BE314037 

D25216 

AA961420 

AA370362 

NMJM909 

D82614 

NM_000437 

D89377 

NM.001310 

H03514 

AA129551 

H56731 

AA306090 

N74724 

A1819448 

L36531 

M63154 

AW246273 

AW955705 

N56191 

AI038989 

AL044335 

AK000096 

AK001676 

R22303 

AI091173 

AW452738 

AA040620 

AA923382 

F08282 

AF167706 

AW815036 

BE298665 



AI858702 
T79340 
AW631469 
AA055829 



H63789 

AA1 26311 

AW803564 

AW503733 

W55946 

AL042506 

AB040930 

BE245294 

NM_006283 

BE396283 

AA328102 

AA233393 

AW338625 

AI468789 

AA172106 



Hs.220689 Ras-GTPase-activating protein SH3-domain-binding protein 

Hs.170218 KIAA0251 protein 

Hs.94030 Homo sapiens mRNA; cDNA DKFZp58BE1624 (from done DKFZp586E1 624) 

Hs. 1 73933 nuclear factor l/A 

Hs.98314 Homo sapiens mRNA; cDNA DKFZp586L0120 (from clone DKFZp586L01 20) 

Hs.97393 KIAA0328 protein 

Hs.324504 Homo sapiens mRNA; cDNA DKFZp586J0720 (from clone DKFZp586J0720) 

Hs.3838 serum-induclble kinase 

Hs.8261 hypothetical protein FLJ22393 

Hs.26706 CGI-121 protein 

Hs.323966 Homo sapiens clone H63 unknown mRNA 

Hs.332338 EST 

Hs.89866 coproporphyrinogen oxidase (coproporphyria, harderaporphyria) 

Hs.155650 KIAA0014 gene product 

Hs.31433 ESTs 

Hs.57958 EGF-TM7-latrophilin-re!ated protein 

Hs.155182 KIAA1036 protein 

HsJB771 phosphoglycerate kinase 1 

Hs.234392 platelet-activating factor acetylhydrolase 2 (40kD) 

Hs.89404 msh (Drosophila) homeo box homolog 2 

Hs. 1 331 3 cAMP responsive element binding protein-like 2 

Hs.10130 ESTs 

Hs.172129 Homo sapiens cDNA: FLJ21409 fis, done COL03924 

Hs.132956 ESTs 

Hs.1 24707 ESTs 

Hs.108479 ESTs 

Hs.320861 KruppeWike factor 8 

Hs.91296 integrin, alpha 8 

Hs.1 10014 gastric intrinsic factor (vitamin B synthesis) 

Hs.84131 threonyl-tRNA synthetase 

Hs.62604 Homo sapiens, clone IMAGE:4299322, mRNA, partial cds 

Hs.1 0651 1 protocadherin 1 7 

Hs.332633 Bardet-Biedl syndrome 2 

Hs.109526 2inc finger protein 198 

Hs.143198 inhibitor of growth family, member 3 

Hs.12457 hypothetical protein FLJ10814 

gb:yh26b09.r1 Soares placenta Nb2HP Homo sapiens cDNA clone iMAGE:1 30841 5', mRNA 

Hs.222362 ESTs, Weakly similar to p40 [H.sapiensl 

Hs.265327 hypothetical protein DKFZp761M41 

Hs.5672 hypothetical protein AF140225 

Hs.101490 ESTs 

Hs.278428 progestin induced protein 

Hs.1 9280 cysteine-rich motor neuron 1 

Hs.151251 ESTs 

Hs.14846 Homo sapiens mRNA; cDNA DKFZp564D016 (from clone DKFZp564D016) 

Hs.30098 ESTs 

Hs.36250 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 

Hs.278585 ESTs 

Hs.10031 KIAA0955 protein 

gb:ze97d11.s1 SoaresJetalheart_NbHH19W Homo sapiens cDNA clone IMAGE:366933 3' 

mRNA sequence. 

Hs.171835 hypothetical protein FU10889 

Hs.31803 ESTs, Weakly similar to N-WASP [H.sapiens] 

Hs.22575 B-cell CLL/lymphoma 6, member B (zinc finger protein) 

Hs.203213 ESTs 

Hs.196701 ESTs, Weakly similar to ALULHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.1 14218 frizzled (Drosophila) homolog 6 

Hs.296288 ESTs, Weakly similar to KIAA0638 protein [H.sapiensl 

Hs 9879 ESTs 

Hs!288850 Homo sapiens cDNA: FLJ22528 fis, clone HRC12825 

Hs.9414 KIAA1488 protein 

Hs.234863 Homo sapiens cDNA FLJ12082 fis, clone HEMBB1002492 

Hs.21599 Kruppel-like factor 7 (ubiquitous) 

Hs.126085 KIAA1497 protein 

Hs.180789 S164 protein 

Hs.173159 transforming, acidic coiled-coil containing protein 1 

Hs.173987 eukaryotic translation initiation factor 3, subunit 1 (alpha, 35kD) 

Hs.24641 cytoskeleton associated protein 2 

Hs.14992 hypothetical protein FLJ11151 

Hs.22120 ESTs 

Hs.23200 myotubularin related protein 1 

Hs.1 10950 Rag C protein 
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105376 RCAA236559 
105397 RC AA242868 
131962 RC AA251776 
131991 RC.AA251909 
128658 RC AA252672_s 
105489 RC M256157 
105508 RC AA256680 
105539 RC.AA258873 
135172 RC AA262727 
131569 RC_AA281451 
132542 RCJW281545 
105643 RC AA282069 
105659 RCLAA283M4 
105666 RC AA283930 
105674 RC_AA284755 
105709 RO.AA291268 
105722 RC AA291927 
105765 RC.AA343514 
115951 RC_AA398109 
105962 RC.AA405737 
105985 RC_AA406610 
gb:X02067 

106008 RC_AA411465 
131216 RC_AA416886 
134222 RC_AA424013 
113689 RC.AA424148 
106141 RC AA424558 
130839 RC_AA424961_s 
106157 RCJ\A425367 
130777 RC.AA425921 
receptor 

130561 RC.AA426220 
106196 RC.AA427735 
WARNING 

131878 RC AA430673 
133200 RC.AA432248 
106302 RC_AA435896 
106328 RC.AA436705 
450534 RC_AA446561 
106423 RC.AA448238 
133442 RC_M448688 
439608 RC_AA449756 
106477 RC_AA450303 
106503 RC.AA452411 
446999 RC.AA454566 
106543 RCJ\A454667 
130010 RC_AA456437 
106589 RCAA456646 
106593 RC.AA456826 
106596 RC.AA456981 
CONTAMINATION 
134655 RC_AA458959 
member 1 

106636 RC_AA459950 
106654 RC_AA460449 
131353 RC.AA463910 
106707 RC.AA464603 
131710 RC.AA464606 
106717 RC.AA465093 
131775 RC_AA465692 
106747 RC.AA476473 
106773 RC_AA478109 
106781 RCJWJ78474 
106817 RC_AA480889 
106846 RC.AA485223 
106848 RC.AA485254 
106856 RC.AA486183 
418699 RC_AA496936 
WARNING 

107001 RC_AA598589 
130638 RC.AA598831J 
107054 RC_AA600150 
107059 RC_AA608545 
107080 RC.AA609210 
107115 RCJ\A61O108 
107130 RC.AA620582 



AW994032 Hs.8768 hypothetical protein FLJ10849 

AA814807 Hs.7395 hypothetical protein FU23182 

AK000046 Hs.267448 hypothetical protein FLJ20039 

AF053306 Hs.36708 budding uninhibited by benzimidazoles 1 (yeast homolog), beta 

BE397354 Hs.324830 diptheria toxin resistance protein required for diphthamide biosynthesis (Saccharornyces)-like 2 

AA256157 Hs.241 15 Homo sapiens cDNA aJ14178 fis, done NT2RP2003339 

AA173942 Hs.326416 Homo sapiens mRNA; cDNA DKFZp564H1916 (from clone DKFZp564H1916) 

AB040884 Hs.109694 KIAA1451 protein 

AB02B956 Hs.12144 KIAA1033 protein 

AL389951 Hs.271623 nucieoporin 50kD 

AL137751 Hs.263671 Homo sapiens mRNA; cDNA DKFZp434l0812 (from clone DKFZp434l0812); partial cds 

BE621719 Hs.173802 KIAA0603 gene product 

AA283044 Hs.25625 hypothetical protein FU1 1 323 

AA426234 Hs.34906 ESTs, Weakly similar to T1 7210 hypothetical protein DKFZp434N041.1 [H.sapiens] 

AI609530 Hs.279789 histone deacetytase 3 

AI928962 Hs.26761 DKFZP586L0724 protein 

AI922821 Hs.32433 ESTs 

AA299688 Hs.24183 ESTs 

BE546245 Hs.301048 sec13-like protein 

AW880358 Hs.339808 hypothetical protein FU10120 

AA406610 gb:zv15b10.s1 Soares__NhHMPu_S1 Homo sapiens cDNA clone IMAGE:753691 3' similar to 

AB033888 Hs.8619 SRY (sex determining region Y)-box 18 

AI81 5486 Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HEP08257 

AW855861 Hs.8025 Homo sapiens clone 23767 and 23782 mRNA sequences 

AB03785O Hs.16621 DKFZP434I1 16 protein 

AF031463 Hs.9302 phosducin-like 

AB01 1 1 69 Hs.20141 similar to S. cerevisiae SSM4 

W37943 Hs.34892 KIAA1 323 protein 

AW1 35049 Hs.285418 Homo sapiens cDNA FLJ10643 fis, clone NT2RP2005753, highly similar to Homo sapiens 1-1 

AB011095 Hs.16032 KIAA0523 protein 

AA525993 Hs.173699 ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

AA083764 Hs.6101 hypothetical protein MGC3178 

AB037715 Hs.183639 hypothetical protein FLJ10210 

AA398859 Hs.1 8397 hypothetical protein FU23221 

AL079559 Hs.28020 KIAA0766 gene product 

AI570189 Hs.25132 KIAA0470 gene product 

AB020722 Hs.16714 Rho guanine exchange factor (GEF) 15 

AL1 37663 Hs.7378 Homo sapiens mRNA; cDNA DKFZp434G227 (from clone DKFZp434G227) 

AW864696 Hs.301732 hypothetical protein MGC5306 

R23324 Hs.41693 DnaJ (Hsp40) homolog, subfamily B, member 4 

AB033042 Hs.29679 cofactor required for Sp1 transcriptional activation, subunit 3 (130kD) 

AA151520 Hs.334822 hypothetical protein MGC4485 

AA676939 Hs.69285 neuropiltn 1 

AA301 1 16 Hs. 142838 nucleolar phosphoprotein Nopp34 

AK000933 Hs.28661 Homo sapiens cDNA FLJ10071 fis, clone HEMBA1001702 

AW296451 Hs.24605 ESTs 

AA452379 Hs.293552 ESTs, Moderately similar to ALU7_HUMAN ALU SUBFAMILY SQ SEQUENCE 

AF265208 Hs.123090 SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily f, 



AW958037 

AW075485 

AW754182 

AK000566 

NM_015368 

AA600357 

AB014548 

NM.007118 

AA478109 

AA330310 

D61216 

AB037744 

AA449014 

W58353 

BE539639 

AI926520 

AW021276 

AI076459 

BE614410 

AL1 22043 

BE379623 

AB033106 



Hs.286 - ribosomal protein L4 

Hs.286049 phosphoserine aminotransferase 

gb:RC2-CT032M31199-011-c01 CT0321 Homo sapiens cDNA, mRNA sequence 

Hs.98135 hypothetical protein FLJ20559 

Hs.30985 pannexin 1 

Hs.239489 TIA1 cytotoxic granule-associated RNA-binding protein 

Hs.31921 KIAA0648 protein 

Hs.171957 triple functional domain (PTPRF interacting) 

Hs.188833 ESTs 

Hs.24181 ESTs 

Hs.18672 ESTs 

Hs.34892 KIAA1323 protein 

Hs. 1 21 025 chromosome 1 1 open reading frame 5 

Hs.285123 Homo sapiens mRNA full length insert cDNA done EUROIMAGE 2005779 

Hs.173030 ESTs, Weakly similar to ALU8 JttJMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 

Hs.3101 6 putative ONA binding protein 

Hs.17121 ESTs 

Hs.15978 KIAA1272 protein 

Hs.23044 RA051 (S. cerevisiae) homolog (E coii RecA homolog) 

Hs.19221 hypothetical protein DKFZp566G1424 

Hs.27693 peptidylprolyl isomerase (cyclophilin)-like 1 

Hs.12913 KIAA1280 protein 
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107156 RC_AA621239 
107174 RC AA621714 
130621 RC.AA621718 
107190 RC D19673 
132626 RCJ)25755 s 
107217 RC D51095 
131610 RC.D60272J 
129604 T08879 
107295 T34527 
<Ga!NAc-T1) 
107299 T4Q327_s 

107315 T62771J5 

107316 T63174_s ' 
107328 T83444 
107334 T93641 
134715 U48263 
128636 U49065 
129938 U79300 
107375 U88573 
130074 U93867 
107387 W01094 
132036 W01568 
107426 W26853 
113857 W27179 
135388 W27965 
130419 W36280J5 
107469 W47063 
132616 W79060 
107506 W88550 
132358 X60486 
107522 X78931.S 
125827 Z14077.S 
107582 RC_AA002147 
107609 RC_AA004711 
107661 RC_AA010383 
107714 RC AA015761 
107775 RC_AA018772 
107832 RC_AA021473_ 
sequence. 

107859 RC.AA024835 
124337 RC.AA025858 
107914 RC.AA027229 
[C.elegans] 

107935 RC.AA029428 
116262 RC_AA035143 
131461 RC.AA035237 
m007 RC.AA039347 
108029 RCJVA040740 
108040 RC_AA041551 
member 1 

108084 RC.AA045513 
108088 RC_AA045745 
108168 RC.AA055348 
130719 RC AA056582 

108189 RC.AA056697 

108190 RC_AA056746 
108203 RC.AA057678 

108216 RC_AA058681 

108217 RC_AA058686 
108245 RC_AA062840 
108277 RC.AA064859 
mRNA 

108280 RC_AA065069 
108309 RCJVA069923 
133739 RCJVA070799. 
108340 RC.AA070815 
108403 RC.AA075374 
3', mRNA sequence. 
108427 RC.AA076382 
3', mRNA sequence. 
108435 RC.AA078787 
108439 RC_AA078986 
3', mRNA sequence. 
108465 RC_AA079393 
108469 RC_AA079487 
sequence 



AA137043 

BE1 22762 

AW513087 

AA836401 

AW504732 

AL080235 

AA357879 

AF088886 

AA186629 

BE277457 
AA316241 
T63174 
AW959891 
T93597 
U48263 
U49065 
AW003668 
BE011845 
AL038596 
D86983 
AL1 57433 
W26853 
AW243158 
W27965 
AF037448 
W47063 
BE262677 
AB028981 
NMJK33542 
X78931 
NMJ03403 
AA002147 
R75654 
AA010383 
AA015761 
AW008846 
r AA021473 

AW732573 

N23541 

AA027229 

AA02942B 

AI936442 

AA992841 

AA039347 

AA040740 

AL121031 

AA058944 
AA045745 
AI453137 
s AA679262 
AW376061 
AA056746 
AW847814 
AA524743 
AA058686 
BE410285 
AA064859 

AA065069 
AA069818 
s BE536554 
AA069820 
AA075374 

AA076382 

T82427 
AA078986 

AA079393 
AA079487 



Hs.9663 programmed cell death 6-interacting protein 

Hs.25338 ESTs 

Hs.16803 LUC7 (S. cerevisiae)-like 

Hs.5103 ESTs 

Hs.21275 hypothetical protein FLJ11011 

Hs.35861 DKFZP586E1621 protein 

Hs.29423 scavenger receptor with C-type lectin 

Hs.11590 cathepsinF 

Hs.801 20 UDP-N-acetyl-alpha.D-5alactosamine:polypeptide N-acetylgalactosamlnyltransferase 1 

Hs.30661 hypothetical protein MGC4606 

Hs.90691 nucleophosmin/nucleoplasmin 3 

Hs.1 93700 Homo sapiens mRNA; cDNA DKFZp586l0324 (from clone DKFZp586l0324) 

Hs.76591 KIAA0887 protein 

Hs.187429 ESTs 

Hs.89040 prepronociceptin 

Hs. 102865 interleukin 1 receptor-like 2 

Hs.135587 Human clone 23629 mRNA sequence 

Hs.251 064 high-mobility group (nonhistone chromosomal) protein 14 

Hs.250745 polymerase (RNA) III (DNA directed) (62kD) 

Hs.1 18893 Melanoma associated gene 

Hs.37706 hypothetical protein DKFZp434E2220 

Hs.291003 hypothetical protein MGC4707 

Hs.5297 DKFZP564A2416 protein 

Hs.99865 epimorphin 

Hs.155489 NS1-assoclated protein 1 

Hs.94668 ESTs 

Hs.283558 hypothetical protein PR01855 

Hs.8021 KIAA1058 protein 

Hs.46423 H4 histone family, member G 

Hs.99971 zinc finger protein 272 

Hs.97496 YY1 transcription factor 

Hs.59952 EST 

Hs.164797 hypothetical protein FLJ13693 

Hs.60389 ESTs 

Hs.60642 ESTs 

Hs.60857 ESTs 

gb:ze66d 1 .s1 Soares retina N2MHR Homo sapiens cDNA clone IMAGE:363956 3', mRNA 

Hs.47584 potassium voltage-gated channel, delayed-rectifier, subfamily S ( member 3 

Hs.281561 Homo sapiens cDNA: FU23582 fis, clone LNG13759 

Hs.61329 ESTs, Weakly similar to T16370 hypothetical protein F45E12.5 - Caenorhabditis elegans 

Hs,61555 ESTs 

Hs.59838 hypothetical protein FLJ 10808 

Hs.27263 KIAA1458 protein 

Hs.61916 EST 

Hs.62007 ESTs. 

Hs.159971 SWl/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily b, 

Hs.1 16602 Homo sapiens, clone 1MAGE:41 54008, mRNA, partial cds 

Hs.62886 ESTs 

Hs.63176 ESTs 

Hs.14235 hypothetical protein FLJ20008; KIAA1839 protein 

Hs.63335 ESTs, Moderately similar to A46010 X-(inked retinopathy protein [H.saptens] 

Hs.63338 EST 

Hs.289005 Homo sapiens cDNA: FLJ21 532 fis, clone COL06049 

Hs.44883 ESTs 

Hs.62588 ESTs 

Hs.89545 proteasome (prosome, macropain) subunit, beta type, 4 

gb:zm50f03.s1 Stratagene fibroblast (937212) Homo sapiens cDNA clone IMAGE:529085 3', 

gb:zm12e11.s1 Stratagene pancreas (937208) Homo sapiens cDNA clone 3', mRNA sequence 
gb:zm67eD3.r1 Stratagene neuroepHhelium (937231) Homo sapiens cDNA clone 5' similar to 

Hs.278270 unactive progesterone receptor, 23 kD 

Hs.1 80909 peroxiredoxin 1 

gb:zm87a01.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA clone IMAGE:544872 

gb:zm91g08.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA clone IMAGE:545342 

Hs.194101 Homo sapiens cDNA: FLJ20869 fis, clone ADKA02377 

gb:zm92h01.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA clone IMAGE:545425 

Hs.3462 cytochrome c oxidase subunit Vile 

gb:zm97f08.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA clone 3', mRNA 
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108500 RCLAA083207 

108501 RC AA083256 
gb:M33308 

108533 RCJ\A084415 
mRNA 

108562 RC AA085274 
gb:X15341 

108589 RC_AA088678 
130890 RCJVA100925 
134585 RC_AA101255 
130385 RO.AA126474 
108749 RC_AA127017 

108807 RC AA129968 

108808 RCJ\A130240 
108833 RC AA131866 
107290 RC AA132039 
108846 RC AA132983 
108857 RO.AA133250 
131474 RC_AA133583 
108894 RC AA135941 
108941 ROAA148650 
IMAGE:567202 3\ 
108968 RC_AA151110 
108996 RCJVA155754 
109001 RC_AA156125 
131183 RCJ\A156289 
109019 RC AA156997 

109022 RC AA157291 

109023 RC.AA157293 
109068 RC_AA164293. 
109072 RC AA164676 
129021 RC AA167375 
130346 RCJVA167550 
109146 RC JVA176589 
109172 RCJVA180448 
131080 RC.AA187144 
129208 RC AA189170. 
109222 RC_AA192757 
109300 RC.AA205650 
109481 RC AA233342 
109485 RC AA233472 
109516 RCAA234110 
109537 RC.D80981 
109556 RC_F01660 

109577 RC F02206 

109578 RCJ02208 
109595 RC_F02544 
109625 RCF03918 
131983 RC_F04258_s 
109648 RC_F04600 
109671 RC_F08998 
109699 RCJF09605 
109820 RC.F11115 
109933 RC_H06371 
110014 RC.H10995 
110039 RC_H11938 
110099 RC_H16568 
110107 RC H16772 
110155 RC.H18951 
110197 RCJH20859 
110223 RC.H23747 
110306 RC H38087 
110335 RC.H40331 
110342 RC.H40567 
110395 RC.H46966 
110511 RC.H56640J 
110523 RC.H57154 
110715 RCJH96712 
110754 RCJJ20814 
130132 RC_N25249 
131135 RCJI27100 
134263 RC.N39616 
110938 RC.N48982 
110983 RCJJ51957 
115062 RC_N52271 
111081 RCJJ59435 



AA083207 
AA083256 

AA084415 

AA1007S6 

A1732404 
AI907537 
D14041 
AW067800 
AA127017 
AI652236 
AA045088 
AF188527 
W27740 
AL1 17452 
AK001468 
s L46353 
AK001431 
AA148650 

AI304870 
AW995610 
AI056548 
AI611807 
AA156755 
AA157291 
AA157293 
,f AA164293 
AI732585 
AL044675 
H05769 
AA176589 
AA180448 
s NM.001955 
f AI587376 
AA1 92833 
AA418276 
AA878923 
BE619092 
AI471639 
AI858695 
AI925294 
F02206 
FO2208 
AA078629 
H29490 
AF1 19665 
H17800 
R59210 
H18013 
AW016809 
R52417 
AL1 09666 
H11938 
R44557 
AW151660 
AI559626 
AW090386 
H19836 
H38087 
H65490 
H40961 
AA025116 
H56640 
AI040384 
H96712 
AW302200 
U55936 
NMJD16569 
AW973443 
N48982 
NMJ315367 
AA253314 
A1146349 



Hs.68270 EST 



Hs.68846 

Hs.76698 

Hs.278573 

Hs. 155223 

Hs.71052 

Hs.49376 

Hs.62738 

Hs.61661 

Hs.323780 

Hs.44155 

Hs.62180 

Hs.2726 

Hs.5105 



Hs.188680 

Hs.332436 

Hs.72116 

Hs.285107 

Hs.72150 

Hs.21479 

Hs.72168 

Hs.72545 

Hs.22394 

Hs.173081 

Hs.188757 

Hs.142078 

Hs.144300 

Hs.2271 

Hs.109441 

Hs.333512 

Hs.170142 

Hs.289069 

Hs.28465 

Hs.71913 

Hs.34898 

Hs.87385 

Hs.296639 

Hs.27214 

Hs.27301 

Hs.22697 

Hs.184011 

Hs.7154 

Hs.26634 

Hs.167483 

Hs.323795 

Hs.20945 

Hs.7242 

Hs.21907 

Hs.23748 

Hs.31444 

H5.93522 

Hs.112278 

Hs.31697 

Hs.105509 

Hs.18845 

Hs.33008 

Hs.33333 

Hs.221460 

Hs.19102 

Hs.269029 

Hs.6336 

Hs.184376 

Hs.267182 

Hs.8086 

Hs.38034 

Hs.10267 

Hs.154103 

Hs.271614 



gb:zn08g12.$1 Slratagene hNT neuron (937233) Homo sapiens cDNA clone 3' similar to 
gb:zn06g09,s1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone IMAGE:546688 3', 
gb:zm26c06.s1 Stratagene pancreas (937208) Homo sapiens cDNA clone 3' similar to 
ESTs 

stress-associated endoplasmic reticulum protein 1; ribosome associated membrane protein 4 
H-2K binding factor-2 
stannlocalcin 2 
ESTs 

hypothetical protein FU20644 
ESTs 

ESTs, Weakly similar to AF174605 1 F-box protein Fbx25 [H.sapiens] 
ESTs 

DKFZP586G1517 protein 

anillin (Drosophi/a Scraps homotog), actin binding protein 
high-mobility group (nonhistone chromosomal) protein isoform l-C 
hypothetical protein FU10569 

gb:zo09e06.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens cDNA clone 

ESTs 
EST 

hypothetical protein FLJ20992 similar to hedgehog-interacting protein 

hypothetical protein FU13397 

ESTs 

ubinuciein 1 

ESTs 

ESTs 

hypothetical protein FLJ10893 
KIAA0530 protein 

Homo sapiens, clone MGC:5564, mRNA, complete cds 

EST 

EST 

endothelin 1 
MSTP033 protein 
similar to rat myomegalin 
ESTs 

hypothetical protein FLJ21016 

Homo sapiens cDNA: FLJ21869 Hs, clone HEP02442 

ESTs 

ESTs 

ESTs 

Homo sapiens potassium channel subunit (HERG-3) mRNA, complete cds 

ESTs 

ESTs 

ESTs 

pyrophosphatase (inorganic) 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens clone 24993 mRNA sequence 

Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 35907 

histone acetyltransferase 

ESTs 

ESTs 

Homo sapiens mRNA for KIAA1647 protein, partial cds 

arrestin, beta 1 

ESTs 

CTL2 gene 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to organic anion transporter 1 [H.sapiens] 
ESTs 

KIAA0672 gene product 
synaptosomal-associated protein, 23kD 
TBX3-iso protein 

RNA (guanine-7-) methyltransferase 

Homo sapiens cDNA FLJ12924 fis, clone NT2RP2004709 

M1L1 protein 

LIM protein (similar to rat protein kinase C-binding enigma) 
CGI-1 12 protein 
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111128 


RC N64139 


AW505364 


Hs.19074 


135244 


RC.N66981 


AI834273 


Hs.971 1 


111216 


RCJI68640 


AW139408 


Hs.152940 


437562 


RCLN69352 


AB001636 


Hs.5683 


131002 


RC_N95226 


AL050295 


Hs.22039 


111399 


RCJWJ0138 


AW270776 


Hs.18857 


111514 


RC.R07998 


R07998 




similar lo 






130182 RC_R08929 


BE267033 


Hs.1 92853 


111574 RCLR10307 


AI024145 


Hs.1 88526 


111804 


RC.R33354 


AA482478 


Hs.181785 


111831 


RC.R36083 


R36095 


Hs.268695 


129675 RC.R37938J 


NM.015556 


Hs.172180 


111904 RC.R39330 


Z41572 




sequence 








RC R4flR1fi «? 


AB012193 


Hs.183874 


1190/tt 


RC R411G9 s 


R49031 


Hs.22627 


13AQR7 


or R4^fiQfi 


BE613269 


Hs.21893 




or RR4554 


H24334 


Hs.26125 


119511 


RC RRR4? t > 


R68425 


Hs.13809 


119M4 




R68568 


Hs.183373 




RC R6R763 


R68857 


Hs.265499 




RC R7fl4R7 


R69751 






RC R71^R5 


H05769 


Hs.188757 




RC R71R4A 


AK002126 


Hs.1 1260 




or D7ftQ7fi 


R78376 


Hs.29733 


119719 


RC RQ9451 


R92453 


Hs.34590 


111ARA 




BE297567 


Hs.27047 


119RRR 


RC TT11R79 


AW195317 


Hs.1 0771 6 


1 O 1 000 


rc Tmn79 


AI656378 


Hs.33461 


119Q11 


rc TmnRfi 


AW732747 


Hs.13493 


11991*; 

IO££ ID 


RC T1M19 


AL035703 


Hs.4236 


119Q11 


RC T1*vM1 


T02966 


Hs.167428 


119QRA 


RC 191457 


T16971 


Hs.289014 


119QP8 


RC T91555 


H11257 


Hs.22968 


133376 


RC T23670 


BE618768 


Hs.7232 


113026 


RC T23948 


AA376654 


Hs.183684 


113070 


RC T33464 


AB032977 


Hs.6298 


128970 


ROJ34413 


AI375672 


Hs.165028 


1 13074 


RC T34611 


AK001335 


Hs.31137 


1 13095 


RCJ40920 


AA828380 


Hs.126733 


113179 


RC_T55182 


BE622021 


Hs.152571 


113337 


RC.T77453 


T77453 


Hs.302234 


113421 


RCJT84039 


AI769400 


Hs.189729 


113454 


RC T86458 


AI022165 


Hs.16188 


113481 


RC T87693 


T87693 


Hs.204327 


131441 


RC_J89350_s 


AA302862 


Hs.90063 


113557 


ROJ90945 


H66470 


Hs.16004 


113559 


RC T90987 


T79763 


Hs.14514 


113589 


RC.T91863 


AI078554 


Hs.1 5682 


113591 


RCJT91B81 


T91881 


Hs.200597 


113619 


RC.T93783.S 


R08665 


Hs.1 7244 


113683 


RCJ96687 


AB035335 


Hs.144519 


113692 


RC T96944 


AL360143 


Hs.17936 


113702 


RCJ97307 


T97307 




mRNA 








113717 


RCJ97764 


T99513 


Hs.1 87447 


113824 


RCJV48817 


A1631964 


Hs.34447 


113840 


RCJV58343 


R72137 


Hs.7949 


113844 


RCJW59949 


AI369275 


Hs.243010 


PROTEIN TC10 






113902 


RC W74644 


AA340111 


Hs.100009 


113904 


RCJV74761 


AF125044 


Hs.19196 


113905 


RC W74B02 


R81733 


Hs.33106 


113931 


RCJV81205 


BE255499 


Hs.3496 


113932 


RC W81237 


AA256444 


Hs.126485 


131965 


RCJV9Q146J 


W79283 


Hs.35962 


114035 


RCJV92798 


W92798 


Hs.269181 


114106 


RCZ38412 


AW602528 




133593 


RC Z38709 


AI416988 


Hs.238272 


114161 


RC_Z38904 


BE548222 


Hs.299883 


424949 


RC Z39103 


AF052212 


Hs.153934 


129059 


RC Z39930J 


AW069534 


Hs.279583 


128937 


RC w Z39939 


AA251380 


Hs.10726 


WARNING 






130983 RC_Z40012_i 


AI479813 


Hs.278411 



LATS (large tumor suppressor, Drosophila) homolog 2 

novel protein 

ESTs 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 15 

KIAA0758 protein 

ESTs 

gb:yf16g11.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:127076 3' 

ubiquitin-conjugating enzyme E2G 2 (homologous to yeast UBC7) 

ESTs 

ESTs 

ESTs 

KIAA0440 protein 

gb:HSCZYB122 normalized infant brain cDNA Homo sapiens cDNA clone c-zyb12, mRNA 

cullin 4A 
ESTs 

hypothetical protein DKFZp761N0624 
ESTs 

hypothetical protein FLJ10648 

src homology 3 domain-containing protein HIP-55 

ESTs 

gb:yi40a10.s1 Soares placenta Nb2HP Homo sapiens cDNA done 3', mRNA sequence 

Homo sapiens, clone MGC:5564, mRNA, complete cds 

hypothetical protein FLJ 11264 

EST 

ESTs 

hypothetical protein aJ20392 
hypothetical protein FLJ22344 
ESTs 

like mouse brain protein E46 
KIAA0478 gene product 
ESTs 

ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.sapiens] 

Homo sapiens clone IMAGE:451939 t mRNA sequence 

acetyl-Coenzyme A carboxylase alpha 

eukar/otic translation initiation factor 4 gamma, 2 

KIAA1151 protein 

ESTs 

protein tyrosine phosphatase, receptor type, E 
ESTs 

ESTs, Highly similar to IGF-II mRNA-binding protein 2 [H.sapiens] 

ESTs 

ESTs 

ESTs 

EST 

neurocalcin delta 
ESTs 
ESTs 
ESTs 

KIAA0563 gene product 
hypothetical protein FLJ13605 
T-cell leukemia/lymphoma 6 
DKFZP434H132 protein 

gb:ye53h05.s1 Soares fetal liver spleen 1NFLS Homo sapiens cONA clone IMAGE: 121497 3\ 

ESTs 
ESTs 

DKFZP586B2420 protein 

Homo sapiens cDNA FLJ14445 fis, clone HEMBB1001294, highly similar to GTP-BINDING 

acyl-Coenzyme A oxidase 1 , palmitoyt 
ubiquitin-conjugating enzyme HBUCE1 
ESTs 

hypothetical protein MGC15749 

hypothetical protein FLJ12604; KIAA1692 protein 

ESTs 

ESTs 

gb:RC5-BT0562-260100-011-A02 BT0562 Homo sapiens cDNA, mRNA sequence 
inositol 1,4,5-triphosphate receptor, type 2 
hypothetical protein FLJ23399 

core-binding factor, runt domain, alpha subunit 2; translocated to, 2 
CGl^81 protein 

ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
NCK-associated protein 1 
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114277 RC_Z40377_s 
[C.elegans] 
114304 RC Z40820 
114364 RC Z41680 
132900 RC AA005112 
129034 RCJMQ05432 
131881 RC.AA010163 
452461 RC^AA026356 
114465 RC..AA026901 
131376 RC_AA036867 
101567 RC.AA044644 
431555 RC AA046426 
132944 RC.AA054515 
114618 RCJ\A084162 
130274 RC.AA085749 
110330 RC J AA098874 
114648 RC_AA101056 
IMAGE:548429 3' 
114658 RC_AA102746 
132456 RC.AA114250. 
131319 RC_AA126561_ 
132225 RC_AA128980_ 
1MAGE:567164 3' 
132669 RC.AA129757 
114709 RC.AA129921 
131973 RCAA133331 
114750 RCJ\A135958 
115714 RC_AA136524_ 
114763 RC.AA147044 
114767 RC_AA148885 
114774 RCJ\A150043 
129388 RC.AA151621 
129183 RC.AA155743 
128869 RC.AA156335 
130207 RC_AA156336 
114798 RC.AA159181 
114800 RC_AA159825 
[C.elegans] 

114828 RC.AA234185 
114846 RC_AA234929 
114848 RC.AA234935 
114902 RC.AA236359 
132271 RC_AA236466 
114907 RC_M236535 
135159 RC.AA236935. 
132204 RCAA236942 
114928 RC.AA237018 
132481 RC.AA237025 
114932 RC.AA242751 
314162 RC.AA242760 
131006 RC.AA242763 
114935 RC.AA242809 
WARNING 

132454 RC AA243133 
437754 RC.AA243495 
114957 RC_AA243706 
114974 RC_AA250848 
114977 RC_AA250868 
114995 RC.AA251152 
115005 RC_AA251544_ 
417177 RC_AA251792 
131889 RC.AA252063 
115026 RC_AA252144 
115045 RC.AA252524 
115068 RC_AA253461 
133138 RC.AA255522 
RECEPTOR, 
115114 RC_AA256468 
129584 RC.AA256528 
115137 RC.AA257976 
134312 RC.AA258296 

115166 RC.AA258409 

115167 RC.AA258421 
129807 RC.AA262077 
115239 RC_AA278650 
115243 RC_AA278766 



AI052229 


Hs.25373 


AI934204 


Hs.16129 


AL1 17427 


Hs.172778 


AA777749 


Hs.5978 


AA481157 


Hs.108110 


AW361018 


Hs.3383 


N78223 


nS.iQolUo 


BE621056 


Hs.131731 


AK001644 


Hs.26156 


M33552 


Hs.56729 


AI815470 


Hs.260024 


T96641 


Hs.6127 


AW979261 


Hs.291993 


AA128376 


Hs.1 53884 


AI288665 


Hs.16621 


AA101056 




AA102383 


Hs.249190 


AB011084 


Hs.48924 



s NMJ03155 
i AA128980 

W38586 
AA397651 
AB018284 
AA887211 
s T19228 
AA810755 
AI859865 
AV656017 
AA662477 
BE561824 
AA768242 
AF044209 
AA159181 
Z19448 

AA252937 
BE016662 
BE614347 
AW275480 
AB030034 
N29390 
s U43374 
AA235827 
AA237018 
W93378 
AA971436 
BE041820 
AF064104 
H23329 

BE296227 
R60366 
AW170425 
AW966931 
AW296978 
AA769266 
.s AI760825 
NMJ04458 
NMJJ02589 
AA251972 
AW014549 
AW512260 
AV657594 

AA527548 

AV656017 

AW968304 

AB011151 

AF095727 

AA749209 

Y11192 

BE251328 

AA806600 



Hs.25590 



Hs.293981 

Hs.301959 

Hs.158688 

Hs.1 29467 

Hs.172572 

Hs.88977 

Hs.1 54443 

Hs.184325 

Hs.1 10964 

Hs.273369 

Hs.80618 

Hs.144904 

Hs.54900 

Hs.131887 

Hs.283522 

Hs.166196 

Hs.169615 

Hs.39504 

Hs.1 15175 

Hs.1 3804 

Hs.95631 

Hs.42265 

Hs.94869 

Hs.49614 

Hs.1 621 8 

Hs.38516 

Hs.22116 

Hs.290880 

Hs.250822 

Hs.5822 

Hs.87680 

Hs.179662 

Hs.87787 

Hs.1 93657 

Hs.1 11339 

Hs.81452 

Hs.34073 

Hs.188718 

Hs.58373 

Hs.87767 

Hs.181161 

Hs7527 

Hs.184325 

Hs.56156 

Hs.334659 

Hs.287832 

Hs.43728 

Hs.5299 

Hs.73291 

Hs.1 16665 



ESTs, Weakly similar to T20410 hypothetical protein E02A10.2 - Caenorhabditis elegans 
ESTs 

Homo sapiens mRNA; cDNA DKFZp566P013 (from clone DKFZp566P013) 

LIM domain oniy 7 

DKFZP547E21 10 protein 

upstream regulatory element binding protein 1 

transcription factor 

hypothetical protein FLJ11099 

hypothetical protein FLJ10782 

lysosomal 

Cdc42 effector protein 3 

Homo sapiens cDNA: FU23020 fis, clone LNG0Q943 
ESTs 

ATP binding protein associated with ceil differentiation 
DKFZP434I1 16 protein 

gb:zn25b03.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens cDNA clone 

tumor necrosis factor receptor superfamily, member 10a 
KIAA0512 gene product; ALEX2 
stanniocalcin 1 

gb:zo09a1 1.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens cDNA clone 

guanine nucleotide binding protein (G protein), gamma 3, linked 
proline synthetase co-transcribed (bacterial homolog) 
K1AA0741 gene product 
ESTs 

hypothetical protein FLJ20093 

hypothetical protein dJ511E16.2 

minichromosome maintenance deficient (S. cerevisiae) 4 

CGI-76 protein 

hypothetical protein FLJ23471 

uncharacterized hematopoietic stem/progenitor cells protein MDS027 

hypothetical protein 

nuclear receptor co-repressor 1 

serologically defined colon cancer antigen 1 

ESTs, Weakly similar to T24396 hypothetical protein T03F6.2 - Caenorhabditis e\egans 

Homo sapiens mRNA; cDNA DKFZp434J1912 {from clone DKFZp434J1912) 

ATPase, Class I, type 8B, member 1 

hypothetical protein FU20989 

hypothetical protein MGC4308 

sterile-alpha motif and leucine zipper containing kinase AZK 

hypothetical protein dJ462023.2 

Human normal keratinocyte mRNA 

ESTs 

ESTs 

ESTs 

KIAA0903 protein 

Homo sapiens, clone MGC: 15887, mRNA, complete cds 
CDC14 (cell division cycle 14, S. cerevisiae) homolog B 

ESTs, Weakly similar to ALU1.HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

serine/threonine kinase 15 

Homo sapiens cDNA: FLJ22120 fis, clone HEP18874 

ESTs 

nucleosome assembly protein 1 -tike 1 

ESTs 

ESTs 

ESTs 

fatty-acid : Coenzyme A ligase, long-chain 4 

BH-protocadherin (brain-heart) 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA.FLJ 14643 fis, clone NT2RP2001597, weakly similar to RYANODINE 

small fragment nuclease 
CGI-76 protein 
ESTs 

hypothetical protein MGC14139 
myelin protein zero-like 1 
hypothetical protein 

aldehyde dehydrogenase 5 family, member A1 (succinate-semialdehyde dehydrogenase) 
hypothetical protein FLJ10881 
KIAA1842 protein 
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100850 RC AA279667. 
126884 RC AA280791 
115322 RCJVA280819 
133626 RCJVA280B28 
115372 RCJVA282195 
132825 RC.AA283127 
130269 RC.AA284694 
129192 RC AA291137 
452598 RCJ\A291708 
WARNING 

132131 RC_AA293495 
115536 RC.AA347193 
132411 RC.AA398474. 
115575 RC_AA398512 
115601 RC_AA400277 
WARNING 

103928 RC.AA400896 
125819 RC.AA404494 
115683 RC_AA410345 
115715 RC AA416733 
132952 RCJ\A425154 
115819 RC_AA426573 
132525 RC_AA431418 
115895 RC_AA436182 
132333 RCJU437099 
115962 RC.AA446585 
115967 RCJWI46887 
115974 RC.AA447224 
115985 RCJ\A447709 
129254 RC.AA453624 
133071 RC__AA455044 
116095 RC_AA456045 
122691 RC_AA460454. 
116210 RC AA476494 
116213 RC.AA476738 
134585 RC_AA481422 
134790 RCJ\A482269 
116265 RC_AA482595 
129334 RC.AA485084. 
116274 RC_AA485431_ 
303150 RC.AA489057 
129945 RC.AA489638 
116331 RC_AA491000 
116333 RC„AA491250 
132994 RCJVA505133 
134577 RCJ\A598447 
116391 RC.AA599243 
116394 RC.AA599574. 
134531 RC.AA600153 
116417 RC.AA609309 
116429 RC_AA60971O 
116439 RC_AA610068 
116459 RC.AA621399 
427505 RC_AA621752 
132699 RC_C21523 
116541 RC.D12160 
132557 RCJ)19708 
112259 RC_D25801 
116571 RC_D45652 
sequence. 

129815 RCJ)60208_f 
421919 RC_D80504_s 
116643 RC_F03010 
116661 RC_F04247 
repetitive 

116715 RC_F10966 
116729 RC_F137G0 
318709 RCJH05063 
134760 RC_H16758 
116773 RC_H17315_s 
106425 RC_H22556 
116780 RC_H22566 
131978 RC_H48459_s 
116819 RCJH53073 
111428 RC_H56559_s 
133175 RC_H57957_s 



s AA836472 
U49436 
L08895 
AW836130 
AW014385 

s U82671 
F05422 
AA286914 
AI831594 

AF069291 
AK001468 
s AA059412 
AA393254 
AA148984 

D14540 

AA044840 

AF255910 

BE395161 

AI658580 

AA486620 

AW292809 

AB033035 

AA192669 

AI636361 

AI745379 

BE513442 

AA447709 

AA252468 

BE384932 

AA043429 

s R19768 
BE622792 
AA292105 
D14041 
BE002798 
BE297412 

.s AW157022 

s AM 29767 
AA887146 
BE514376 
N4130O 
AF1 55827 
AA1 12748 
BE244323 
T86558 

i NM_006033 
AI742845 
AW499664 
AF191018 
AA251594 
R80137 
AA361562 
AW449822 
D12160 
AA1 14926 
AA337548 
D45652 

BE565817 
AJ224901 
AI367044 
R61504 

AL1 17440 

BE549407 

R52576 

NM.Q00121 

AI823410 

H24201 

H22566 

AA355925 

H53073 

AL031428 

AW955632 



Hs.297939 cathepsinB 

Hs.286236 KIAA1856 protein 

Hs.78995 MADS box transcription enhancer factor 2, polypeptide C (myocyte enhancer factor 2C) 

Hs.75277 hypothetical protein FU13910 

Hs.88678 ESTs, Weakly similar to Unknown [Rsapiens] 

Hs.57698 Empirically selected from AFFX single probeset 

Hs.1 68352 nucleoporin-like protein 1 

Hs. 183299 ESTs 

Hsl68647 ESTs, Weakly similar to ALU7.HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

Hs.40539 chromosome 8 open reading frame 1 

Hs.62180 anillin (Drosophila Scraps homolog), actin binding protein 

Hs.47986 hypothetical protein MGC1 0940 

Hs.43619 ESTs 

Hs.48849 ESTs, Weakly similar to ALU4_HUMAN ALU SUBFAMILY SB2 SEQUENCE CONTAMINATION 

Hs.1 991 60 myeloid/lymphoid or mixed-lineage leukemia (trithorax (Drosophila) homolog) 

Hs.251871 CTP synthase 

Hs.54650 junctional adhesion molecule 2 

Hs.1 390 proteasome (prosome, macropain) subunit, beta type, 2 

Hs.61426 Homo sapiens mesenchymal stem cell protein DSC96 mRNA, partial cds 

Hs.41135 endomucin-2 

Hs.50727 N-acetylgtucosaminidase, alpha- (Sanfilippo disease lilB) 

Hs.51965 K1AA1209 protein 

Hs.45032 ESTs 

Hs.179520 hypothetical protein MGC10702 

Hs.42911 ESTs 

Hs.238944 hypothetical protein FLJ10631 

Hs.2681 15 ESTs, Weakly simitar to T08599 probable transcription factor CA1 50 [H.sapiens] 

Hs.1098 DKFZp434J1813 protein 

Hs.64313 ESTs, Weakly similar to AF257182 1 G-protein-coupled receptor 48 [Ksapiens] 

Hs.62618 ESTs 

Hs.172788 ALEX3 protein 

Hs.172788 ALEX3 protein 

Hs.326740 hypothetical protein MGC10947 

Hs.278573 H-2K binding factor-2 

Hs,287850 integral membrane protein 1 

Hs.55189 hypothetical protein 

Hs.4947 hypothetical protein FLJ22584 

Hs. 1 82874 g uanine nucleotide binding protein (G protein ) alpha 12 

Hs.8217 stromal antigen 2 

Hs.165998 PAI-1 mRNA-binding protein 

Hs.71616 Homo sapiens mRNA; cDNA DKFZp586N1720 (from clone DKFZp586N1720) 

Hs.203963 hypothetical protein FLJ10339 

Hs.279905 clone HQ0310 PRO0310p1 

Hs.85951 exportin, tRNA (nuclear export receptor for tRNAs) 

Hs.751 13 general transcription factor HIA 

Hs.65370 lipase, endothelial 

Hs.1 10713 DEK oncogene (DNA binding) 

Hs.1 2484 Human clone 23826 mRNA sequence 

Hs.279923 putative nucleotide binding protein, estradiol-induced 

Hs.43913 PIBF1 gene product 

Hs.302738 Homo sapiens cDNA: FLJ21425 fis. clone COL04162 

Hs.1 78761 26S proteasome-associated pad1 homolog 

Hs.55200 ESTs 

Hs.249212 polymerase (RNA) III (DNA directed) (155kD) 

Hs.5122 ESTs 

Hs.333402 hypothetical protein MGC1 2760 

gb:HUMGS02848 Human adult lung 3' directed Mbol cDNA Homo sapiens cDNA 3', mRNA 

Hs.26498 hypothetical protein FLJ21657 

Hs.1 09526 zinc finger protein 198 

Hs.1 53638 myeloid/lymphoid or mixed-lineage leukemia 2 

gb:yh16a03.s1 Scares infant brain 1NIB Homo sapiens cDNA clone 3' similar to contains Alu 

Hs. 1 70263 tumor protein p53-binding protein, 1 

Hs.1 15823 ribonudease P, 40kD subunit 

Hs.285280 Homo sapiens cDNA: FLJ22096 fis, clone HEP16953 

Hs.89548 erythropoietin receptor 

Hs.169149 karyopherin alpha 1 (importin alpha 5) 

Hs.247423 adducin 2 (beta) 

Hs.30098 ESTs 

Hs.36232 KIAA0186 gene product 

Hs.93698 EST 

Hs.174174 KIAA0601 protein 

Hs.66666 ESTs, Weakly similar to S1 9560 proline-rich protein MP4 - mouse [M.musculus] 
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4 4CQAA DP UC/imfl r> 

11oo44 RC_Ho49ob_S 


Ho49oo 


MS.oo74J4 


4 ACQ AC D/*» UC/Q70 

11oo45 RC_Hb4y7o 


AAD49530 




110032 RC_nB9030 


AI5732B3 


HS.jo4oo 


a defile n/~» mn^n 

116925 RC_H73110 


H73110 


Hs.260603 


116981 RC_H81783 


N29218 


u*. Annnn 

Hs.40290 


131768 RC_H86259 


AC005757 


Hs.31809 


117031 RC_H88353 


H88353 




contains L1 






117034 RC_H88639 


U72209 


Hs.1 80324 


132542 RCJH88675 


AL1 37751 


Hs.263671 


134403 RC H93708 s 


AA334551 


Hs.82767 


117280 RC N22107 


M18217 


Hs. 1721 29 


117344 RC N24046 


R19085 


Hs.210706 


117422 RC N27028 


AI355562 


Hs.43880 


117475 RC.N30205 


N30205 


Hs.93740 


117487 RC N30621 


N30621 


Hs.44203 


130207 RC N33258 


AF044209 


it_ 4 a Ann A 

Hs. 144904 


117549 RC_N33390 


N33390 


U_ AAA OO 

HS.44483 


117683 RCJJ40180 


N40180 




1MAGE:276387 3' similar to 




117710 RC.N45198 


N45198 


Hs.47248 


104514 RCJI45979J5 


AF164622 


Hs.1 82982 


117791 RC.N48325 


N48325 


Hs.93956 


117822 RC N48913 


AA706282 


Hs.93963 


129647 RC N49394 


AB018259 


II A A C\ A JA 

Hs.1 18140 


117895 RC.N50656 


AW450348 


Hs.93996 


[Ksapiens] 






131557 RC N50721 


AA317439 


Hs.28707 


133057 RC N53143 


AA465131 


11 A A A A. A 

Hs.64001 


118103 RC_N55326 


AA401733 


II A r\ A A A J 

Hs.184134 


118111 RC_N55493 


N55493 




mRNA 






118129 RC.N57493 


N57493 




IMAGE:277358 3*. mRNA 






118278 RC N62955 


N62955 


Hs.316433 


118329 RCJI63520 


N63520 




3', mRNA 






118336 RO.N63604 


BE327311 


Hs.47166 


132457 RCJI64166 


AB017365 


Hs. 173859 


118363 RC N64168 


AI183838 


IB A f+ AH #H n 

Hs.48938 


118364 RC.N64191 


N46114 


1 1 A A ^ A A 

Hs.29169 


118475 RC.N66845 


N66845 




similar to 






118491 RC N67135 


AV647908 


It A A J A J 

Hs.90424 


118500 RCJJ67295 


W32889 


1 1 ^ i r i AAA 

Hs.1 54329 


101663 RCJJ68399 


NMJ103528 


Hs.2178 


118584 RC..N68963 


A 1 A 1 A 0\ A AA A 

AW1 36928 




sequence 






421983 RC N69331 


AI252640 


Hs. 110364 


118661 RC N70777 


AL137554 


Hs.49927 


118684 RC_N71364_s 


N71313 


Hs. 163986 


118689 RCJJ71545J3 


AW390601 


Hs.1 84544 


118690 RC.N71571 


N71571 


1 ■ A A A ,4 J A 

Hs.269142 


118766 RC.N74456 


N74456 


li inn 

Hs.50499 


118793 RC.N75594 


N75594 


Hs.285921 


118817 RC_N79035 


AI668658 


Hs.50797 


118844 RC N80279 


AL035364 


t J" A A A A 

Hs.50891 


118919 RC_N91797 


AW452696 


Hs.1 30760 


129558 RC N92454 


AW580922 


Hs.180446 


132692 RC N94581 


AW191962 


Hs.249239 


118996 RC.N94746 


N94746 


LI- f\~7 A <\AQ 

Hs. 274 248 


119021 RC N98238 


N98238 


nS.ooloo 


119039 RC R02384 


AI160570 


Hs.252097 


119063 RC.R16833 


R 16833 


ll f» A A A A 

Hs.53106 


WARNING 






118523 RC_R41828_s 


Y07759 


Hs. 170157 


119111 RC.R43203 


T02865 


HS.izoo^l 


133970 RC R46395 


AA214228 


Hs.127751 


119146 RC.R58863 


R58863 


Hs.91815 


120296 RC_R78248 


AW995911 


Hs.299883 


119239 RCJT11483 


T11483 




sequence. 


AI692322 


Hs.65373 


119281 RCT16896 


119298 RC T23820 


NM 001241 


Hs.155478 


126502 RC T30222 


T10077 


Hs. 13453 


135073 RCJ/V15275J, 


W55956 


Hs.94030 



ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens] 
gb:ns44f05.s1 NCI CGAP Alv1 Homo sapiens cDNA clone, mRNA sequence 
ESTs 

ESTs, Moderately similar to A47582 B-cell growth factor precursor [H.sapiens] 
ESTs 

hypothetical protein 

gb:yw21a02.s1 Morton Fetal Cochlea Homo sapiens cDNA done IMAGE:252842 3' simitar to 
YY1 -associated factor 2 

Homo sapiens mRNA; cDNA DKFZp434l0812 {from done DKFZp434l0812); partial cds 
sperm spedfic antigen 2 

Homo sapiens cDNA: FLJ21409 fis, done COL03924 

Homo sapiens cDNA FLJ13182 fis, clone NT2RP3004070 

ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens] 

ESTs, Weakly similar to I38022 hypothetical protein [Ksapiens] 

ESTs 

nuclear receptor co-repressor 1 
EST 

gb:yy44d02.s1 Soares_multiple_sclerosis_2NbHMSP Homo sapiens cDNA done 

ESTs, Highly similar to similar to Cdc14B1 phosphatase [Ksapiens] 

golgin-67 

EST 

ESTs 

KIAA0716 gene product 

ESTs, Highly similar to SORL_HUMAN SORTILIN-RELATED RECEPTOR PRECURSOR 

signal sequence receptor, gamma (translocon-associated protein gamma) 

Homo sapiens clone 25218 mRNA sequence 

ESTs 

gb:yv50cO2.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA done IMAGE:246146 3', 

gb:yy54c08.s1 Soares_multip!e_sderosls_2NbHMSP Homo sapiens cDNA done 

Homo sapiens cDNA FLJ 1 1375 fis, done HEMBA100041 1 , weakly similar to ANKYRIN 
gb:yy62f01.s1 Soares^muItiple^sclerosisJNbHMSP Homo sapiens cDNA done IMAGE:278137 

HT021 

frizzled (Drosophila) homolog 7 
hypothetical protein FLJ21802 
hypothetical protein FLJ22623 

gb:za46c11.s1 Soares fetal iiver spleen 1NFLS Homo sapiens cDNA clone IMAGE:295604 3* 

Homo sapiens cDNA: FLJ23285 fis, clone HEP09071 
ESTs 

H2B histone family, member Q 

gb:UI-H-BJ1-adp-d-08-0-Ui.s1 NCI_CGAP_Sub3 Homo sapiens cDNA clone 3', mRNA 

peptidylprolyl isomerase C (cyclophilin C) 
protein kinase NYD-SP15 

Homo sapiens cDNA: FLJ22765 fis, done KAIA1180 
Homo sapiens, clone IMAGE:3355383, mRNA, partial cds 
ESTs 
EST 

ESTs, Moderately similar to T47135 hypothetical protein DKFZp761L0B12.1 (Rsapiens] 
ESTs 

hypothetical protein 

myosin phosphatase, target subunit 2 

karyopherin (importin) beta 1 

collagen, type VIII, alpha 2 

hypothetical protein FLJ20758 

ESTs 

pregnancy specific beta-1 -glycoprotein 6 

ESTs, Moderately similar to ALU1JHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

myosin VA (heavy polypeptide 12, myoxin) 
EST 

hypothetical protein 
ESTs 

hypothetical protein FLJ23399 

gb:CHR90049 Chromosome 9 exon Homo sapiens cDNA clone 1 1 1-1 5* and 3', mRNA 

ESTs, Weakly similar to T02345 hypothetical protein KIAA0324 [H.sapiens] 
cyclin T2 

hypothetical protein FLJ 14753 

Homo sapiens mRNA; cDNA DKFZp586E1624 (from clone DKFZp586E1624) 
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119558 RC W38194 W38194 

132736 RC_W42414_s AW081883 
sapiens mad protein 

132173 RC W46577_s X89426 

134873 RC__W49632_s AA884471 

119650 RC_W57613 R82342 

119654 RCJV57759 W57759 
similar to 

119683 RCJV61118 W65379 

119694 RC_W65344 AA041350 

119718 RCJV69216 W69216 

133010 RC.W69379 AI287518 

119938 RC_W86728 AW014862 

120128 RC Z38499 BE379320 

120130 RC Z38630 AA045767 

120148 RC_Z39494 F028Q6 

120155 RC_Z39623 Z39623 

131486 RC_Z40071_s F06972 

120183 RC_Z40174 AW082866 

120184 RC_Z40182 Z40182 
120211 RC_Z40904 Z40904 
120245 RC AA166965 AW959615 
120247 RC_AA167500 AA167500 
120254 RC_AA169599__s W90403 

120259 RC.AA171724 AW014786 

120260 RCJ\A171739 AK000061 
120275 RC_AA177105 AA177105 
120284 RC_AA182626 AA179656 
to contains 

114056 RCJM86324 AA188175 

129507 RCJVA192099 AJ236885 

120302 RCJ\A192173 AA837098 

120303 RC_AA192415 A1216292 
120305 RC_AA192553 AW295096 
120319 RC_M194851 T57776 
133389 RC_AA195520_s AA195764 

120326 RC_AA196300 AA196300 
134272 RC AA196517 X76040 
133145 RC_AA196549 H94227 

120327 RC_AA196721 AK000292 
106686 RC_AA195729_i N66397 

120328 RC_AA195979 AA923278 
120340 RC_AA206828 AA206828 
similar to 

134292 RC.AA207123 AI906291 

131522 RC_AA214539_i AI380040 

129051 RC_M226914_s AA227068 

120375 RC.AA227260 AF028706 

120376 RC.AA227469 AA227469 
IMAGE :663732 3', mRNA sequence. 
120390 RC.AA233122 AA837093 
303876 RC_AA233334_s U64820 
dominant, ataxin 3) 

132038 RC.AA233347 AI825842 

104463 RC_AA233519 T85825 

125750 RC.AA233714 AA018515 

120396 RC_AA233796 AA1 34006 

120409 RC.AA235050J AA235050 
gb:L07077 

120414 RC_AA235704 AW137156 

120420 RC AA236031 AI128114 

120422 RC.AA236352 AL133097 
132221 RC_AA236390_s W94915 

120423 RC_AA236453 AA236453 
120435 RC_AA243370 AA243370 
120453 RC.AA250947 AA250947 

120455 RC_AA251083 AA251720 

120456 RCJ\A251113 AA488750 
120473 RC_AA251973 AA251973 
128922 RC_AA252023 AI244901 
120477 RCJ\A252414 AA252414 
120479 RC_AA252650 AF006689 
120488 RC AA255523 AW952916 
120510 RC.AA258128 AI796395 

120527 RCJ\A262105 AA262105 

120528 RC_AA262107 AI923511 



Empirically selected from AFFX single probeset 

Hs.2B8261 Homo sapiens cDNA: FLJ23037 fis, clone LNG02036, highly similar to HSU68019 Homo 

Hs.41716 endothelial cell-specific molecule 1 

Hs.90449 Human clone 23908 mRNA sequence 

Hs.79856 ESTs, Weakly similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 

gb:zd20g1 1 .s1 SoaresJetaLhearLNbHH19W Homo sapiens cDNA done IMAGE:341252 3" 

Hs.57835 ESTs 

Hs.57847 ESTs, Moderately similar to ICE4.HUMAN CASPASE-4 PRECURSOR [H.sapiens] 

Hs.92848 ESTs 

Hs.62669 Homo sapiens mRNA; cDNA DKFZp586D0923 (from clone DKFZp586D0923) 

Hs.58885 ' ESTs 

Hs.91448 MKP-1 like protein tyrosine phosphatase 

Hs.5300 bladder cancer associated protein 

Hs.65765 ESTs 

Hs.65783 ESTs 

Hs.27372 8 MX non-receptor tyrosine kinase 

Hs.65882 ESTs 

Hs.65885 EST 

Hs.66012 EST 

Hs.111045 ESTs 

Hs.103939 EST 

Hs.111054 ESTs 

Hs.192742 hypothetical protein FU12785 

Hs.101590 hypothetical protein 

Hs.78457 solute carrier family 25 (mitochondrial carrier; ornithine transporter) member 15 

gb:zp54e1 1 .s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone 3* similar 

Hs.82506 KIAA1254 protein 

Hs.112180 zinc finger protein 148 (pHZ^52) 

Hs.269933 ESTs 

Hs.96184 ESTs 

Hs. 101 337 uncoupling protein 3 (mitochondrial, proton carrier) 

Hs.191094 ESTs 

Hs.72639 ESTs 

Hs.21 1 45 hypothetical protein RG083M05.2 

Hs.278614 protease, serine, 15 

Hs.6592 Homo sapiens, clone IMAGE:2961368, mRNA, partial cds 

Hs.278732 hypothetical protein FLJ20285 

Hs.334825 Homo sapiens cDNA FLJ14752 fis, done NT2RP3003071 

Hs.290905 ESTs, Weakly similar to protease [H.sapiens] 

gb:zq80b08.s1 Stratagene hNT neuron (937233) Homo sapiens cDNA clone IMAGE:647895 3' 

Hs.81234 immunoglobulin superfamily, member 3 

Hs.239489 TIA1 cytotoxic granule-associated RNA-binding protein 

Hs. 108301 nuclear receptor subfamily 2, group C, member 1 

Hs.111227 Zic family member 3 (odd-paired Drosophila homolog, heterotaxy 1) 

gb:zr18a07.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone 

Hs.1 11460 calcium/calmodulin-dependent protein kinase (CaM kinase) II delta 

Hs.66521 Machado-Joseph disease (spinocerebellar ataxia 3, olivopontocerebellar ataxia 3, autosomal 

Hs.3776 zinc finger protein 216 

Hs.246885 hypothetical protein FLJ20783 

Hs.264482 Homo sapiens mRNA; cDNA DKFZp761 A041 1 (from clone DKFZp761 A041 1) 

Hs.79306 eukaryotic translation initiation fador 4E 

gb:zs38e04.s1 Soares_NhHMPu_S1 Homo sapiens cDNA clone IMAGE:687486 3* similar to 

Hs.181202 hypothetical protein FU10038 

Hs.1 12885 spinal cord-derived growth factor-B 

Hs.301717 hypothetical protein DKFZp434N1928 

Hs.42419 ESTs 

Hs.18978 Homo sapiens cDNA: FLJ22822 fis, clone KAIA3968 

Hs.96450 EST 

Hs.170263 tumor protein p53-btnding protein, 1 

Hs.104347 ESTs, Weakly similar to ALUCJHUMAN !!!! ALU CLASS C WARNING ENTRY III [H.sapiens] 

Hs.88414 BTB and CNC homology 1, basic leucine zipper transcription factor 2 

Hs.269988 ESTs 

Hs.9589 ubiquilin 1 

Hs.43141 DKFZP727C091 protein 

Hs. 1 1 0299 mitogen-activated protein kinase kinase 7 

Hs.63510 KIAA0141 gene product 

Hs.111377 ESTs 

Hs.4094 Homo sapiens cDNA FLJ14208 fis, clone NT2RP3003264 

Hs.104413 ESTs 
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120529 RC.AA262235 
120541 RC.AA278298 
131445 RC.AA278529J 
120544 RC.AA278721 
120562 RC_AA280036 
120569 RC.AA280648 

120571 RCJ\A280738 

120572 RC_AA280794 
129434 RC_AA280837 
130529 RCJ\A280886 
repetitive 

120575 RC.AA280934 
132635 RC.AA281535 
120591 RC_AA281797_s 
120593 RCJ\A282047 
430275 RC.AA283002 
117729 RC.AA283709 
120609 RC.AA283902 
132754 RC_AA284108 
130315 RC_AA284109 
132614 RC.AA284371 
447503 RC.AA284744J 
cds 

135376 RC.AA284784 
120621 RC_AA28484Q 
107868 RC.AA286844 
129868 RC_AA287032 
120644 RC.AA287038 

120660 RCJV\287546 
135370 RC_AA287553_s 

120661 RC_AA287556 
129116 RC AA287564 
131567 RC_AA291015_s 
120699 RC_AA291716 
100690 RCJ\A291749_s 
120726 RC_AA293656 
120737 RC_AA302430 
120745 RC_AA302809 
135192 RC_AA302820_s 
120750 RC.AA310499 
120761 RC_AA321890 

120768 RC_AA340589 

120769 RC.AA340622 
135232 RC.AA342457J 
CONTAMINATION 
133439 RC_AA342828_s 
120793 RCJ\A342864 
120796 RC.AA342973 
120809 RC.AA346495 
repeat, mRNA sequence. 
132459 RC_AA347573 
120825 RC_AA347614 
120827 RC_AA347717 
120839 RC.AA348913 
repeat, mRNA sequence. 
120850 RC.AA349647 
120852 RC.AA349773 
128852 RC_AA350541_s 
135240 RC_AA357159_i 
120870 RC_AA357172_i 
WARNING 

134637 RC_AA369856_s 
120894 RC_AA370132 
131854 RC_AA370472__s 
120897 RC.AA370867 
120915 RC_AA377296 

120935 RC_AA383902 
WARNING 

120936 RC_AA385934 

120937 RC.AA386255 

120938 RC_AA386260 
129722 RC.AA386266 
120960 RCJ\A398014 
120985 RC_AA398222 
120988 RC_AA398235 



AI434823 Hs.104415 ESTs 

W07318 Hs.240 M-phase phosphoprotein 1 

NM.014264 Hs.172052 serine/threonine kinase 18 

BE548277 Hs.103104 ESTs 

BE244580 Hs.302267 hypothetical protein FLJ10330 

AA807544 Hs.24970 ESTs, Weakly similar to B34323 GTP-binding protein Rab2 (H.sapiens] 

AB037744 Hs.34892 KIAA1323 protein 

H39599 Hs.294008 ESTs 

AW967495 Hs.186644 ESTs 

AA178953 gbzp39e03.s1 Stratagene muscle 937209 Homo sapiens cDNA clone 3' similar to contains Alu 

AW978022 Hs.238911 hypothetical protein DKFZp762E1511; K1AA1816 protein 

AB020686 Hs.54037 ectonucleotide pyrophosphatase/phosphodiesterase 4 (putative function) 

AF078847 Hs.191356 general transcription factor HH, polypeptide 2 (44kD subunit) 

AA748355 Hs.193522 ESTs 

Z1 1773 Hs.237786 zinc finger protein 187 

AA306166 Hs.7145 calpain 7 

AW978721 Hs.266076 ESTs, Weakly similar to A46010 X-linked retinopathy protein [Rsapiens] 

AI752244 Hs.75309 eukaryotic translation elongation factor 2 

AI241084 Hs.154353 nonselective sodium potassium/proton exchanger 

AA284371 Hs.118064 similar to rat nuclear ubiquitous casein kinase 2 

AA1 15496 Hs.336898 Homo sapiens, Similar to RIKEN cDNA 1810038N03 gene, clone MGC:9890, mRNA, complete 

BE6 17856 Hs.99756 mitochondrial ribosome recycling factor 

AW961294 Hs.143818 hypothetical protein FU23459 

AA286844 Hs.61260 hypothetical protein FU13164 

AW172431 Hs.13012 ESTs 

AI869129 Hs.96616 ESTs 

AA286785 Hs.99677 ESTs 

BE6221 87 Hs.99670 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 

AA287556 Hs.263412 ESTs, Weakly similar to ALUBJ-IUMAN IHl ALU CLASS B WARNING ENTRY !!! [Rsapiens] 

AB019494 Hs.225767 IDN3 protein 

AF015592 Hs.28853 CDC7 (cell division cycle 7, S. cerevisiae, homo!o'g)-like 1 

A1683243 Hs.97258 ESTs, Moderately similar to S29539 ribosomal protein L13a, cytosolic [H.sapiens] 

AA383256 Hs.1657 estrogen receptor 1 

AA293655 Hs.97293 ESTs 

AL049176 Hs.82223 chordin-like 

AA302809 gb:EST10426 Adipose tissue, white I Homo sapiens cDNA 3' end, mRNA sequence. 

U83993 Hs.321 709 purinergic receptor P2X, ligand-gated ion channel, 4 

AI191410 Hs.96693 ESTs, Moderately similar to 21 09260A B cell growth factor [Ksapiens] 

AA321890 Hs.1265 branched chain keto acid dehydrogenase E1, beta polypeptide (maple syrup urine disease) 

AA340589 Hs.1 04560 EST 

AI769467 Hs.96769 ESTs 

AL03881 2 Hs.96800 ESTs, Moderately similar to ALU7.HUMAN ALU SUBFAMILY SQ SEQUENCE 

223091 Hs.73734 glycoprotein V (platelet) 

AA342864 Hs.96812 ESTs 

AI247356 Hs.96820 ESTs 

AA346495 gb:EST52657 Fetal heart II Homo sapiens cDNA 3' end similar to EST containing O family 

AL1 20071 Hs.48998 fibronectin leucine rich transmembrane protein 2 

AI280215 Hs.96885 ESTs 

AA382525 Hs.132967 Human EST clone 122887 mariner transposon Hsrnart sequence 

AA348913 gb:EST55442 Infant adrenal gland II Homo sapiens cDNA 3' end similar to EST containing Alu 

AA349647 Hs.96927 Homo sapiens cDNA FLJ 12573 fts, clone NT2RM4000979 

AA349773 Hs.191564 ESTs 

R40622 Hs.106601 ESTs 

AA357159 Hs.96986 EST 

AA357172 Hs.292581 ESTs, Moderately similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

U87309 Hs. 180941 vacuolar protein sorting 41 (yeast homolog) 

AA370132 Hs.97063 ESTs 

AF229839 Hs.173202 l-kappa-B-interacting Ras-like protein 1 

AA370867 Hs.97079 ESTs, Moderately similar to AF174605 1 F-box protein Fbx25 [H.sapiens] 

AL135556 Hs.97104 ESTs 

AL048409 Hs.97177 ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

AA385934 Hs.97184 EST, Highly similar to (defline not avaiiable 7499603) [C.elegans] 

AA3B6255 Hs.97186 EST 

AA386260 Hs.104632 EST 

R20855 Hs.5422 glycoprotein M6B 

AA398014 Hs.104684 EST 

AI219896 Hs.97592 ESTs 

AA398235 Hs.97631 ESTs 
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121008 RC.AA398348 
GSSsandaCpG 
21029 RC.AA398482 

21032 RC_AA398504 

21033 RC AA398505 

21034 RC.AA398507 

21035 RCJ\A398523 
21058 RC.AA398625 

21060 RC.AA398632 

21061 RC AA398633 

21091 RCJM398894 
CONTAMINATION 

21092 RC AA398895 
21094 RC.AA398900 
21096 RC.AA398904 
21115 RCAA399122 

21121 RC AA399371 

21122 RC_AA399373 
21125 RC.AA399441 
21151 RCJVA399636 
21153 RC.AA399640 
21163 RC_AA399680 
21176 RC.AA400080 
21192 RC_AA400262 
21223 RC.AA400725 
21227 RCAA400748 
21231 RCAA400780 

21278 RC.AA401631 

21279 RC AA401688 
21282 RC.AA4Q1695 
21299 RCJ\A402227 

21301 RCAA402329 

21302 RCJWJ02398 

21304 RCAA402449 

21305 RC_AA402468 
34721 RC.AA403268. 

21323 RC_AA403314 

21324 RC.AA404229 
29047 RC_AA404260 
31074 RC_AA404271 
21344 RC_AA405026 
21348 RCJ\A405182 
21350 RC.AA405237 

contains Alu 
21400 RC AA406061 

21402 RC_AA406063 

21403 RC.AA406070 
21408 RC.AA406137 
21431 RC.AA405335 
32936 RC_AA411801 
21471 RC_AA411804 
21474 RC_AA411833 
21526 RC.AA412219 
21530 RC_AA412259 

21558 RC_AA412497 
contains L1.t3L1 

21559 RCAA412498 
21584 RCJVA416586 
21609 RO.AA416867 
21612 RCJVA416874 
21737 RC AA421133 
21740 RC_AA421138 
29194 RC_AA422079 
21784 RC.AA423837 

21802 RC.AA424328 

21803 RCJVA424339 
35286 RC_AA424469 
21806 RCJ\A424502 
29517 RC.AA425004 
21845 RC.AA425734 

CONTAMINATION 
21853 RC.AA425887 
21891 RC.AA426456 
21895 RCJWJ27396 
similar to contains 
121899 RC_AA427555 



AA398348 Hs.301720 Human DNA sequence from clone RP1 1-251 J8 on chromosome 13 Contains ESTs, STSs, 



AA398482 
AA393037 
AA398505 
AL389951 
AA398523 
AA398625 
AA398632 
AA393288 
AA398894 

AA398895 
AA402505 
AA398904 
AA398187 
AA399371 
AI126713 
AL042981 
AA399636 
AA399640 
AI676062 
AL121523 
AA400262 
AI0021 10 
AA400748 
AA814948 
AA037121 
AA292873 
AA401695 
AA402227 
NM_006202 
AA402587 
AA293863 
AA402468 
s AK000112 
AA291411 
AA404229 
AI768623 
U16125 
AA405026 
AA405182 
AA405237 

AA406061 

AA406063 

AA406070 

AA406137 

AA035279 

AL120659 

AA411804 

AA402335 

AW665325 

AA778658 

AA412497 

AI192044 

AI024471 

AA416867 

AA416874 

AA421133 

AA421138 

, AA1 50797 
T90789 
AI251870 
AI33B371 

s AW023482 
AA424313 
AW972853 
AI732692 

AA425887 
AA426456 
AA427396 

R55341 



Hs.97641 EST 

Hs.161798 ESTs 

Hs.97360 ESTs 

Hs.271623 nucleoporin 50kD 

Hs.210579 ESTs 

Hs.97391 ESTs 

Hs.97395 ESTs 

Hs.97396 ESTs 

Hs.97657 ESTs, Moderately similar to ALU8JWMAN ALU SUBFAMILY SX SEQUENCE 

Hs.97658 EST 

gb:zt62h10.r1 SoaresJestisJIHT Homo sapiens cDNA clone 5', mRNA sequence 

Hs.332690 ESTs 

Hs.104682 ESTs, Weakly similar to mitochondrial citrate transport protein [H.sapiensj 

Hs.189095 similar to SALL1 (sal (Drosophila)-like 

Hs.192233 ESTs, Highly similar to T00337 hypothetical protein KIAA0568 [Rsapiens) 

Hs.251278 KIAA1201 protein 

Hs.143629 ESTs 

Hs.97694 ESTs 

Hs.1 11902 ESTs 

Hs.97774 ESTs 

Hs 190093 ESTs 

Hs!97169 ESTs, Weakly similar to dJ667H12.2.1 [H.sapiens] 

Hs.97823 Homo sapiens mRNA; cDNA DKFZp434D024 (from clone DKFZp434D024) 

Hs.96343 ESTs, Weakly similar to ALUC.HUMAN !!!! ALU CLASS C WARNING ENTRY !!! [H,sapiens] 

Hs.98518 Homo sapiens cDNA FU1 1490 fis, clone HEMBA1001918 

Hs.177996 ESTs 

Hs.97334 ESTs 

Hs.22826 tropomodulin 3 (ubiquitous) 

Hs.89901 phosphodiesterase 4A, cAMP-specific (dunce (Drosophila)-homolog phosphodiesterase E2) 

Hs.325520 LAT1-3TM protein 

Hs.97316 EST 

Hs.291557 ESTs 

Hs.89306 hypothetical protein FLJ20105 

Hs.97247 ESTs 

Hs.97842 EST 

Hs.1 08264 ESTs 

Hs.1 81581 glutamate receptor, ionotropic, kainate 1 

Hs.1 93754 ESTs 

Hs.97973 ESTs 

gb:zt06e10.s1 NCI_CGAP_GCB1 Homo sapiens cDNA done IMAGE:712362 3' similar to 

Hs.98001 EST 

Hs.98003 ESTs 

Hs.98004 EST 

Hs.98019 EST 

Hs.176731 ESTs 

Hs.61 1 1 aryl-hydrocarbon receptor nuclear translocator 2 

Hs.261575 ESTs 

Hs.1 88760 ESTs, Highly similar to Trad [H.sapiens] 

Hs.9B120 ESTs 

Hs.98122 ESTs 

gb:zt95g12.s1 SoaresJestis.NHT Homo sapiens cDNA clone IMAGE:730150 3' similar to 

Hs.104778 ESTs 

Hs.98232 ESTs 

Hs.98185 EST 

Hs.98168 ESTs 

Hs.104671 erythrocyte transmembrane protein 

Hs.98334 EST 

Hs.109276 latexin protein 

Hs.94308 RAB35, member RAS oncogene family 

Hs. 188898 ESTs 

Hs.157173 ESTs 

Hs.97849 ESTs 

Hs.98402 ESTs 

Hs.112237 ESTs 

Hs.165066 ESTs, Moderately similar to ALU2.HUMAN ALU SUBFAMILY SB SEQUENCE 

Hs.98502 hypothetical protein FLJ14303 

Hs.98469 ESTs 

gb:zw33a02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:77105Q 3' 

Hs.50421 KIAA0203 gene product 
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121917 RC.AA428218 

121918 RC_AA428242 

121919 RCJ\A428281 
121941 RC_AA428865 

5 121942 RC.AA428994 
121970 RC_AA429666 
121993 RC_AA430181 
134660 RC.AA430184 
126753 RC.AA43128B. 
10 122022 RC_AA431293 

122050 RCLAA431478 

122051 RC_AA431492 
122055 RC.AA431732 
122105 RC_AA432278 

15 122125 RCJ\A434411 
135235 RCJ\A435512_ 
122162 *RC AA435698 
129406 RCLAA435711 
318801 RC.AA435815. 

20 122186 RC.AA435842 
122235 RC.AA436475 
129131 RO.AA436489 
134664 RC.AA442060 
122310 RC.AA442079 

25 122334 RC.AA443151 
122382 RC_AA448133 
122425 RC AA447145 
122431 RC.AA447398 
122450 RC.AA447643 

30 302653 RCLAA447742 
122477 RC.AA448226 
122500 RC_AA448825 
122522 RC.AA449444 
122536 RC_AA450O87 

35 122538 RC_AA450211 
122540 RC.AA450244 
122560 RC_AA452123 
421919 RC_AA452155 
122562 RC.AA452156 

40 mRNA 

122585 RC.AA453036 
122608 RC.AA453526 
122635 RC_AA454Q85 
similar to 

45 122636 RC_AA454103 
122653 RC_AA454642 
122660 RC.AA454935 
122703 RC_AA456323 
122724 RC_AA457395 

50 122749 RC.AA458850 
122772 RCLAA459662 
131098 RC_AA459668 
129045 RC_AA459679. 
122777 RC_AA459702 

55 135362 RC_AA460017 
122798 RC_AA460324 
122837 RC.M461509 
122860 RC.AA464414 
mRNA sequence. ■ 

60 122861 RCJW464428 
122910 RC_AA470084 
132899 RCJ\A476606. 
122967 RC.AA478521 
129560 RC_AA478523 

65 123009 RC.AA479949 
128917 RC_AA481252 
123081 RC_AA485351 
123133 RCLAA487264 
123184 RCLAA489072 

70 129671 RC_AA489630 

123233 RC.AA490225 
[H.sapiens] 

123234 RCJ\A490227 
123236 RC AA490255 

75 123255 RC.AA490890 
129503 RC_AA490916 



AA406397 
BE274689 
AA428281 
AA428865 
AW452701 
AA429666 
AW297880 
s U73524 
s AA306478 
AA431293 
AI453076 
AA431492 
AA431732 
AW241685 
AK000492 
i AW298244 
AA628233 
AB018255 
s U40763 
AA398811 
AA436475 
AB026438 
AA256106 
AW192803 
BE465894 
AA446440 
AB007859 
AA447398 
AA447643 
s AJ404468 
AA448226 
AA448825 
AA299607 
AF060877 
M450211 
M476741 
AW392342 
AJ224901 
AA452156 

AI681654 
AA453525 
AA454085 

AW651706 
AW009166 
A1816827 
AA456323 
AA457395 
AA458850 
AW1 17452 
U66669 
s AI082883 
AK001022 
f AA978128 
AW366286 
AA461509 
j AA464414 

AA335721 
AA470084 
. s AA476606 
AA806187 
AA317841 
AA535244 
A1365215 
AI815486 
AA487264 
BE247767 
NMJJ14700 
AW974175 



NM_001938 
AW968504 
AA830335 
s AW768399 



Hs.98038 ESTs 

Hs.184175 chromosome 2 open reading frame 3 

Hs.98560 EST 

Hs.98563 ESTs 

Hs.293237 ESTs 

Hs.98617 EST 

Hs.98661 ESTs 

Hs.87465 ATP/GTP-binding protein 

Hs.95327 CD3D antigen, delta polypeptide (TiT3 complex) 

Hs.98716 ESTs, Moderately similar to T42650 hypothetical protein DKFZp434D0215.1 [H.sapiens] 

Hs.166109 ELAV (embryonic lethal, abnormal vision, Drosophila)-like 2 

Hs.98742 EST 

Hs.98747 EST 

Hs.98699 ESTs 

Hs.98806 hypothetical protein 

Hs.293507 ESTs 

Hs.79946 cytochrome P450, subfamily XIX (aromatization of androgens) 

Hs.111138 KIAA0712 gene product 

Hs.77965 peptidyl-prolyl isomerase G (cyclophiBn G) 

Hs.104673 ESTs 

Hs.1 1 2227 membrane-associated nucleic acid binding protein 

Hs. 1 77534 dual specificity phosphatase 1 0 

Hs.87507 ESTs 

Hs.98974 ESTs, Weakly similar to S65824 reverse transcriptase homolog [H.sapiens] 

Hs.98365 ESTs, Weakly similar to LB4D_HUMAN NADP-DEPENDENT LEUKOTR1ENE B4 12- 

Hs.98643 ESTs 

Hs.100955 KIAA0399 protein 

Hs 99104 ESTs 

Hs.112095 hypothetical protein DKFZp434F1819 

Hs.284259 dynein, axonemal, heavy polypeptide 9 

Hs.324123 ESTs 

Hs.99190 ESTs 

Hs.98969 ESTs 

Hs.99236 regulator of G-protein signalling 20 

Hs.99239 ESTs 

Hs.98279 ESTs, Weakly similar to A43932 mucin 2 precursor, intestinal [H.sapiens] 

Hs.283077 centrosomal P4. 1-associated protein; uncharacterized bone marrow protein BM032 

Hs.109526 zinc finger protein 198 

gb:zx29c03.s1 Soares Jotal_fetus_Nb2HF8_9w Homo sapiens cDNA clone IMAGE:787876 3', 

Hs.170737 hypothetical protein FLJ23251 

Hs.143077 ESTs 

gb:zx33a08.s1 Soares Jotal_fetus_Nb2HF8_9w Homo sapiens cDNA clone IMAGE:788246 3' 

Hs.99519 hypothetical protein FLJ14007 

Hs.99376 ESTs 

Hs.180069 nuclear respiratory factor 1 

Hs.269369 ESTs 

Hs.99457 ESTs 

Hs.293372 ESTs, Weakly similar to B34087 hypothetical protein [H.sapiens] 

Hs.99489 ESTs 

Hs.236642 3-hydroxyisobutyryl-CoenzymeA hydrolase 

Hs.30732 hypothetical protein FLJ13409; K1AA1711 protein 

Hs.214397 hypothetical protein FLJ10160 similar to insulin related protein 2 

Hs.99513 ESTs, Weakly similar to T17454 diaphanous-related formin - mouse [M.musculus] 

Hs.145696 splicing factor (CC1 .3) 

Hs.293565 ESTs, Weakly similar to putative p150 [H.sapiens] 

gb:zx78g01.s1 Soares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:809904 3', 

Hs.119394 ESTs 

Hs.98358 ESTs 

Hs.59666 SMAD in the antisense orientation 

Hs.289101 glucose regulated protein, 58kD 

Hs.7845 hypothetical protein MGC2752 

Hs.78305 RAB2, member RAS oncogene family 

Hs.206097 oncogene TC21 

Hs.243901 Homo sapiens cDNA FLJ20738 fis, clone HEP08257 

Hs. 154974 Homo sapiens mRNA; cDNA DKFZp667N064 (from clone DKFZp667N064) 

Hs.18166 KIAA0870 protein 

Hs.1 19004 KIAA0665 gene product 

Hs.188751 ESTs, Weakly similar to MAPB_HUMAN MICROTUBULE-ASSOCIATED PROTEIN 1B 

Hs.16697 down-regulator of transcription 1 , TBP-binding (negative cofactor 2) 

Hs. 123073 CDC2-re!ated protein kinase 7 

Hs.105273 ESTs 

Hs.112157 ESTs 
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131043 RC_AA490925 

123259 RC_AA490955 
[H.sapiens] 

123284 RC.AA495812 

123286 RCJ\A495824 

123315 RC_AA496369 
to contains 

129179 RC_AA504125 

131612 RCJM521473 

123421 RC AA59B440 

123449 RC AA598899 

129021 RC.AA599244 

132830 RC_AA599694. 

123497 RC AA600037 

123604 RC.AA609135 

129539 RC AA609582 

123712 RC AA609684 

123731 RC.AA609839 
similar to 

130725 RC.AA609862 

123800 RC.AA620423 

123841 RC.AA620747 

123929 RCJ\A621364 

123978 RC C20653 

133184 RCJD20085 

132835 RC_D20749 

132406 RC.D51285 s 

128695 RC_D59972 i 

124028 RC.F04112J 
sequence. 

124057 RC_F13604 

134899 RC_H01662 

130973 RC.H05135J 

124106 RC.H12245 

124136 RC.H22842 

124165 RC.H30894 

131229 RC_H43442_s 

mm RC H45996 

129948 RC H69281J 

134374 RC H69485J 

124254 RC_H69899 
similar to 

129056 RC_H70627_s 

100919 RC_H73050_s 

130724 RC„H73260 

100716 RC_H77531 s 

124274 RC.H80552 

129078 RC_H80737_s 

124828 RC_H93412 

124315 RC_H94892_s 

100747 RC_H95643_s 

124324 RC_H96552 

452933 RC.H97146 

132231 RC_H99131_s 

129170 RC_H99462_s 

133143 RC_H99837_s 

132963 RC_N22140 

135297 RCJM22197 

134347 RC_N23756_s 

130365 RC_N24134 

421642 RC_N24195 

439311 RC_N26739 

124383 RC_N27098 

124387 RC_N27637 

129341 RC_N33090 

129081 RC.N35967 

102827 RC.N36959J 

124433 RC.N39069 

124441 RC.N46441 

132338 RCJJ48270J 

131403 RC_N48365_s 

124466 RC.N51316 

132210 RC_N51499_s 

124483 RC_N53976 

124484 RCJJ54157 

124485 RC_N54300 



AF084535 
A1744152 

AA488988 
AA495824 
AA496369 

s AW969025 
AU076668 
AA598440 

} AL049325 
AL044675 

s NMJ14777 
AA765256 
AA609135 
T47614 
AA609684 
AA609839 

T98807 

AA620423 

AA620747 

AA621364 

T89832 

AA001021 

Z83844 

AL1 33731 " 

F04112 

AA902384 

AI609045 

AI638418 

H12245 

H22842 

H30039 

NMJH5340 

BE463721 

AI537162 

N22687 

H69899 

AI769958 

X54534 

AK001507 

X89887 

H80552 

AI351010 

AW952124 

NIVL005402 

X04588 

H96552 

AW391423 

AA662910 

AW250380 

AA094538 

AA099693 

AL1 18782 

AF164142 

W56119 

AF172066 

BE270668 

N27098 

N27637 

AI193519 

A1364933 

BE244588 

AA280319 

AW450481 

AA353868 

AI473114 

R10084 

NM 007203 

AI821780 

H66118 

AB040933 



Hs.22464 epilepsy, progressive myoclonus type 2, Lafora disease (laforin) 

Hs.283374 ESTs, Weakly similar to CA15.HUMAN COLLAGEN ALPHA 1 (V) CHAIN PRECURSOR 

Hs.293796 ESTs 

Hs.188822 ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens] 

gb:zv37d10.s1 Scares ovary tumor NbHOT Homo sapiens cDNA clone IMAGE:755827 3' similar 

Hs.109154 ESTs 

Hs.334884 SEC10 (S. cerevislae)-like 1 

Hs.291 154 EST, Weakly similar to I38022 hypothetical protein [Ksapiens] 

Hs.1 12493 Homo sapiens mRNA; cDNA DKF2p564D036 (from clone DKFZp564D036) 

Hs.173081 KIAA0530 protein 

Hs.57730 KIAA0133 gene product 

Hs.135191 ESTs, Weakly similar to unnamed protein product [H.sapiens] 

Hs.293076 ESTs 

Hs.323022 ESTs, Highly similar to p60 katanin [H.sapiens] 

Hs.112748 Homo sapiens cDNA: FLJ21543 fis, clone COL06171 

gb:ae62f01.s1 Stratagene lung carcinoma 937218 Homo sapiens cDNA clone IMAGE:951481 3' 

Hs.80248 RNA-bindlng protein gene with multiple splicing 

Hs.112862 EST 

Hs.1 12896 ESTs 

Hs.112981 ESTs 

Hs.170278 ESTs 

Hs.6685 thyroid hormone receptor interactor 8 

Hs.5790 hypothetical protein dJ37E16.5 

Hs.4774 Homo sapiens mRNA; cDNA DKFZp761C1712 (from clone DKFZp761C1712) 

Hs.101299 cullin 5 

gb:HSC2JH062 normalized infant brain cDNA Homo sapiens cDNA clone c-2]h06 3', mRNA 

Hs.73853 bone morphogenetic protein 2 

Hs.321775 hypothetical protein DKFZp434D1428 

Hs.78580 DEAD/H (Asp-Glu-AMsp/His) box polypeptide 1 

gb:ym17a12.M Soares infant brain 1NIB Homo sapiens cDNA done 3', mRNA sequence 

Hs.101770 EST 

Hs.107674 ESTs 

Hs.2450 leucyl-tRNA synthetase, mitochondrial 

Hs.971 01 putative G protein-coupled receptor 

Hs.263988 ESTs 

Hs.8236 ESTs 

gb:yu70c12.s1 Weizmann Olfactory Epithelium Homo sapiens cDNA clone IMAGE:239158 3' 

Hs.108336 ESTs, Weakly similar to ALUE__HUMAN !!!! ALU CLASS E WARNING ENTRY III [H.sapiens] 

Hs.278994 Rhesus blood group, CcEe antigens 

Hs.306084 Homo sapiens clone FLB6914 PR01821 mRNA, complete cds 

Hs.1 72350 HIR (histone cell cycle regulation defective, S. cerevisiae) homolog A 

Hs.1 02249 EST 

Hs.1 02267 lysosomal 

Hs.1 3094 presenilins associated rhomboid-like protein 

Hs.288757 v-ral simian leukemia viral oncogene homolog A (ras related) 

Hs.85844 neurotrophic tyrosine kinase, receptor, type 1 

Hs.159472 Homo sapiens cDNA: FLJ22224 fis, clone HRC01703 

Hs.288555 Homo sapiens cDNA: FLJ22425 fis, clone HRC08686 

Hs.42635 hypothetical protein DKFZp434K2435 

Hs. 1 09059 mitochondrial ribosomal protein L1 2 

Hs.272808 putative transcription regulation nuclear protein; KIAA1 689 protein 

Hs.34851 epsilon-tubulin 

Hs.300208 Sec23-interacting protein p125 

Hs.82042 solute carrier family 23 (nucieobase transporters), member 1 

Hs.155103 eukaryotic translation initiation factor 1A, Y chromosome 

Hs.1 06346 retinoic acid repressive protein 

Hs.151945 mitochondrial ribosomal protein L43 

Hs.102463 EST 

Hs 109019 ESTs 

Hs.226396 hypothetical protein FLJ11126 

Hs.168913 serine/threonine kinase 24 (Ste20, yeast homolog) 

Hs.6456 chaperonin containing TCP1 . subunit 2 (beta) 

Hs.288840 PR01 575 protein 

Hs.161333 ESTs 

Hs.182982 goIgin-67 

Hs.26455 ESTs 

Hs.1 13319 kinesin heavy chain member 2 

Hs.42322 A kinase (PRKA) anchor protein 2 

Hs.179864 ESTs 

Hs.285520 ESTs, Weakly similar to 2109260A B cell growth factor [H.sapiens] 

Hs.15420 KIAA1500 protein 
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124494 RCJI54831 

129200 RC.N59849 

124527 RC_N62132 

124532 RC.N62375 

133213 RC N63138 

124539 RC N63172 

133651 RCJJ63772 

129196 RC N63787 

124575 RC N68168 

124576 RC.N68201 

124577 RCJJ6830O 
mRNA 

124578 RC N68321 
124593 RC N69575 
128501 RC_N75007 
105691 RC N75542 
128473 RCJI9Q066 
128639 RCJ191246 
124652 RC.N92751 
133137 RC_N93214_s 
124671 RC.N99148 
PROTEIN 

133054 RC_R07876 
[C.elegans] 

130410 RC.R10865J 

124720 RC_R11056 
similar to 

124722 RC R11488 

129981 RC.R22947 



N54831 Hs.271381 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 

N59849 Hs.13565 Sam68-fike phosphotyrosine protein, T-STAR 

N79264 Hs.269104 ESTs 

N62375 Hs.102731 EST 

AA903424 Hs.6786 ESTs 

D54120 Hs.146409 cell division cycle 42 (GTP-binding protein, 25kD) 

AI301740 Hs.173381 dihydrapyrimidinase-like 2 

BE296313 Hs.265592 ESTs, Weakly similar to I38Q22 hypothetical protein [H.sapiens] 

N68168 gb:za11c01.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3', mRNA sequence 

N68201 Hs.269124 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 

N68300 gb:za12g07.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:292380 3\ 

N6B321 Hs.231500 EST 

N69575 Hs.102788 ESTs 

AL133572 Hs.199009 protein containing CXXC domain 2 

A1680737 Hs.289068 Homo sapiens cDNA FLJ1 1918 fis, clone HEMBB1000272 

T78277 Hs.100293 O-lmked N-acetylglucosamine (GlcNAc) transferase (UDP-N-acetytglucosamineipolypeptide-N- 

AW582962 Hs.102897 CGI-47 protein 

W19407 Hs.3862 regulator of nonsense transcripts 2; DKFZP434D222 protein 

AB002316 Hs.65746 KIAA0318 protein 

AK001357 Hs.102951 Homo sapiens cDNA FLJ10495 fis, clone NT2RP2000297, moderately similar to ZINC FINGER 

AA464836 Hs.291079 ESTs, Weakly similar to T27173 hypothetical protein Y54G1 1 A.9 - Caenorhabditis elegans 



J00077 
R05283 

T97733 
R23053 



Hs.155421 alpha-fetoprotein 

gb;ye91c08.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:125102 3' 

Hs,185685 ESTs 



repetitive element 128944 RC Fl23930_s AL137586 
132965 RC.R26589J A1248173 Hs.191460 



133740 RC R37588_s 

133074 RC.R37613 

124757 RC_R38398 

124762 RC_R39179_f 

124773 RC.R40923 

135266 RC.R41179 

131375 RC_R41294_s 

133753 RC.R42307J 

128540 RC_R43189J 

124785 RC.R43306 

124792 RC_R44357 

124793 RC_R44519 
sequence. 

124799 RC_R45QS8 
sequence. 

124812 RC.R47948J 

124821 RC_R51524 

127274 RC.R54950 

124835 RC.R55241 

124845 RC_R59585 

124847 RC.R6O044 

440630 RC.R60872 



AW162919 

AL1 34275 

H11368 

AA553722 

R45154 

R41179 

AW293165 

N/VL004427 

AW297929 

W38537 

R44357 

R44519 

R45088 

R47948 

H87832 

AW966158 

R55241 

R59585 

W07701 

BE561430 



Hs. 170160 

Hs.6434 

Hs.141055 

Hs.92096 

Hs.106604 

Hs.97393 

Hs.143134 

Hs. 165263 

Hs.328317 

Hs.280740 

Hs.48712 



Hs.188732 

Hs.7388 

Hs.58582 

Hs.101214 

Hs.101255 

Hs.304177 

Hs.239388 



gb;yh31a05.r1 Soares placenta Nb2HP Homo sapiens cDNA done 5' similar to contains L1 

Hs.52763 anaphase-promoting complex subunit 7 

hypothetical protein MGC12936 

RAB2, member RAS oncogene family-like 

hypothetical protein DKFZp761F2014 

Homo sapiens clone 23758 mRNA sequence 

ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 

ESTs 

KIAA0328 protein 
ESTs 

early development regulator 2 (homolog of polyhomeotic 2) 
EST 

hypothetical protein MGC3040 
hypothetical protein FU20736 

gb:yg24h04.s1 Soares infant brain 1NIB Homo sapiens cDNA clone IMAGE:33350 3', mRNA 
gb:yg38g04.s1 Soares infant brain 1MB Homo sapiens cDNA clone !MAGE:34896 3\ mRNA 
ESTs 

kelch (Drosophila)-Iike 3 

Homo sapiens cDNA FLJ12789 fis, clone NT2RP2001947 

EST 

ESTs 

Homo sapiens clone FLB8503 PR02286 mRNA, complete cds 

Human DNA sequence from clone RP1-304B14 on chromosome 6. Contains a gene for a novel 



protein and a part of a gene for a novel protein with two isoforms. Contains ESTs, STSs, GSSs and a CpG island 



124861 RC_R66690 
130141 RC_R67266_s 
124879 RC.R73588 
124892 RC.R79403 
124906 RC.R87647 
124922 RC_R93622 

124940 RC_R99599_s 

124941 RC R99612 
124943 RCJ02888 



R67567 Hs. 1071 10 
NM 004455 Hs.150956 



R73588 

AI970003 

H75964 

R93622 

AF068846 

AI766661 

AW963279 



WARNING ENTRY [H.sapiens] 



124947 RGJQ3170 
124954 RC.T10465 
132924 RCJN5418J 
133113 RC.T15597J 
132975 RCJT15652J 
133235 RCJ16898 s 
131082 RCJT26644J 
124980 RCJ40841 
124984 RC.T47566 i 
124991 RC.T50116 



T03170 

AW964237 

U55184 

BE383768 

R43504 

AW960782 

AI091121 

T40841 

BE313210 

T50116 



Hs.101533 

Hs.23756 

Hs.107815 

Hs.12163 

Hs.103804 

Hs.27774 

Hs.1 23373 

Hs.100165 

Hs.6728 

Hs.154145 

Hs.65238 

Hs.6181 

Hs.6856 

Hs.246218 

Hs.98681 

Hs.223241 



ESTs 

exostoses (multiple)-like 1 
ESTs 

hypothetical protein similar to swine acylneuraminate lyase 
ESTs 

eukaryotic translation initiation factor 2, subunit 2 (beta, 38kD ) 
heterogeneous nuclear ribonucleoprotein U (scaffold attachment factor A) 
ESTs, Highly similar to AF1 61 349 1 HSPC086 [H.sapiens] 

ESTs, Weakly similar to ALU1_HUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 
ESTs 

KIAA1548 protein 
hypothetical protein FU11585 

95 kDa retinoblastoma protein binding protein; KIAA0661 gene product 
ESTs 

ash2 (absent, small, or homeotic, Drosophiia, homolog)-like 
Homo sapiens cDNA: FLJ21781 fe, clone HEP00223 
ESTs 

eukaryotic translation elongation factor 1 delta (guanine nucleotide exchange protein) 
gb:yb77c10.s1 Stratagene ovary (937217) Homo sapiens cDNA clone IMAGE:77202 3' similar 



to similar to SP:VE22 LAMBD P03756 EA22 GENE , mRNA sequence. 
129475 RC_T50145_s NM_004477 Hs.203772 FSHD region gene 1 
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125000 RCJ58615 

132932 RC.J59940J 

129534 RCJ63595 

125008 RC.T64891 

125009 RCJT64924 
132940 RCJ64933_r 

125017 RGJ68875 
sequence, 

125018 RC T69027 
125020 RC T69924 
129891 RCT70353 
134204 RC T79780.S 
125050 RC T79951 
125052 RC_T80174_s 
125054 RCJT80622 
125063 RC T85352 



T58615 

AW1 18826 

AK002126 

T91251 

T64924 

T79136 

T68875 



Hs.110640 

Hs.6093 

Hs.11260 

Hs.303046 
Hs.127243 



ESTs 

Homo sapiens cDNA: FLJ22783 fis, done KAIA1993 
hypothetical protein FLJ11264 

gb:yd60a10.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone 3\ mRNA sequence 
ESTs 

Homo sapiens mRNA for KIAA1724 protein, partial cds 

gb:yc30f05.s1 Stratagene liver (937224) Homo sapiens cDNA clone IMAGE;82209 3', mRNA 



T69027 • Hs.57475 sex comb on midieg homolog 1 

T69981 gb:yc19d03.r1 Stratagene lung (937210) Homo sapiens cDNA clone 5\ mRNA sequence 

AI084813 Hs.13197 ESTs 
AI873257 Hs.7994 hypothetical protein FU20551 
AW970209 Hs. 1 1 1 805 ESTs 

T85104 Hs!222779 ESTs, Moderately similar to similar to NEDD4 [H.sapiens] 
T80622 Hs.268601 ESTs, Weakly similar to envelope [Ksapiens] 

T85352 gb:yd82d01.s1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA done )MAGE:114721 3' 

similar to contains A!u repetitive element;contains L1 repetitive element mRNA sequence. 

125054 RC.T85373 T85373 gb:yd82f07.s1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA clone IMAGE:114757 3* 

similar to contains Alu repetitive element;contains MER3 repetitive element ;, mRNA sequence. 



125066 RC.J86284 T86284 
Alu repetitive element;, mRNA sequence 

112264 RC T89579.S AL045364 Hs.79353 

125080 RCJ903B0 T90360 Hs.268620 
WARNING ENTRY [rf.sapiens] 

125097 RC T94328J AW576389 Hs.335774 
125104 RC T95590 T95590 



gb:yd77b07.s1 Soares fetal liver spleen 1 NFLS Homo sapiens cDNA done 3' similar to contains 
transcription factor Dp-1 

ESTs, Highly similar to ALU6.HUMAN ALU SUBFAMILY SP SEQUENCE CONTAMINATION 



EST, Moderately similar to S65657 alpha-1C-adrenergic receptor splice form 2 [H.sapiens] 
gb:ye40a03.s1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA done 3' similar to 
gb|M10817llGURRAA Iguana iguana 5S (rRNA );, mRNA sequence 

135107 RCT97257J T97257 Hs.337531 ESTs, Moderately similar to 138022 hypothetical protein [Ksapiens] 
129550 RCT97599J AA845462 Hs.124024 deltex (Drosophila) homolog 1 

125118 RCJT97620 R10606 gb:yf35f11.s1 Soares fetal liverspleen 1NFLS Homo sapiens cDNA clone IMAGE:128877 3' 

similar to contains Alu repetitive element;, mRNA sequence. 



125120 RC T97775 
134160 RCJT98152 
125136 RC.W31479 
125144 RC W37999 
125150 RCW38240 
104180 RC.W40150 
131987 RC W45435 
125178 RC_W58202 
125180 RC W58344 
125182 RCW58650 
130588 RC_W68736 
125197 RC_W69106 
133497 RC_W69111 
100562 RC_W69385_s 
125639 RC_W69399_s 
129232 RC_W69459 
101495 RC.W72424 
125209 RC_W72724 
125212 RC W72834 
129132 RC W73955 
125223 RC.W74701 



T97775 

T98152 

AW962364 

AB037742 

W38240 

AA247778 

AW453069 

W93127 

W58469 

AA451755 

AL030996 

AF086270 

BE617303 

NM_006185 

Z97630 

R98881 

W72424 

W72724 

AA746225 

BE383436 

AI916269 



WARNING ENTRY [H.sapiens] 

125225 RC.W76540 W74169 

125228 RC.W79397 AA033982 

132393 RCJV85888 AL1 35094 

125238 RC W86038 N99713 

125247 RC.W86881 AA694191 

129296 RCJV87804 A1051967 

125263 RC W88942 AA098878 

125266 RC.W90022 W90022 
PRECURSOR [H.sapiens] 

131321 RC.W92272 U91543 

131601 RC_W92764_s NM_007115 

131677 RC W93040 H05317 

120837 RC.W93092 BE149656 

125277 RC_W93227 W93227 

125278 RC.W93523 AI218439 
125280 RC_W93659 AI123705 
131856 RC W94003_s W93949 
131844 RC_W94401_s AI419294 
125284 RC W94688 NMJ502666 
313447 RC_W94787_s AW016321 
130799 RC Z38294_s AB028945 
125289 RC.Z38311 T34530 
128874 RC_Z38465_s H06245 



Hs.100717 EST 

Hs.79432 fibrillin 2 (congenital contractural arachnodactyly) 

Hs.129051 ESTs 

Hs.24336 KIAA1321 protein 

Empirically seleded from AFFX single probeset 

Hs.1 19155 Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 814975 

Hs.3657 activity-dependent neuroprotective protein 

Hs.31845 ESTs 

Hs.103120 ESTs 

Hs.263560 ESTs 

Hs.1641 1 hypothetical protein LOC57187 

Hs.278554 heterochromatin-Jike protein 1 

Hs.74266 hypofrietical protein MGC4251 

Hs.301512 nuclear mitotic apparatus protein 1 

Hs.2261 17 H 1 histone family, member 0 

Hs.1 09655 sex comb on midieg (Drosophila)-like 1 

Hs.1 12405 S100 calcium-binding protein A9 (calgranuiin B) 

Hs.103174 ESTs, Weakly similar to TSP2_HUMAN THROMBOSPONDIN 2 PRECURSOR [H.sapiens] 

Hs.103173 ESTs 

Hs. 108847 hypothetical protein MGC2749 

Hs.109057 ESTs, Weakly similar to ALU5_HUMAN ALU SUBFAMILY SC SEQUENCE CONTAMINATION 

Hs.16492 DKFZP564G2022 protein 

Hs.1 10059 ESTs, Weakly similar to 138022 hypothetical protein [H.sapiens] 

Hs.47334 hypothetical protein FLJ 14495 

Hs.109514 ESTs 

Hs.163914 ESTs 

Hs.1 101 22 ESTs 

gb:zn45g10.r1 Stratagene HeLa ceil s3 937216 Homo sapiens cDNA clone 5', mRNA sequence 

Hs.186809 ESTs, Highly similar to LCT2_HUMAN LEUKOCYTE CELL-DERIVED CHEMOTAXIN 2 

Hs.25601 chromodomain helicase DNA binding protein 3 

Hs.29352 tumor necrosis fador, alpha-induced protein 6 

Hs.283549 ESTs 

Hs.306621 Homo sapiens cDNA FLJ11963 fis, clone HEMBB1001051 

Hs.1 03245 EST 

Hs.1 29998 enhancer of polycomb 1 

Hs.106932 ESTs 

Hs.33245 ESTs v 

Hs.324342 ESTs 

Hs.103253 perilipin 

Hs.82306 destrin (actin depolymerizing factor) 

Hs.12696 cortactin SH3 domain-binding protein 

Hs.4210 Homo sapiens cDNA FLJ13069 fis, clone NT2RP3001752 

Hs.106801 ESTs, Weakly similar to PC4259 ferritin associated protein [H.sapiens] 
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130966 RC_Z38525_s 
128875 RC_Z38538_f 
133200 RC_Z38551_s 
130158 RC_Z38783_S 
125295 RC.Z39113 
domain, (semaphorin) 4F 
125298 RC Z39255J 
125300 RC.Z39591 
323122 RC.Z39783 s 
311463 RCJ39920 
130882 RC Z40166J 
128888 RC_Z4G388_S 
125310 RC Z40646 
125315 RC.Z41697 
125317 RC_Z99349 
135096 RC Z99394_s 
104786 RCJ\A027168 
132837 D58024_s 
120456 RC_AA251113 
132459 RCJU347573 
101545 M31210 
133505 C01527 
132360 RC_N62948_s 
132738 RC.W42674 
119586 RC W43000.S 
129914 RC_N31750_s 
130839 AF009301 
132813 L37347 
134342 M99564 
131876 RC AA430673 
105426 RC.AA251297 
132968 RC_AA620722 
132173 RC W46577_s 
113932 RC_W81237 
114452 RCJ^A020825 
PROTEIN TC10 
115243 RC_AA278766 
134403 RC_H93708_s 
129647 RC_N49394 
111428 RC_H56559_s 
115967 RC_AA446887 
120726 RC.AA293656 
114995 RC_AA251152 
303876 RC.AA233334. 
dominant, ataxin 3) 
311463 RC.Z39920 
120302 RC_AA192173 
133071 RC.AA455044 
121032 RC_AA398504 
129829 U41813 
120245 RCJW166965 
120985 RC_AA398222 
114184 RC.Z39095 
447503 RC.AA284744. 
cds 

132837 RC.AA428201 
121034 RC.AA398507 
119718 RC.W69216 
120455 RC_AA251083 
1252B0 RC_W93659 
132155 RC.AA227903 
120609 RC.AA283902 
121278 RCJV\401631 
109023 RCJW157293 
129815 RCJD60208J 
108061 RC_AA043979 
113287 RCJF66847 
114082 RC_Z38239 
116334 RC.AA491457 
131486 RC_Z4Q071_s 
107860 RCJVA024961 
131263 RCJM43826 
132207 RC_AA443294 
129183 RC_AA155743 
408431 RCJ23708 
120575 RC.AA280934 



AW971018 

AB040923 

AB037715 

AB032947 

AB022317 

AW972542 

Z39591 

BE622770 

R55344 

AA497044 

AI760853 

R59161 

R38110 

Z99348 

AA081258 

AA027167 

AA370362 

AA488750 

AL120071 

BE246154 

AI630124 

AW893660 

AK000738 

AF088033 

NM-012421 

AB011169 

BE313625 

NNL000275 

AA083764 

W20027 

AF234532 

X89426 

AA256444 

AI369275 

AA806600 
AA334551 
AB018259 
AL031428 
A1745379 
AA293655 
AA769266 
s U64820 

R55344 
AA837098 
BE384932 
AA393037 
AF010258 
AW959615 
AI219896 
R56434 
.f AA1 15496 

AA370362 

AL389951 

W69216 

AA251720 

AI123705 

AK001607 

AW978721 

AA037121 

AA157293 

BE565817 

AA043979 

T66847 

AK001612 

AL038450 

F06972 

AA024961 

AU077002 

BE206939 

BE561824 

AI338631 

AW978022 



Hs.21659 ESTs 

Hs.106808 kelch (Drosophila)-like 1 

Hs.183639 hypotheUca! protein FU10210 

Hs.151301 Ca2+Kiependent activator protein for secretion 

Hs.25887 sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic 

Hs.289008 Homo sapiens cDNA: FLJ21814 fis, clone HEP01068 

Hs.1 01376 EST 

Hs.26491 5 Homo sapiens cDNA FU1 2908 fis, clone NT2RP2004399 

Hs.22142 cytochrome b5 reductase b5R.2 

Hs.20887 hypothetical protein FU10392 

Hs.241558 ariadne (Drosophila) homolog 2 

Hs.1 24953 ESTs 

Hs.106296 ESTs 

Hs.112461 ESTs, Weakly similar to I38022 hypothetical protein (H.sapiens] 

Hs.132390 zinc finger protein 38 (KOX 18) 

Hs.10031 KIAA0955 protein 

Hs.57958 EGF-TM7-fatraphiiin-reIated protein 

Hs,88414 BTB and CNC homology 1, basic leucine zipper transcription factor 2 

Hs.48998 fibronectin leucine rich transmembrane protein 2 

Hs.154210 endothelial differentiation, sphingolipid G-protein-coupled receptor, 1 

Hs.324504 Homo sapiens mRNA; cDNA DKFZp586J0720 (from clone DKFZp586J0720) 

Hs.46440 solute carrier family 21 (organic anion transporter), member 3 

Hs.264636 hypothetical protein FLJ20731 

Hs.159225 ESTs 

Hs. 1 3321 rearranged L-myc fusion sequence 

Hs.20141 similar to S. cerevisiae SSM4 

Hs.57435 solute carrier family 1 1 (proton-coupled divalent metal ion transporters), member 2 

Hs.82027 oculocutaneous albinism II (pink-eye dilution (murine) homolog) 

Hs.6101 hypothetical protein MGC3178 

Hs.23439 ESTs 

Hs.61638 myosin X 

Hs.41716 endothelial cell-specific molecule 1 

Hs.126485 hypothetical protein FLJ12604; K1AA1692 protein 

Hs.243010 Homo sapiens cDNA FLJ14445 fis, clone HEMBB1001294, highly similar to GTP-BINDING 

Hs.1 16665 KIAA1842 protein 

Hs.82767 sperm specific antigen 2 

Hs.1 18140 KIAA0716gene product 

Hs.174174 KIAA0601 protein 

Hs.42911 ESTs 

Hs.97293 ESTs 

Hs. 193657 ESTs 

Hs.66521 Machado-Joseph disease (spinocerebellar ataxia 3, olivopontocerebellar ataxia 3, autosomal 

Hs.22142 cytochrome b5 reductase b5R.2 

Hs.269933 ESTs 

Hs.64313 ESTs, Weakly similar to AF257182 1 G-protein-coupled receptor 48 [H.sapiens] 

Hs.161798 ESTs 

Hs.1 27428 homeoboxA9 

Hs.1 11045 ESTs 

Hs.97592 ESTs 

Hs.21062 ESTs 

Hs.336898 Homo sapiens, Similar to RIKEN cDNA 1810038N03 gene, clone MGC:9890, mRNA, complete 

Hs.57958 EGF-TM7-latrophilin-related protein 

Hs.271623 nucleoporin 50kD 

Hs.92848 ESTs 

Hs.104347 ESTs, Weakly similar to ALUC.HUMAN !!!! ALU CLASS C WARNING ENTRY !!! [H.sapiens] 

Hs.106932 ESTs 

Hs.41 127 hypothetical protein FLJ13220 

Hs.266076 ESTs, Weakly similar to A46010 X-linked retinopathy protein [H.sapiens] 

Hs.98518 Homo sapiens cONA FU11490 fis, clone HEMBA1001918 

Hs72168 ESTs 

Hs.26498 hypothetical protein FU21657 

Hs.62651 EST 

Hs.194040 ESTs, Weakly similar to I38022 hypothetical protein [H.sapiens] 

Hs.26962 Homo sapiens cDNA FLJ10750 fis, clone NT2RP3001929 

Hs.48948 ESTs 

Hs.27372 BMX non-receptor tyrosine kinase 

Hs.50730 ESTs 

Hs.24950 regulator of G-protein signalling 5 

Hs.42287 E2F transcription factor 6 

Hs.273369 uncharacterized hematopoietic stem/progenitor cells protein MDS027 

Hs.43266 Homo sapiens cDNA: FU22536 fis, clone HRC13155 

Hs.238911 hypothetical protein DKFZp762E1511; KIAA1816 protein 
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132121 RC_AA443284_s NM.004529 

117657 RC N39074 N39074 

134922 RC W04507_s AI718295 

118523 RC_R41828js Y07759 

116845 RC H64973 AA649530 

115291 RC.AA279943 BE545072 

120326 RC_AA196300 AA196300 

130174 M29550 M29551 

129131 RC.AA436489 AB026436 

129868 RC_AA287032 AW172431 

118661 RC N70777 AL137554 

129829 RC.AA496921 AF010258 

115985 RC.AA447709 AA447709 

134637 RC AA369856_s U87309 

132714 RC_AA252598 W39388 

129771 RC H73237 

123360 RC_AA504784 

132902 RC AA490969 

113716 RCJT97750 

113825 RCJV48860 

130367 RC_Z38501 

120541 RC.AA278298 

116727 RC.F13684 

118219 RC.N62231 

119767 RC W72562 

128917 RC_AA481252 

451553 RC.AA020928 

132716 RCAA251288 

118525 RCJM67861 

114618 RC_AA084162 

119743 RC_W70242 

108154 RC AA425151.S NMJ05754 

122798 RC AA460324 AW366286 

133746 U44378 

119822 RC.W74471 

122186 RC.AA435842 

114941 RC.AA243017 

118053 RC.N53367 

123234 RCJ\A490227 

129280 M63154 

118995 RQ.N94591 

116750 RC_H05960 

129026 M98833 

105127 RC.AA158132 

114513 RC.AA044825 

411856 RC.T35697 

132036 W01568 

130091 RC.W88999 
sequence 

414108 U09564 

119881 RC.W81456 

117770 RC.N47953 

119850 RC_W80447 

115439 RC.AA284561 

123107 RC„AA486071 

406698 M24364 

121231 RC_AA400780 

132074 AB002366 

413670 AB000115 

125277 RC.W93227 

114056 RC_AA1 86324 

121153 RCJ\A399640 

121609 RC_AA416867 

120661 RC.AA287556 

120850 RC_AA349647 

124947 RCJ03170 

130529 RC.AA280886 
repetitive element;, mRNA sequence 

117683 RCJM0180 N40180 
IMAGE:276387 3* similar to contains U.tl 

120745 RC AA302809 AA302809 

120936 RC AA385934 AA385934 

112597 RC R78376 R78376 

120183 RC Z40174 AW082866 

120644 RCIAA287038 AI869129 



AL096748 

AA532718 

AI936442 

AA001356 

AW014486 

AL135301 

W07318 

R76472 

AA862391 

W72552 

AI365215 

AA018454 

BE379595 

N67861 

AW979261 

AA947552 



AW410035 

AF086409 

AA398811 

AA236512 

N53391 

NMJ01938 

M63154 

N94591 

AA760689 

AL120297 

AA045648 

AA044873 

H67899 

AL1 57433 

W88999 

AI267592 

W81486 

AW957372 

AI247568 

AI567972 

AA225048 

X03068 

AA814948 

AA478486 

AB000115 

W93227 

AA188175 

AA399640 

AA416867 

AA287556 

AA349647 

T03170 

AA178953 



Hs.404 ' myeloid/lymphoid or mixed-lineage leukemia (trithorax (Drosophila) homolog); translocated to, 3 

Hs.44933 ESTs 

Hs.91161 prefotdin 4 

Hs. 1701 57 myosin VA (heavy polypeptide 1 2, myoxin) 

gb:ns44f05.s1 NCI CGAP Jtfvl Homo sapiens cDNA done, mRNA sequence 

Hs.122579 hypothetical protein FU10461 

Hs.21 145 hypothetical protein RG083M05.2 

Hs.151 531 protein phosphatase 3 (formerly 2B), catalytic subunit, beta isoform (calcineurin A beta) 

Hs. 1 77534 dual specificity phosphatase 10 

Hs.13012 ESTs 

Hs.49927 protein kinase NYD-SP15 

Hs.127428 homeoboxA9 

Hs.268115 ESTs, Weakly similar to T08599 probable transcription factor CA150(H.sapiens] 

Hs. 1 80941 vacuolar protein sorting 41 (yeast homolog) 

Hs.55336 Homo sapiens, clone MGC:17421 , mRNA, complete cds 

Hs.102708 DKFZP434A043 protein 

Hs.178604 ESTs 

Hs.59838 hypothetical protein FLJ10808 

Hs.18159 ESTs 

Hs.22509 ESTs 

Hs.8768 hypothetical protein FLJ10849 

Hs.240 M-phase phosphoprotein 1 

Hs.65646 ESTs 

Hs.48494 ESTs, Moderately similar to A46010 X-linked retinopathy protein [H.sapiens] 

Hs.58119 ESTs 

Hs.206097 oncogene TC21 

Hs.269211 ESTs 

Hs.283738 casein kinase 1 , alpha 1 

Hs.49390 ESTs 

Hs.291993 ESTs 

Hs.58086 ESTs 

Hs.220689 Ras-GTPase-activating protein SH3-domain-binding protein 

Hs.145696 splicing factor (CC1.3) 

Hs.75862 MAD (mothers against decapentapleglc, Drosophila) homolog 4 

Hs.301327 ESTs 

Hs.104673 ESTs 

Hs.87331 ESTs 

Hs.47629 ESTs 

Hs.1 6697 down-regulator of transcription 1 , TBP-binding (negative cofactor 2) 

Hs.1 10014 gastric intrinsic factor (vitamin B synthesis) 

Hs.323056 ESTs 

Hs.92418 ESTs 

Hs.1 08043 Friend leukemia virus integration 1 

Hs.301957 nudix (nucleoside diphosphate linked moiety X)-type motif 5 

Hs.103446 ESTs 

Hs.4190 Homo sapiens cDNA: FLJ23269 fis, clone COL09533 

Hs.37706 hypothetical protein DKFZp434E2220 

gb:zh70h03.s1 SoaresJetal_liver_spleen_1NFLS_S1 Homo sapiens cDNA clone 3', mRNA 

Hs.75761 SFRS protein kinase 1 

Hs.58648 ESTs 

Hs.46791 ESTs, Weakly similar to I38022 hypothetical protein [Ksapiens] 

Hs.58452 ESTs 

Hs.193090 ESTs, Highly similar to AF161437 1 HSPC319 (H.sapiens] 

Hs.104207 ESTs 

Hs.73931 major histocompatibility complex, class II, DQ beta 1 

Hs.96343 ESTs, Weakly similar to ALUCJ-IUMAN !!!! ALU CLASS C WARNING ENTRY HI [H.sapiens] 

Hs.3852 KIAA0368 protein 

HsJ5470 hypothetical protein, expressed in osteoblast 

Hs.103245 EST 

Hs.82506 KIAA1254 protein 

Hs.97694 ESTs 

Hs.98185 EST 

Hs.263412 ESTs, Weakly similar to ALUB.HUMAN !!!! ALU CLASS B WARNING ENTRY !!! [H.sapiens] 

Hs.96927 Homo sapiens cDNA FLJ12573 fis, clone NT2RM4000979 

Hs.100165 ESTs 

gb:zp39e03.s1 Stratagene muscle 937209 Homo sapiens cDNA clone 3' similar to contains Alu 

gb:yy44d02.s1 Soares_mu!bpte_sclerosis_2NbHMSP Homo sapiens cDNA clone 
L1 repetitive element mRNA sequence. 

gb:EST10426 Adipose tissue, white I Homo sapiens cDNA 3' end, mRNA sequence. 
EST, Highly similar to (defline not available 7499603) [C.elegans] 



Hs.97184 
Hs.29733 
Hs.65882 
Hs.96616 



EST 

ESTs 

ESTs 
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119023 RC.N98488 N98488 
IMAGE:310129 3', mRNA sequence. 

107582 RCJM02147 AA002147 

118249 RCJJ625B0 N62580 

115022 RC.AA252029 AA252029 

117710 RC N45198 N45198 

115341 RCAA281452 AA281452 

118896 RC.N90680 N46213 

121121 RCAA399371 AA399371 

118329 RC_N63520 N63520 
3', mRNA sequence. 

119496 RC.W35416 W35416 

118111 RC_N55493 N55493 
mRNA sequence. 
119062 RC.R16698 
116710 RCJ r 10577 f 
119261 ROJ15956 
122723 RC AA457380 



AW444881 
F10577 
T15956 
AA457380 



gb:zb82h01.s1 Soares_senescentJibroblasts_NbHSF Homo sapiens cDNA clone 
Hs.59952 EST 

Hs.322925 EST, Weakly similar to putative p150 [H.sapiens] 
Hs.87935 ESTs 

Hs.47248 ESTs, Highly similar to similar to Cdc14B1 phosphatase [H.sapiens] 
Hs.88840 EST, Weakly similar to granule cell marker protein [M.musculus] 
Hs.54642 methionine adenosyltransferase II, beta 
Hs.189095 similar to SALL1 (sal (Drosophila)-like 

gb:yy62f01.s1 Soaresjriuitip!e_sclerosis_2NbHMSP Homo sapiens cDNA clone IMAGE:278137 

Hs.1 56861 ESTs, Moderately similar to A46O10 X-linked retinopathy protein [H.sapiens] 

gb:yv50c02.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE;246146 3', 

ESTs 



Hs.77829 
Hs.306088 
Hs.65289 



v-crk avian sarcoma virus CT10 oncogene homolog 
EST 

gb:aa86b10.s1 Stratagene fetal retina 937202 Homo sapiens cDNA clone IMAGE:838171 3' 
similar to contains L1.b3 L1 repetitive element mRNA sequence. 

1 17732 RC.N46452 N46452 gb:yy76h09.s1 Soares_multiple_scterosts„2NbHMSP Homo sapiens cDNA clone 

IMAGE;279521 3* similar to contains L1.t2 L1 repetitive element ;, mRNA sequence. 

104787 RCLAA027317 AA027317 gb:ze97d11.s1 Soares_fetaLheartNbHH19W Homo sapiens cDNA clone IMAGE:366933 3' 

similar to contains Alu repetitive element;, mRNA sequence. 



100071 A28102 

115819 RC AA426573 

130882 RC_Z40166J 

125225 RCJW76540 

108339 RCJVA070801 



A28102 

AA486620 

AA497044 

W74169 

AW151340 



WARNING ENTRY [H.sapiens] 



75 



100338 063483 
121636 RC_AA417027 
103875 RC_AA418387 
118716 RCJJ73460 
119763 RCJV72450 
121917 RC^AA428218 
132806 M91488 
130949 Y10659 
108806 RC AA129933 
133276 RC_AA490478 
134760 RC H16758 
132867 AA121287 
132051 AA091284 
114208 RC_Z39301 
104094 AA418187 
128718 AA426361 
302032 RCN20407 
115501 RC_AA291553 
101997 U01160 
103708 AA037206 
101899 S59184 
115839 RC_AA429038 
409459 D50678 
103563 Z22534 
123233 RC.AA490225 
[H.sapiens] 

121305 RCAA402468 
114798 RC.AA159181 
133145 RC.AA196549 
131567 RC_AA291015_s AF015592 
112300 RC R54554 H24334 
129507 RC.AA192Q99 AJ236885 
121033 RC AA398505 AA398505 
121151 RC_AA399636 AA399636 
121402 RC.AA406063 AA406063 
123203 RC_AA489671 AA352335 
132271 RCAA236466 AB030034 
125197 RC W69106 AF086270 
114935 RC_AA242B09 H23329 
WARNING ENTRY [H.sapiens] 
125279 RC W93640 AW401809 
108778 RC.AA128548 AF133123 
108087 RC_AA045709 AA045708 
132466 RC N66810_s AI597655 
133328 R36553 AW452738 
124057 RC F13604 AA902384 
124800 RC R45115 AW864086 



D86864 
AA379203 
T26379 
AI658908 
R54146 
AA406397 
A1699432 
AV656840 
AF070578 
AW978439 
NM.000121 
AF226667 
AA393968 
AL049466 
AA418187 
NM.002959 
NM.001992 
AA291553 
AU076536 
AA430591 
S59184 
BE300266 
D86407 
L02911 
AW974175 

AA402468 
AA1 59181 
H94227 



Human GABAa receptor alpha-3 subunit 

Hs.41135 endomucin-2 

Hs.20887 hypothetical protein FIJ10392 

Hs.16492 DKFZP564G2022 protein 

Hs.51615 ESTs, Weakly similar to ALU7.HUMAN ALU SUBFAMILY SQ SEQUENCE CONTAMINATION 

Hs.57735 acetyl LDL receptor; SREC 

Hs.306654 Homo sapiens cDNA aJ13574 fis, clone PLACE1008625 

Hs.48802 Homo sapiens clone 23632 mRNA sequence 

Hs.1 18722 fucosyltransferase 8 (alpha (1,6) fucosyltransferase) 

Hs.10450 Homo sapiens cDNA: FU22063 fe, clone HEP10326 

Hs 98038 ESTs 

Hs.278619 hypothetical protein FLJ10099 

Hs.2851 15 interleukin 13 receptor, alpha 1 

Hs.71 168 Homo sapiens clone 24674 mRNA sequence 

Hs.69504 ESTs 

Hs.89548 erythropoietin receptor 

Hs.58553 CTP synthase il 

Hs.180145 HSPC030 protein 

Hs.7859 ESTs 

Hs.330515 ESTs 

Hs.281706 sortilin 1 

Hs. 1 28087 coagulation factor II (thrombin) receptor 

Hs.190086 ESTs 

Hs.50984 sarcoma amplified sequence 

Hs.72071 hypothetical protein FLJ20038 

Hs.79350 RYK receptor-like tyrosine kinase 

Hs.28935 transducin-like enhancer of split 1 , homoiog of Drosophila E(sp1 ) 

Hs.54481 low density lipoprotein receptor-related protein 8, apolipoprotein e receptor 

Hs.150402 Activin A receptor, type I (ACVR1) (ALK-2) 

Hs.188751 ESTs, Weakly similarto MAPBJHUMAN MICROTUBULE-ASSOCIATED PROTEIN 1B 

Hs.291557 ESTs 

Hs.54900 serologically defined colon cancer antigen 1 

Hs.6592 Homo sapiens, clone IMAGE:2961 368, mRNA, partial cds 

Hs.28853 CDC7 (cell division cycle 7, S. cerevtsiae, homo!og)-!ike 1 

Hs.26125 ESTs 

Hs.112180 zinc finger protein 148 (pHZ^52) 

Hs.97360 ESTs 

Hs.143629 ESTs 

Hs.98003 ESTs 

Hs.65641 hypothetical protein FLJ20073 

Hs, 1 1 5 1 75 sterile-alpha motif and leucine zipper containing kinase AZK 

Hs.278554 heterochromatin-like protein 1 

Hs.290880 ESTs, Weakly similar to ALU INHUMAN ALU SUBFAMILY J SEQUENCE CONTAMINATION 

Hs.4779 KIAA1 150 protein 

Hs.90847 general transcription factor IIIC, polypeptide 3 (102kD) 

Hs.40545 ESTs 

Hs.49265 ESTs 

Hs.265327 hypothetical protein DKFZp761 11 41 

Hs.73853 bone morphogenetic protein 2 

Hs.1 38617 thyroid hormone receptor interactor 1 2 
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121029 


RC AA398482 


AA398482 


Hs.97641 


120663 


RC AA287627 


AA827798 


Hs. 105089 


102133 


U15173 


AU076845 


Hs.1 55596 


108246 


RC_AA062855 


AI423132 


Hs.146343 


125226 


RC W78134 


AA782536 


Hs.1 22647 


120260 


RC AA171739 


AK000061 


Hs.101590 


124906 


RCR87647 


H75954 


Hs.107815 


109406 


RC AA226877 


AA199883 


Hs.67624 


109271 


RC AA195668 


AW137422 


Hs.86022 


125052 


RC_T80174_S 


T85104 


Hs.222779 


109101 


RCJ\A167708 


AW608930 


Hs.52184 


115241 


RCJ\A278723 


AA648278 


Hs.193859 


117163 


RC_H97909 


N36861 


Hs.42344 


113530 


RC T90313 


T90313 


Hs.1 6732 


120375 


RCJ\A227260 


AF028706 


Hs.1 11227 


129435 


M314256 


AF151852 


Hs.1 11449 


114864 


RC AA235256 


AA135332 


Hs.71608 


103988 


AA314389 


AA314389 


Hs.42500 


131006 


RCJ\A242763 


AF064104 


Hs.22116 


106781 


RC AA478474 


AA330310 


Hs.24181 


106141 


RC AA424558 


AF031463 


Hs.9302 


116213 


RC AA476738 


AA292105 


Hs.326740 


135266 


AB002326 


R41179 


Hs.97393 


13505B 


RCJ\A430152 


AI379720 


Hs.93814 


119908 


RC W85844 


AA524470 


Hs.58753 


103695 


AA018758 


AW207152 


Hs.186600 


103978 


AA307443 


NM 016940 


Hs.34136 


109485 


RC AA233472 


8E619092 


Hs.28465 


129574 


M458603 


AA026815 


Hs.1 1463 


115347 


RC AA281528 


AA356792 


Hs.334824 


120765 


RC AA338735 


AW961026 


Hs.96752 


WARNING ENTRY [H.sapiensl 




121059 


RC AA398628 


AA393283 




131887 


AA046548 ■ 


W17064 


Hs.332848 


member 1 






112064 


RC R43812 


AL049390 


Hs.22689 


115606 


RC AA400465 


AI025829 


Hs.86320 


131750 


RC H94855 s 


NM 004349 


Hs.31551 


102123 


U14518 


NMJJ01809 


Hs.1594 


129847 


RC W46767 


N64025 


Hs.296178 


133809 


RC AA235275 


AV649326 


Hs.76359 


132210 


RC N51499_s 


NM_007203 


Hs.42322 


122356 RC.AA443794 


AA443794 


Hs.98390 


114958 


RC AA243708 


N20912 


Hs.42369 


103951 


AA287840 


AL353944 


Hs.50115 


134703 


RC_AA280704 


AF1 17065 


Hs.88764 


128727 


AA287864 


AI223335 


Hs.50651 


105743 


RC_AA293300_s BE246502 


Hs.9598 


domain, (semaphorin) 4B 






103744 


AA076003 


AA079267 




sequence 




Hs.301532 


114348 


N80402 


AL050321 


114009 


RC W900&7 


AI248544 


Hs.103000 


134704 


RC AA280849 


AA837124 


Hs.88780 


128629 


AA399187 


AL096748 


Hs.102708 


104410 


H65925 


AI807519 


Hs.104520 


110200 


RC_H21075 


H21075 


Hs.31802 


124483 


RCJI53976 


AI821780 


Hs.179864 


101391 


M14648 


NM.002210 


Hs.295726 


109657 


RC F04826 


R60900 


Hs.26814 


117140 


RC H96813 


H96813 


Hs.42241 


132937 


RC AA233706J AW952912 


Hs.300383 


129799 


R36410 


AW967473 


Hs.239114 


105077 


RC AA142919 


W55946 


Hs.234863 


100850 


RC N58561J5 


AA836472 


Hs.297939 


131043 


RC AA490925 


AF084535 


Hs.22464 


118417 


RC_N66048J 


AF080229 




129254 


RC AA243695 


AA252468 


Hs.1098 


119149 


RC R58910 


BE304701 


Hs.65732 


133996 


AA091367 


AA380267 


Hs.78277 


110223 


RC H23747 


H19836 


Hs.31697 


117626 


RCJ136090 


AK001757 


Hs.281348 


135286 


RC_AA424469_s AW023482 


Hs.97849 


122967 


RC AA478521 


AA806187 


Hs.289101 


131236 


AA282640 


AF043117 


Hs.24594 


128568 


AA463380 


H12912 


Hs.274691 



EST 
ESTs 

BCL2/adenoviais E1B 19kD-interacting protein 2 
ESTs 

N-myristoyltransferase 2 

hypothetical protein 

ESTs 

ESTs 

ESTs 

ESTs, Moderately similar to similar to NEDD-4 [H .sapiens] 

hypothetical protein FU20618 

ESTs 

ESTs 

ESTs 

Zic family member 3 (odd-paired Drosophila homolog, heterotaxy 1 ) 

CGI-94 protein 

ESTs 

ADP-ribosylation factor-like 5 

CDC14 (cell division cycle 14, S. cerevisiae) homolog B 

ESTs 

phosducin-like 

hypothetical protein MGC10947 

KIAA0328 protein 

hypothetical protein 

ESTs 

ESTs 

chromosome 21 open reading frame 6 

Homo sapiens cDNA: FLJ21869 fis, clone HEP02442 

UMP-CMP kinase 

hypothetical protein FLJ14825 

ESTs, Weakly similar to ALU8JWMAN ALU SUBFAMILY SX SEQUENCE CONTAMINATION 

gb:zt74e03.M SoaresJestisJIHT Homo sapiens cDNA clone 5\ mRNA sequence 
SWI/SNF related, matrix associated, actin dependent regulator of chromatin, subfamily e, 

Homo sapiens mRNA; cDNA DKFZp58601318 (from clone DKFZp58601318) 
ESTs 

core-binding factor, runt domain, alpha subunit 2; translocated to, 1; cyclin D-retated 
centromere protein A (17kD) 
hypothetical protein FLJ22637 
catalase 

A kinase (PRKA) anchor protein 2 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp761J1112 (from clone DKFZp761J1112) 
male-specific lethaI-3 (Drosophila)-like 1 
Janus kinase 1 (a protein tyrosine kinase) 

sema domain, immunoglobulin domain (Ig), transmembrane domain (TM) and short cytoplasmic 

gb:zm97e10.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA clone 3', mRNA 

CRP2 binding protein 
KIAA0831 protein 
ESTs 

DKFZP434A043 protein 

Homo sapiens cDNA FLJ13694 fis, clone PLACE20001 15 

ESTs, Highly similar to A59266 unconventional myosin-15 [H.sapiens] 

ESTs 

integrin, alpha V (vitronectin receptor, alpha polypeptide, antigen CD51) 

ESTs 

ESTs 

hypothetical protein MGC3032 

mannosidase, alpha, class 1A, member 2 

Homo sapiens cDNA FLJ12082 fis, clone HEMBB1002492 

cathepsinB 

epilepsy, progressive myoclonus type 2, Lafora disease (taforin) 

gb:Human endogenous retrovirus K clone 10.1 polymerase mRNA, partial cds 

DKFZp434J1813 protein 

ESTs 

DKFZP434F2021 protein 
ESTs 

hypothetical protein FLJ10895 
ESTs 

glucose regulated protein, 58kD 

ubiquitination factor E4B (homologous to yeast UFD2) 

adenylate kinase 3 
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112888 
115192 
118688 
122264 
128981 
131042 
103704 
121341 
106593 
115195 
115425 
117258 
120209 



RC T03872 
RCLAA261920 
RC N71484 
RC AA436837 
AA135452 
RC R42457 
AA028171 
AA233107 
RC M456826 
RCJ\A262156 
RC AA284071 
RC N21299 
RC Z40892 



AW195317 Hs.107716 

AA741024 Hs.88378 

AK000708 Hs.169764 
AA436837 

AA927177 Hs.86041 

AI826288 Hs.171637 

AA028171 Hs.151258 

AF035528 Hs.153863 

AW296451 Hs.24605 

AW968619 Hs.155849 

AAB11895 Hs.180680 

AF086041 Hs.42975 
F02951 

L16991 Hs.79006 

AW959755 Hs.288896 

AA059459 Hs.62592 

N28707 Hs-154304 

N91453 Hs.102987 

R22039 Hs.23217 

AA280095 Hs.B8689 

AA282292 Hs.279841 

N41549 Hs.285410 

AI269096 Hs.135578 

AL137275 Hs.20137 

NM_007331 Hs.1 10457 

NM_002267 Hs.3886 

R36095 Hs.268695 

AA463550 Hs.337532 

AL137939 Hs.40096 

BE246400 Hs.285176 

R24024 Hs.158101 

AI091731 Hs.87293 

R85350 Hs.101368 

AA355925 Hs.36232 

N80181 Hs.221498 

BE149656 Hs.306621 

NM.015361 Hs.268053 

AF071076 Hs.112255 
AA075000 
sequence 

132019 RC AA134965J H56995 Hs.37372 

132310 RC.AA284107 AA173223 Hs.289044 

117367 RCJJ24954 AI041793 Hs.42502 
103743 AA075998 AA075998 



sequence 
134082 L16991 
104774 RC AA026066 
115625 RC_AA401630 
104469 N28707 
107401 W20054 
111686 RC.R21510 
115300 RC AA280026 
115378 RC AA282292 
132224 RC.H97819 
113791 M95767 
129144 AA004987 
104448 L44574 
132084 RCJT26981J3 
111831 RC.R36083 
114765 RC.AA252163 
115029 RCJA252219 
100457 H81492 
104536 R24011 
PROTEIN 91 
116167 RC^AA461562 
103889 AA236771 
131978 RC_H48459_s 
118843 RC N80181 
120837 RCJV93092 
133647 D21852 
129521 U41815 
103746 AA081876 



gb:M15887 ACYL-COA-BINDING PROTEIN (HUMAN);, mRNA sequence 



hypothetical protein FU22344 
ESTs 

hypothetical protein FU20701 

gb:zv57g07.s1 SoaresJestis_NHT Homo sapiens cDNA clone 3', mRNA sequence 
C6G triplet repeat binding protein 1 
hypothetical protein MGC2628 
hypothetical protein FLJ21062 

MAD (mothers against decapentaptegic, Drosophila) homolog 6 

ESTs 

ESTs 

ESTs, Weakly similar to 154374 gene NF2 protein [H.sapiens] 
ESTs 

gb:HSC1HB082 normalized infant brain cDNA Homo sapiens cDNA clone c-1hb08 3\ mRNA 

deoxythymidylate kinase (thymidylate kinase) 

Homo sapiens cDNA FLJ12977 fis, clone NT2RP2006261 

ESTs 

Homo sapiens chromosome 19, BAC 282485 (CIT-B-344H19) 

ESTs 

ESTs 

ESTs 

hypothetical protein RJ10335 
ESTs 

chitobiase, di-N-acetyl- 
hypothetical protein DKFZp434P0116 
Wolf-Hirschhom syndrome candidate 1 
karyopherin alpha 3 (importin alpha 4) 
ESTs 

ESTs, Weakly similar to A47582 B-cell growth factor precursor [H.sapiens] 
ESTs 

acetyl-Coenzyme A transporter 

Homo sapiens cDNA FLJ1 4673 fis, clone NT2RP2003714, moderately similar to ZINC FINGER 

hypothetical protein FLJ20045 
ESTs 

KIAA0186 gene product 
ESTs 

Homo sapiens cDNA FLJ1 1963 fis, clone HEMBB1001051 
KIAA0029 protein 
nucieoporin 98kD 

gb:zm83c07.s1 Stratagene ovarian cancer (937219) Homo sapiens cDNA clone 3', mRNA 

Homo sapiens DNA binding peptide mRNA, partial cds 
Homo sapiens cDNA FLJ12048 fis, clone HEMBB1001990 
ESTs 

gb:zm89b09.r1 Stratagene ovarian cancer (937219) Homo sapiens cDNA clone 5' similar to 



103761 AA085138 AA765163 
BINDING PROTEIN (HUMAN);, mRNA sequence 



130237 L39060 
128752 RC N72879 
135162 AA045930 
131386 AA096412 
129021 RC AA599244 
424274 AA293634 
129913 H06583 
131888 U79298 
23803 mRNA 
118612 RC.N69466 
322026 AA203138 
110892 RC.N38882 
111429 RC_R01245 
113334 RC T76962 
104091 AA417310 
105246 RC_AA226879 



AA913909 
AA504428 
AH87925 
BE219898 
AL044675 
W73933 
NMJ01310 
AW294659 

AB037788 
AW024973 
AL035301 
AI038052 
AW974666 
BE465093 
AA226879 



Hs.153088 
Hs.10487 
Hs.95667 
Hs.173135 
Hs. 173081 
Hs.283738 
Hs.13313 
Hs.34054 

Hs.224961 

Hs.283675 

Hs.97375 

Hs.19162 

Hs.293024 

Hs.106101 



IMAGE:663856 3' similar to contains Alu repetitive element;, mRNA sequence. 



gb:nz79b10.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone 3' similar to gb:M34539 FK506- 

TATA box binding protein (TBP)-associated factor, RNA polymerase I, A, 48kD 
Homo sapiens, clone IMAGE:3954132, mRNA, partial cds 
F-box protein 30 

dual-specificity tyrosine-(Y)-phosphorylation regulated kinase 2 

KIAA0530 protein 

casein kinase 1, alpha 1 

cAMP responsive element binding protein-like 2 

Homo sapiens cDNA: FLJ22488 fis, clone HRC10948, highly similar to HSU79298 Human clone 

cleavage and polyadenylation specific factor 2, 100kD subunit 
NPD009 protein 

Rsapiens gene from PAC 106H8 

ESTs, Weakly similar to I54374 gene NF2 protein [H.sapiens] 

ESTs 

hypothetical protein FLJ22557 

gb:zr19c09.s1 Stratagene NT2 neuronal precursor 937230 Homo sapiens cDNA clone 



113300 RC T67448 T67448 Hs.13101 

117147 RC H97225 s AW901347 Hs.38592 

121349 RC AA405205 AA405205 Hs.97960 

100294 D49396 AA331881 Hs.75454 

133999 M28213 AA535244 Hs.78305 

133259 AA278548 BE379646 Hs.6904 

129423 AA371418 AA204686 Hs.234149 

131098 RC_AA459668 U66669 Hs.236642 

135272 AA399391 AI828337 Hs.97591 

129155 AA046865 AI952677 Hs.108972 



ESTs 

hypothetical protein FLJ23342 

ESTs, Weakly similar to T51 146 ring-box protein 1 [H.sapiens] 
peroxiredoxin 3 

RAB2, member RAS oncogene family 

Homo sapiens mRNA full length insert cDNA clone EUROIMAGE 2004403 
hypothetical protein FLJ20647 
3-hydroxyisobutyryl-Coenzyme A hydrolase 
ESTs 

Homo sapiens mRNA; cDNA DKFZp434P228 (from clone DKFZp434P228) 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65. 



70 



75 



311291 AA056319 AA782601 

120750 RC_AA310499 AI191410 

101002 J04058 AV655843 

133012 AA099241 AA847843 

103879 AA228148_s BE543269 

131281 RCJWI43212 AA251716 

115109 RCJ\A256383 AJ249977 

118502 RC_N67317 AL157488 

134100 L07540 AA460085 

131869 AA484944 AW968547 

115396 RC_AA282985 AA810854 

103860 AA203742 AW976877 

135089 N75611_s AI918035 

129938 U79300 AW003668 

107508 W90095 N74925 

103685 AA005190 AA158008 

125170 AA203147 AL020996 

129179 RC.AA504125 s AW969025 

116262 AA477046 AI936442 

123009 RC AA479949 AA535244 

131004 D29833 029833 

103317 X83441 X83441 

132814 RC_C15251_f D60730 

103992 U77718 BE018142 

109258 X59710 AL044818 

110754 RCJJ20814 AW302200 

132727 AA136382_S N27495 

100341 D63506 AF032922 

134664 AA256106 AA256106 

103826 AA165564 AW162998 

111678 RC R20628 R38487 

101341 L76159 NM.004477 

115455 RC_AA285068 AA876002 

111192 RCJVA477748 AW021968 

129385 RC AA235604 AA172106 

125050 RCJ79951 AW970209 

122105 RC.AA432278 AW241685 

121324 RC.AA404229 AA404229 

120938 RC.AA386260 AA386260 

115001 RC.AA251376 AA251376 
sequence. 

124799 RC.R45088 R45088 
sequence. 

122724 RC AA457395 AA457395 

117791 RC_N48325 N48325 

121895 RC_AA427396 AA427396 



Hs.319817 ESTs 

Hs.96693 ESTs, Moderately similar to 2109260A B cell growth factor [H.sapiens] 

Hs.1 69919 electron-transfer-flavoprotein, alpha polypeptide (glutaric aciduria II) 

Hs.62711 Homo sapiens, clone IMAGE:3351295, mRNA 

Hs.50252 mitochondrial ribosomal protein L32 

Hs.25227 ESTs 

Hs.88049 protein kinase, AMP-activated, gamma 3 non-catalytic subunit 

Hs.50150 Homo sapiens mRNA; cDNA DKFZp564B182 (from clone DKFZp564B182) 

Hs.171075 replication factor C (activator 1) 5 (36.5kD) 

Hs.33540 ESTs, Weakly similar to dJ309K20.4 [H.sapiens] 

Hs.89081 ESTs 

Hs.38057 ESTs 

Hs.301 198 roundabout (axon guidance receptor, Drosophila) homolog 1 

Hs.135587 Human done 23629 mRNA sequence 

Hs.38761 Homo sapiens cDNA: FLJ21564 fis, clone COL06452 

Hs.292444 ESTs 

Hs.8518 selenoprotein N 

Hs.1 09 154 ESTs 

Hs.59838 hypothetical protein FLJ1O808 

Hs.78305 RAB2, member RAS oncogene family 

Hs.2207 salivary proline-rich protein 

Hs.166091 ligase IV, DNA, ATP-dependent 

Hs.57471 ESTs 

Hs.300954 Huntingtin interacting protein K 

Hs.84928 nuclear transcription factor Y, beta 

Hs.6336 KIAA0672 gene product 

Hs.5565 hypothetical protein FLJ22626 

Hs.881 3 syntaxin binding protein 3 

Hs.87507 ESTs 

Hs.24684 KIAA1376 protein 

Hs.169927 ESTs 

Hs.203772 FSHD region gene 1 

Hs.1 20551 tolWike receptor 10 

Hs.1 09438 Homo sapiens clone 24775 mRNA sequence 

Hs.1 10950 Rag C protein 

Hs.111805 ESTs 

Hs.98699 ESTs 

Hs.97842 EST 

Hs.104632 EST 

gb:zs10a06.s1 NCI_CGAP_GCB1 Homo sapiens cDNA clone IMAGE:684754 3\ mRNA 
gb:yg38g04.s1 Soares infant brain 1NIB Homo sapiens cDNA clone IMAGE:34896 3\ mRNA 

Hs.99457 ESTs 

Hs.93956 EST 

gb:zw33a02.s1 Soares ovary tumor NbHOT Homo sapiens cDNA done IMAGE:771050 3' 



similar to contains Alu repetitive element;contains MER12.t2 MER12 repetitive element ;, mRNA sequence 



108244 RC.AA062839 
3', mRNA sequence. 
117852 RC_N49408 
109298 RC_AA205432 
122432 RC.AA447400 
124627 RC.N74625 



AA062839 gb:zm05c09.s1 Stratagene corneal stroma (937222) Homo sapiens cDNA clone IMAGE:513232 

AW8777B7 Hs.136102 KIAA0853 protein 

R77854 . Hs.250693 Krueppekelated zinc finger protein 

AA447400 Hs.187684 ESTs, Weakly similar to B34087 hypothetical protein [H.sapiens] 
N74625 gb:za55c03.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA done IMAGE:296452 3* 



similar to gb:M14338 VITAMIN K-DEPENDENT PROTEIN S PRECURSOR (HUMAN);contains OFR.b3 OFR repetitive element ;, mRNA sequence. 



115141 RC.AA258071 

128636 U49065 

115373 RC.AA282197 

114651 RC_AA101400 

132796 RC.AA180487 

103749 RCJJ35583 

107328 T83444 

115349 RC_AA281563 

111490 RC.R06862 



AA465131 

U49065 

AA664862 

AA101400 

NM_006283 

AL1 35301 

AW959891 

AF121176 

R06862 



Hs.64001 Homo sapiens clone 2521 8 mRNA sequence 

Hs.102865 interleukin 1 receptor-like 2 

Hs.181022 CGi-07 protein 

Hs.189960 ESTs 

Hs.1 731 59 transforming, acidic coiled-coil containing protein 1 

Hs.8768 hypothetical protein FLJ10849 

Hs.76591 KIAA0887 protein 

Hs.1 2797 DEAD/H (Asp-Glu-Ala-Asp/His) box polypeptide 16 

gb:yf11e09.s1 Soares fetal liver spleen 1NFLS Homo sapiens cDNA clone IMAGE:1 26568 3' 



similar to contains L1 repetitive element ;, mRNA sequence. 



103763 AA085354 AA085291 
contains Alu repetitive element;, mRNA sequence 

118791 RC.N75520 N75520 Hs.261003 

116644 RC.F03032 F03032 Hs.290278 

116823 RC_H56485 AW204742 Hs.143542 
[H.sapiens] 

108940 RC.AA148603 AA148603 
IMAGE:567198 3\ mRNA sequence. 

112218 RC.R50057 R50057 Hs.272251 

116557 RC.D20572 i D20572 Hs.90171 

133649 U25849 U25849 Hs.75393 

131745 RC_C20746 AI828559 Hs.31447 



gb:zn01g06.s1 Stratagene colon HT29 (937221) Homo sapiens cDNA clone 3' similar to 

ESTs, Moderately similar to B34087 hypothetical protein [H.sapiens] 

ESTs, Weakly similar to B34087 hypothetical protein [H.sapiens] 

ESTs, Highly similar to CSA_HUMAN COCKAYNE SYNDROME WD-REPEAT PROTEIN CSA 

gb:zo09e04.s1 Stratagene neuroepithelium NT2RAMI 937234 Homo sapiens cDNA clone 

Homo sapiens mRNA; cDNA DKFZp586M1418 (from clone DKFZp586M1418) 
EST 

acid phosphatase 1 , soluble 

ESTs, Moderately similar to A460 10 X-linked retinopathy protein [H.sapiens] 
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116801 RC_H43879 H43B79 
sequence. 

115006 RCLAA251548 AA251548 

123424 RC AA598500 H29882 

120831 RC_AA347919 AA347919 

103691 AA018298 AA018298 

121555 RC_AA412491 AF025771 

111193 RC_N67946 N67946 

132061 RCJ\A058946 AB020700 

134575 RC_AA194568_i AA194568 

115050 RC.AA252794 AA252794 

420208 U31799 BE276055 

133735 AC002045_xpt1 R66740 



NM_003478 

R40697 

R73588 

AF255910 

AW137912 



128546 Z21305 
111946 RC R40697 
124879 RC R73588 
115683 AA410345 
103692 AA018418 
(CACNAIF)gene, complete cds; HSP27 

103767 AA089688 BE244667 

125266 W90022 W90022 
PRECURSOR [H.sapiens] 

135235 AA435512 AW298244 

134497 RCJ\A404494 BE25B532 

426754 RC.AA278529J NMJ)14264 

412177 RC_AA342828_s Z23091 

132000 RC.AA044644 AW247017 

124738 WLAA044644 T07568 

324000 RC_AA196729_i AA604749 

106896 RC_AA196729J AW073202 

132000 RC AA025858 AW247017 
129577 RC_AA025858 
107091 RC_AA233519 
130296 RC.N52271 
102855 RC.N68399 
113689 RC^AA098874 

100939 RC_AA279667_s L04288 

130430 RC H22556 W27893 

106734 RC_N45979_s BE296690 
intersectin 2 long isoform (ITSN2) mRNA 

135148 RC_AA431288_s AA306478 

134221 RC_AA609862 BE280456 

105376 RC N35583 AW994032 

124541 U77718 AF 112222 

134546 AA203147 AL020996 

134000 RC W93092 AW1 75787 

125656 RC.W93092 AW516428 

100939 RC_N58561_s L04288 

125656 RCW93092 AW516428 

101779 RC_W69385_s BE543412 

332489 RC.R22947 R23053 

133000 ROI38959 f AL042444 

125905 RC N38959J AI678638 

129000 RC_H73050_s AA744902 

100920 RC_H73050_s X54534 



N75346 

AI949109 

D31139 

NMJ03528 

AB037850 



Hs.87886 
Hs.162614 
Hs.96889 
Hs.103332 
Hs.50123 
Hs.1 17569 
Hs.3830 
Hs.85938 
Hs.88009 
Hs.95972 
Hs.1 10613 
Hs.101299 
Hs.76666 
Hs.101533 
Hs.54650 
Hs.227583 
pseudogene, 
Hs.296155 
Hs.186809 

Hs.293507 

Hs.251871 

Hs.172052 

Hs.73734 

Hs.36978 

Hs.137158 

Hs.190213 

Hs.334825 

Hs.36978 

Hs.82906 

Hs.246885 

Hs.154103 

Hs.2178 

Hs.16621 

Hs.297939 

Hs.150580 

Hs.288173 

Hs.95327 

Hs.80248 

Hs.8768 

Hs.44499 

Hs.8518 

Hs.334841 

Hs.78687 

Hs.297939 

Hs.78687 

Hs.250505 

NA 

Hs.62402 
Hs.6456 
Hs.107767 
Hs.278994 



gb:yo69h09.s1 Soares breast 3NbHBst Homo sapiens cDNA clone IMAGE: 183233 3', mRNA 

EST 
ESTs 
EST 
ESTs 

zinc finger protein 189 
ESTs 

KIAA0893 protein 
EST 
ESTs 

silver (mouse homolog) like 
KIAA0220 protein 
cullin 5 

C9orf10 protein 
ESTs 

junctional adhesion molecule 2 

Homo sapiens chromosome X map Xp11.23 Uype calcium channel a!pha-1 subunit 
complete sequence; and JM1 protein, JM2 protein, and Hb2E genes, complete cds 
CGM00 protein 

ESTs, Highly similar to LCT2_HUMAN LEUKOCYTE CELL-DERIVED CHEMOTAXIN 2 
ESTs 

CTP synthase 
serine/threonine kinase 18 
glycoprotein V (platelet) 
melanoma antigen, family A, 3 
ESTs 
ESTs 

Homo sapiens cDNA FU14752 fis, clone NT2RP3003071 
melanoma antigen, family A, 3 
CDC20 (cell division cycle 20, S. cerevisiae, homolog) 
hypothetical protein FU20783 
LIM protein (similar to rat protein kinase C-binding enigma) 
H2B histone family, member Q 
DKFZP434I116 protein 
cathepsin B 

putative translation initiation factor 

Homo sapiens cDNA: FLJ21747 fis, clone COLF5160, highly similar to AF182198 Homo sapiens 

CD3D antigen, delta polypeptide (TiT3 complex) 
RNA-binding protein gene with multiple splicing 
hypothetical protein FLJ 10849 
pinin, desmosome associated protein 
selenoprotein N 
selenium binding protein 1 

neutral sphingomyelinase (N-SMase) activation associated factor 
cathepsin B 

neutral sphingomyelinase (N-SMase) activation associated factor 
retinoic acid receptor, alpha 
Hu01 Chip Redos 

p21/Cdc42/Rad -activated kinase 1 (yeast Ste20-related) 
chaperonin containing TCP1, subunit 2 (beta) 
hypothetical protein PR01489 
Rhesus blood group, CcEe antigens 
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TABLE 1A 

Table 1A shows the accession numbers for those pkeys lacking unigenelD's for Tables 1. The pkeys in Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gerie cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 



108469 
124106 
108501 
108562 
125008 
125020 
125066 
116661 
125104 
124575 
125263 
116845 
118417 



116761 1 

125446 J 

13684.-12 

36375.1 

1802095J 

116017.1 

1814993.1 

1532859.1 

413347.1 

1666649.1 

1547.2 

393481J 

37186.1 



118584 
103743 
103744 
103746 
103761 
103763 
120209 
120284 
112540 
111904 
121059 
121094 
114106 
130091 
122264 
108280 
129961 
130529 
108309 
107832 
123731 
116571 
132225 
125017 
125063 
125064 
100964 
125118 
102269 
125150 
116801 

118111 
118129 
118329 
118475 
111490 
111514 
104534 
120340 



532052J 
112194.1 
114161.1 
113452.1 
11 4208 J 
48290.6 
1531817.1 
158963.1 
1605263.1 
1719336.1 
273450.1 
275729.1 
1182096.1 
2396L-3 
296527.1 
110682.1 
1706092.1 
158447.1 
11 1495 J 

genbank^AA021473 
genbankJ\A609839 
genbankJD45652 
genbank.AAl 28980 
genbank_T68875 
genbank_T85352 



AA079487 AA128547 M128291 AA079587 AA079600 

H12245 AA094769 R14576 

AA083256 

AA100796 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA07931 1 AA079323 AA085274 

T91251 T64891 T85665 

T69981 T69924 AA078476 

T86284 T81933 

R61504F04247 

T95590 AA703278 H62764 

N68168 N69188 N90450 

AA098878 W88942 

AA649530 AA659316 H64973 

AF080229 AF080231 AF080230 AF080232 AF080233 AF080234 BE550633 AI636743 AW614951 BE467547 AI6B0833 
AI633818 N29986 U87592 U87593 U87590 U87591 S46404 U87587 AA463992 AW206802 AI970376 AI58371 8 AI672574 
N25695 AW665466 AI818326 AA1 26128 AI480345 AW013827 AA248638 AI214968 AA204735 AA207155 AA206262 AA204833 
AW003247 AW496808 AI080480 AI631703 AI651023 A1867418 AW818140 AA502500 AI206199 AI671282 AI352545 BE501030 
AI652535 BE465762 AA206331 AW451866 AA471088 AA206342 AA204834 AA206100 AW021661 AA332922 N66048 
AA703396 H92278 AW139734 H92683 U87589 U87595 H69001 U87594 BE466420 AI624817 BE46661 1 AI206344 AA574397 
AA348354 AI493192 

AW136928 AI685655 BE218584 BE465078 N68963 AA975338 BE147199N76377 
AA075998 AA075999 AA070986 AA070896 AA129207 AA078942 AA070783 AA078941 
AA079267 AA076003 
AA075000AA081876 

AA765163 AW298222 AA126126 AA085138 AA076068 
AA085291 AA085354 
F02951 Z40892F04711 
AA179655 AA1B2626 AA1B2603 
R69751 R70467 H69771 H80879 H80878 
Z41572R39330 
AA393283 AA398628 
AA402505 AA393900 
AW602528 BE073859 Z38412 
W88999 

AA436837AA442594 
AA065069 M085108 
R23053 R79884 R76271 
AA178953AA192740 

AA069818AA069971 M069923 AA069908 
AA021473 
M609839 
D45652 
AA1 28980 
T68875 
T85352 
T85373 



genbank.T85373 
entrez_J00212J00212 
149288J R10606 T97620 AA576309 
entrezJJ30245U30245 

NOT_FOUND.entrezJV38240 W38240 

genbank_H43879 H43879 

genbank_N55493 N55493 

genbank_N57493 N57493 

genbank_N63520 N63520 

genbank_N66845 N66845 

genbank_R06852 R06862 

genbank_R07998 R07998 
R22303_at R22303 

genbankJ\A206828 AA206828 
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120376 genbank_AA227469 
104787 genbanLAA027317 
120409 genbankJ\A235050 
120745 genbank. J AA302809 
120809 genbank_AA346495 
120839 genbanK_AA348913 
113702 genbankJ97307 
115001 genbankJ\A251376 
122562 genbankJ\A452156 
122635 genbank_AA454085 
108244 genbank_AA062839 
108277 genbankJ\A064859 
122723 genbank_AA457380 
124028 genbank_F04112 
108403 genbank_AA075374 
122860 genbank_AA464414 
108427 genbank_AA076382 
108439 genbank AA078986 
131353 231290J 
AA969360 

108533 genbank_AA084415 
117031 genbanleH88353 
124254 genbanLH69899 
101447 entrez_M21305 
101458 entrezJ/122092 
124577 genbank_N68300 

108940 genbank_AA148603 

108941 genbank_AA148650 
124627 genbankJ474625 
124720 144582J 

124793 genbank_R44519 
124799 genbanLR45088 
117683 genbank_N40180 
117732 genbank_N46452 
124991 genbankJ50116 
119023 genbanK_N98488 
119239 95573J 



AA227469 

AA027317 

AA235050 

AA302809 

AA346495 

AA348913 

T97307 

AA251376 

AA452156 

AA454085 

AA062839 

AA064859 

AA457380 

F04112 

AA075374 

AA464414 

AA076382 

AA078986 

AW411259 H23555 AW015049 AI684275 AW015886 AW068953 AW014085 AI027260 R52686 AA918278 AI129462 

N34869 AI948416 AA534205 AA702483 AA705292 

AA084415 

H88353 

H69899 

M21305 
M22092 



N68300 
AA148603 
AA148650 
N74625 

R05283R11056 

R44519 

R45088 

N40180 

N46452 

T50116 

N98488 

T11483T11472 



119558 NOT_FOUND_entrez_W38194 

119654 genbank_W57759 W57759 

105246 genbank_AA226879 AA226879 

121350 genbank J\A405237 AA405237 

121558 genbank_AA412497 AA412497 

105985 genbank_AA406610 AA406610 

100071 entrez__A28102A28102 

114648 genbankJ\A101056 AA101056 

121895 genbank_AA427396 AA427396 

100327 enlre2_D55640D55640 

123315 714071J AA496369 AA496646 



W38194 
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Unique Eos probeset identifier number 
Accession number used for previous patent filings 
Exemplar Accession number, Genbank accession number 
Unigene number 
Unigene gene title 



TABLE 2: 

Pkey: 

Accession: 
ExAccn: 
UnigenelD: 
Unigene Title: 



Pkey Accession ExAccn UnigenelD 



100420 


100420 


D86983 


■ ft A A A AAA 

Hs. 118893 


100484 


100484 


till ^\ A m* ft A A. 

NM_005402 


1 t A A A — T W-—T 

Hs.288757 


A A A A A A 

100991 


100991 


1 A A A A A 

J03836 


i i #i a. a. A r* 

Hs.82085 


101168 


101168 


ft 1 ■ M A A A A A A 

NMJ305308 


i i A J A r*AA 

Hs.211569 


101261 


101261 


D30857 


1 1 A AA ^ A 

Hs.82353 


101447 


101447 


M21305 




101543 


101543 


M31166 


Hs.2050 


101560 


101560 


AW958272 


Hs.347326 


101714 


101714 


M68874 


Hs.211587 


101838 


101838 


BE243845 


Hs.75511 


102012 


102012 


BE259035 


Hs.1 18400 


102164 


102164 


NM000107 


Hs.77602 


102283 


102283 


AW161552 


Hs.83381 


102564 


102564 


U59423 


Hs.79067 


102759 


102759 


NM_005100 


Hs.788 


102804 


102804 


NM_002318 


Hs.83354 


102898 


102898 


N1VL002205 


Hs.149609 


103036 


103036 


M13509 


Hs.83169 


103095 


103095 


NMJ505424 


Hs.78824 


103166 


103166 


AA1 59248 


Hs. 180909 


103280 


103280 
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myotubularin related protein 1 
hypothetical protein FLJ10849 
hypothetical protein FLJ20039 
diptheria toxin resistance protein requi 
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 
KIAA1033 protein 

Homo sapiens mRNA; cDNA DKFZp434l0812 (f 
hypothetical protein FU11323 
histone deacetylase 3 
ESTs 

sec13-!ike protein 

gb:zv15b10.s1 Scares JlhHMPu_S1 Homo sapi 

Homo sapiens cDNA FLJ20738 fis, clone HE 

DKFZP434I1 16 protein 

similar to S. cerevisiae SSM4 

Homo sapiens cDNA FLJ10643 fis, clone NT 

ESTs, Weakly similar to ALU INHUMAN ALU S 

hypothetical protein FLJ10210 

KIAA0766 gene product 

Rho guanine exchange factor (GEF) 15 

hypothetical protein MGC5306 

cofactor required for Sp1 transcriptiona 

neuropilin 1 

Homo sapiens cDNA FLJ10071 fis, clone HE 
ESTs, Moderately similar to ALU7JHUMAN A 
ribosomal protein L4 

gb:RC2-CT0321-131199-011-c01 CT0321 Homo 

pannexin 1 

KIAA0648 protein 

ESTs 

ESTs 

chromosome 1 1 open reading frame 5 
ESTs, Weakly similar to ALU8JHUMAN ALU S 
ESTs 

RAD51 (S. cerevisiae) homolog (E coli Re 
peptidylprolyl isomerase (cyclophilin)-l 
programmed cell death 6-interacting prot 
LUC7 (S. cerevisiae)-like 
hypothetical protein FLJ1 1011 
scavenger receptor with C-type lectin 
UDP-N-acetyl-alpha-D^alactosamine:po!yp 
nucleophosmin/nucleoplasmln 3 
K1AA0887 protein 
prepronociceptin 

Human clone 23629 mRNA sequence 
polymerase (RNA) III (DNA directed) (62k 
hypothetical protein DKFZp434E2220 
DKFZP564A2416 protein 
NS1 -associated protein 1 
hypothetical protein PR01855 
H4 histone family, member G 
YY1 transcription factor 
hypothetical protein FLJ13693 
ESTs 
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107832 


107832 


AA021473 




124337 


124337 


N23541 


Hs.281561 


129577 


129577 


N75346 


Hs.306121 


132000 


132000 


AW247017 


Hs.36978 


107935 


107935 


AA029428 


Hs.61555 


131461 


131461 


AA992841 


Hs.27263 


108029 


108029 


AA040740 


Hs.62007 


108084 


108084 


AA058944 


Hs.116602 


108168 


108168 


AI453137 


Hs.63176 


108189 108189 


AW376061 


Hs.63335 


108203 

1 WWfcW w 


108203 


AW847814 


Hs.289005 


108217 


108217 


AA058686 


Hs.62588 


108277 


108277 


AA064859 




108309 


108309" 


AA069818 




108340 


108340 


AA069820 


Hs.180909 


108427 


108427 


AA076382 




108439 


108439 


AA078986 




108469 


108469 

1 WVi Ww 


AA079487 




108501 


108501 


AA083256 




108562 


108562 

I www** 


AA100796 




130890 


130890 

I w w ww V 


AI907537 


Hs.76698 


130385 


130385 


AW067800 


Hs.155223 


108807 


108807 


AI652236 


Hs.49376 


108833 


108833 

J ww www 


AF1 88527 


Hs.61661 


108846 


108846 


AL117452 


Hs.44155 


131474 


131474 


L46353 

www 


Hs.2726 


108941 


108941 


AA148650 




108996 


108996 


AW995610 

J » 1 » w w w w ■ w 


Hs.332436 


131183 


131183 


AI611807 


Hs.285107 


109022 


109022 


AA157291 


Hs.21479 


109068 


109058 


AA1 64293 


Hs.72545 


129021 


129021 


AL044675 


Hs.173081 


109146 


109146 


AA176589 


Hs.142078 


131080 


131080 


NMJJ01955 


Hs.2271 


109222 


109222 


AA192833 


Hs.333512 


109481 


109481 


AA878923 


Hs.289069 


109516 


109516 


AI471639 


Hs.71913 


109556 


109556 


AI925294 


Hs.87385 


109578 


109578 


F02208 


Hs.27214 


109625 


109625 


H29490 


Hs.22697 


109648 


109648 


H17800 


Hs.7154 


109599 


109699 


H18013 


Hs.167483 


109933 


109933 


R52417 


Hs.20945 


110039 


110039 


H11938 


Hs.21907 



gb:ze66c11.s1 Soares retina N2b4HR Homo 
Homo sapiens cDNA: FU23582 fis, clone L 
CDC20 (cell division cycle 20, S. cerevi 
melanoma antigen, family A, 3 
ESTs 

K1AA1458 protein 
ESTs 

Homo sapiens, done !MAGE:41 54008, mRNA, 
ESTs 

ESTs, Moderately similar to A46010 X-lin 
Homo sapiens cDNA: FU21532 fis, clone C 
ESTs 

gb:zm50fl)3.s1 Stratagene fibroblast (937 
gb:zm67e03.r1 Stratagene neuroepithelium 
peroxiredoxin 1 

gb:zm91g08.s1 Stratagene ovarian cancer 
gb:zm92h01.s1 Stratagene ovarian cancer 
gb:zm97f08.s1 Stratagene colon HT29 (937 
gb:zn08g12.s1 Stratagene hNT neuron (937 
gbzm26cG6.s1 Stratagene pancreas (93720 
stress-associated endoplasmic reticulum 
stanniocalcin 2 
hypothetical protein FLJ20644 
ESTs, Weakly similar to AF174605 1 F-box 
DKFZP586G1517 protein 
high-mobiiity group (nonhistone chromoso 
gb:zo09e06.s1 Stratagene neuroepithelium 
EST 

hypothetical protein FLJ13397 

ubinuctein 1 

ESTs 

KIAA0530 protein 
EST 

endothelin 1 

similar to rat myomegalin 

hypothetical protein FLJ21016 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens clone 24993 mRNA sequence 
histone acetyltransferase 
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TABLE 2A 

Table 2A shows the accession numbers for those pkeys lacking unigenelD's for Table 2. The pkeys in Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwlst, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey; Unique Eos probeset Identifier number 

CAT number. Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accession 



117156 145392.1 



125565 
132983 



133681 13893.1 



108469 116761.1 AA079487 AA128547 AA128291 AA079587 AA079600 
108501 13684.-12 AA083256 

108562 36375 1 AA100796 AF020589 AA074629 AA075946 AA100849 AA085347 AA126309 AA079311 AA079323 AA085274 

101300 4669.7 BE53551 1 M62098 AA306787 AW891766 AA348998 AA338869 AA344013 AW956561 AW389343 AW403607 L40391 

AW408435 AA121738 A1568978 H13317 R20373 AW948724 AW948744 AA335023 AA436722 AA448690 C21404 
AW884390 AA345454 AA303292 AA174174 BE092290 T90614 AA035104 R76028 AA1 26924 AA741086 AW022056 
AW1 18940 AA121666 AI832409 AA683475 A1140901 A1623576 AW519064 AW474125 AI953923 AI735349 AW150109 
AI436154 AW118130 AW270782 AI804073 N27434 AA876543 AA937815 AI0511 66 AA505378 AI041975 AI335355 
A1089540 AA662243 AI127912 AI925604 AI250880 AI366874 AI564386 AI815196 AI683526 AI435885 AI160934 H79030 
AI801493 AA446691 AI673767 A1076042 AI804327 AA813438 AA680002 AI274492 T16177 AI287337 AI935050 
AA907805 AA911493 AI589411 AI371358 AW576236 AI078866 AW516168 AA346372 A1560185 AA471009 R75857 
AA296025 AA523155 AA853168 AI696593 AI658482 AI566601 AW072797 AA128047 AA035502 AW243274 AA992517 
R43760 

W73853 AA9281 1 2 W77887 AW889237 AA148524 AI749182 AI754442 AI338392 AI253102 AI079403 A1370541 AI697341 
H97538 AW188021 AI927669 W72716 A1051402AI188071 AI335900 N21488 AW770478 W92522 AJ691028A1913512 
AI144448 W73819 AA604358 N28900 W95221 AI868132 H98465 AA148793 
1704098.1 R20840 R20839 

1 1922 1 M30269 NM.002508 X82245 AI078760 AW957003 D78945 M27445 AA650439 AL048816 AV660256 AV660347 

M333052 BE295257 T60999 AA383049 AW369677 Z26985 AW175704 AA343326 AW747957 AI818389 W17308 
W17302 H15591 AA371284 AA370412 W94966 BE384365 T28498 R80714 R16959 H21723 AW835154 D56097 D56381 
W21232 AA1 90565 AW379755 AW067895 

AI352558 Z82248 X78138 NM.003405 AU077248 AA223125 S80794 D78577 A1124697 AW403970 BE614089 BE296713 
BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA084159 BE538808 AW401377 AA256774 C03366 W46595 W47608 AA305009 H69431 H69456 AL120082 
H1 1706 AA303717 AA361357 H22042 H78020 AW999584 AA134368 AA32291 1 AA322961 H60980 N85248 N31547 
H79624 T1 1718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA608688 H82692 BE155392 AW383636 BE155394 M487004 AW383504 A1342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25364 AA367935 AI559406 AA033522 AA374252 
AW835019 AI922133 AI697089 N99662 AW189078 AI199076 AW151598 W59944 AA662875 W94022 AA299055 
AI039008 AI829449 AA583503 A1635674 AW131665 AI473820 AW2731 18 AW900930 AA908944 AI688035 AW170272 
AI082545 AW468176 AI608761 AI082748 AI911682 AI248943 AI831016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905196 AI191245 AI470204 AH88296 AI421367 AI125315 AI087141 AA629032 AA740589 AI554181 
AA150830 AI248541 AI077943 AA775958 AA864930 AI261476 AI123121 AI310394 AA862331 AA872478 BE537084 
A1205606 M720684 A1872093 AW150042 AL120538 M219627 AA988608 C21397 A1359337 H25337 A1089749 
AA605146 AI359620 AA150478 AI359738 AW383642 AW995424 A1766457 R56892 AI089839 W61343 N69107 W46459 
AA565955 N20527 AI279762 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA126874 D82630 W65437 
AI086917 AW382095 A1086877 H69844 AW340217 W85827 L08439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 M155615 AW383529 
AA491520 AW028427 AA171496 AI469689 AW664539 AI81 1 102 AI81 1 1 16 BE464590 BE350791 H78021 T15405 H21979 
AA219489 H13301 AA505883 AI864305 AI423963 AW084401 F04963 R69858 H67097 A1917740 AI655561 H69864 
AA033631 AW383484 AI886261 H25293 AA513281 AW271187 H11617 N79982 AI174338 AI904207 AI904208 BE614558 
W94127 W65436 A1272249 AA700018 AI579932 AI085941 AW152629 
121335 279548 1 AA404418 A1217248 

130018 18986 J AA353093 AW957317 AW872496 AI560785 AI289110AW13551 2X97261 T68873 

121822 244391 1 AI743860 N49543 AW027759 BE349467 AJ656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI611424 

AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AI61 161 1 A1611407 AI61 1452 W56437 AI284566 
AI583349 AW183058 AI308085 AI074952 AA437315 M628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261373 AI432414 AI984994 AI539335 AA401550 AA35B757 AI609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 AI580502 AI832944 AI038530 AA425107 AI014986 AI148349 AW237721 AW779756 
AW137877 AI125293 AA400404 R28554 

108309 1 1 1495.1 AA069818 AA069971 AA069923 AA069908 

107832 genbank AA021473 AA021473 

123523 genbank_AA608588 AA608588 

123964 genbank.C13961 C13961 

118475 genbank N66845 N66845 

104787 genbanklAA027317 AA027317 

106596 304084 1 A1583948 AA578212 AW303715 AA653450 AA456981 AI400385 W88533 AI224133 AW272145 AA088686 R94698 

113947 genbank_W84768W84768 

108277 genbank_AA064859 AA064859 
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108427 genbankJ\A076382 AA076382 

108439 genbanK_AA078986 AA078986 

131353 231290 1 AW41 1259 H23555 AW015049 AI684275 AW015886 AW068953 AW014085 A1027260 R52686 AA918278 AH29462 

AA969360 N34869 AI948416 AA534205 AA702483 AA705292 

101447 entrez_M21305 M21305 

108931 genbanK,AA147186 AA147186 

108941 genbank_AA148650 AA148650 

103138 entrez_X65965 X65965 

119174 genbankj*71234 R71234 

119416 genbank_J97186 T97186 

105985 genbank^AA406610 AA406610 

100327 entrez_D55640 D55640 
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TABLE 3: 



Pkey: 
Accession: 
ExAccn: 
UntgeneiD: 
Unigene Title: 



Unique Eos probeset identifier number 
Accession number used for previous patent filings 
Exemplar Accession number, Genbank accession number 
Unigene number 
Unigene gene title 



Pkey Accession ExAccn UniGene UnigeneTitle 



100405 

100420 

100481 

100484 

100718 

100991 

101097 

101168 

101194 

101261 

101345 

101447 

101485 

101543 

101550 

101560 

101674 

101714 

101741 

101838 

101857 

102012 

102024 

102164 

102241 

102283 

102303 

102564 

102663 

102759 

102778 

102804 

102887 

102898 

102915 

103036 

103037 

103095 

103158 

103166 

103185 

103280 

103554 

103850 

104465 

104592 

104764 

104786 

104850 

104865 

104894 

104952 

104974 

105178 

105263 

105330 

105376 

105729 

105826 

105977 

10S008 

106031 

106124 



D86425 
D86983 

HG1098-HT1098 

HG1103-HT1103 

HG3342-HT3519 

J03764 

L06797 

L15388 

L20971 

L35545 

L76380 

M21305 

M24736 

M31166 

M31551 

M32334 

M61916 

M68874 

M74719 

M92934 

M94856 

U03057 

U03877 

U18300 

U27109 

U31384 

U33053 

U59423 

U70322 

U81607 

U83463 

U89942 

X04729 

X06256 

X07820 

X54925 

X54936 

X60957 

X67235 

X67951 

X69910 

X79981 

Z18951 

AA187101 

N24990 

R81003 

AA025351 

AA027168 

AA040465 

AA045136 

AA054087 

AA071089 

AA085918 

AA187490 

AA227926 

AA234743 

AA236559 

AA292694 

AA398243 

AA406363 

AA4 11465 

AA412284 

AA423987 



AW291587 Hs.82733 
D86983 Hs.1 18893 
X70377 Hs.121489 
NMJ)05402Hs.288757 
BE295928 Hs.75424 
J03836 Hs.82085 
BE245301 Hs.89414 
NM_005308Hs.211569 
L20971 Hs.1 88 
D30857 Hs.82353 
NM_005795Hs.152175 
M21305 

AA296520 Hs.89546 
M31166 Hs.2050 
Y00630 Hs.75716 
AW958272 Hs.347326 
NM_002291Hs.82124 
M68874 Hs.211587 
NM 003199Hs.326198 
BE243845 Hs.75511 
BE550723 Hs.153179 
BE259035 Hs.1 18400 
AA301867 Hs.76224 
NM_000107Hs.77602 
NM_007351Hs.268107 
AW161552 Hs.83381 
U33053 Hs.2499 
U59423 Hs.79067 
NM_002270Hs.168075 
NM_005100Hs.788 
AF000652 Hs.8180 
NMJ)02318Hs.83354 
J03836 Hs.82085 
NM_002205Hs.149609 
X07820 Hs.2258 
M13509 Hs.83169 
BE018302 Hs.2894 
NMJ)05424Hs.78824 
BE242587 Hs.118651 
AA159248 Hs.180909 
NM_006825Hs.74368 
U84722 Hs.76206 
AI878826 Hs.74034 
AA187101 Hs.213194 
Z44203 Hs.26418 
AW630488 Hs.25338 
AI039243 Hs.278585 
AA027167 Hs.10031 
AL133035 Hs.8728 
T79340 Hs.22575 
AF065214 Hs.1 8858 
AW076098 Hs.345588 
Y12059 Hs.278675 
AA313825 Hs.21941 
AW388633 Hs.6682 
AW338625 Hs.22120 
AW994032 Hs.8768 
H46612 Hs.293815 
AA478756 Hs.194477 
AK001972 Hs.30822 
AB033888 Hs.8619 
X64116 Hs.171844 
H93366 Hs.7567 



nidogen 2 

Melanoma associated gene 
cystatin D 

v-ral simian leukemia viral oncogene horn 
inhibitor of DNA binding 1, dominant neg 
serine (or cysteine) proteinase inhibito 
chemokine (C-X-C motif), receptor 4 (fus 
G protein-coupled receptor kinase 5 
phosphodiesterase 4B, cAMP-specific (dun 
protein C receptor, endothelial (EPCR) 
calcitonin receptor-like 
gb:Human alpha satellite and satellite 3 
selectin E (endothelial adhesion molecul 
pentaxin-related gene, rapidly induced b 
serine (or cysteine) proteinase Inhibito 
intercellular adhesion molecule 2 
laminin, beta 1 

phospholipase A2, group IVA (cytosolic, 
transcription factor 4 
connective tissue growth factor 
fatty acid binding protein 5 (psoriasis- 
singed (Drosophila)-like (sea urchin fas 
EGF-containing fibulin-like extracellula 
damage-specific DNA binding protein 2 (4 
multimerin 

guanine nucleotide binding protein 11 

protein kinase Olike 1 

MAD (mothers against decapentaplegic, Dr 

karyopherin (importin) beta 2 

A kinase (PRKA) anchor protein (gravin) 

syndecan binding protein (syntenin) 

lysyl oxidase-like 2 

serine (or cysteine) proteinase inhibito 

integrin, alpha 5 (fibronectin receptor, 

matrix metalloproteinase 10 (stromelysin 

matrix metalloproteinase 1 (interstitial 

placental growth factor, vascular endoth 

tyrosine kinase with immunoglobulin and 

hematopoietically expressed homeobox 

peroxiredoxin 1 

transmembrane protein (63kD), endoplasmi 
cadherin 5, type 2, VE-cadherin (vascula 
caveolin 1, caveolae protein, 22kD 
hypothetical protein MGC10895 
ESTs 

protease, serine, 23 
ESTs 

KIAA0955 protein 

hypothetical protein DKFZp434G171 . 
B-cell CLUIymphoma 6, member B (zinc fi 
phospholipase A2, group lVC (cytosolic, 
desmopfakin {DPI, DPII) 
bromodomaln-containing 4 
AD036 protein 

solute carrier family 7, (cationic amino 
ESTs 

hypothetical protein FLJ10849 

Homo sapiens HSPC285 mRNA, partial cds 

E3 ubiquitin ligase SMURF2 

hypothetical protein FLJ 1 1 1 1 0 

SRY (sex determining region Y)-box 18 

Homo sapiens cDNA: FLJ22296 fis, clone H 

Homo sapiens cDNA: FU21962 fis, clone H 



120 



WO 02/079492 



106155 
106302 
106423 
106793 
107174 
107216 
107295 
107385 
108756 
108846 
108888 
109001 
109166 
109456 
109768 
110107 
110906 
110984 
111006 
111018 
111133 
111760 
113073 
113195 
113923 
114521 
115061 
115096 
115145 
115819 
115947 
116314 
116339 
116430 
116589 
116733 
117023 
117186 
117563 
117997 
118475 
118581 
119073 
119155 
119174 
119221 
119416 
119866 
121335 
121381 
123160 
123473 
123523 
123533 
123964 
124006 
124315 
124659 
124669 
124847 
124875 
125091 
125103 
125355 
125565 
125590 
126511 
126563 
126649 
126872 
127402 
127651 
127759 
128062 
128992 
129046 



AA425309 
AA435896 
AA448238 
AA478778 
AA621714 
D51069 
T34527 
U97519 
AA127221 
AA1 32983 
AA1 35606 
AA156125 
AA179845 
AA232645 
F10399 
H16772 
N39584 
N52005 
N53375 
N54067 
N64436 
R26892 
T33637 
T57112 
W80763 
AA046808 
AA253217 
AA255991 
AA258138 
AA426573 
AA443793 
AA490588 
AA496257 
AA609717 
D59570 
F13787 
H88157 
H98988 
N34287 
N52090 
N66845 
N68905 
R32894 
R61715 
R71234 
R98105 
T97186 
W80814 
AA404418 
AA405747 
AA488687 
AA599143 
AA608588 
AA608751 
C13961 
D60302 
H94892 
N93521 
N95477 
R60044 
R70506 
T91518 
T95333 
R45630 
R20839 
. R23858 
AI024874 
W26247 
AA856990 
AA1 36653 
AA358869 
A1123976 
AI369384 
AA379500 
R49693 
AA195678 



AA425414 Hs.33287 

AA398859 Hs.18397 

AB020722 Hs.16714 

H94997 Hs.16450 

BE122762 Hs.25338 

D51069 Hs.211579 

M186629 Hs.80120 
NM_005397Hs.16426 

AA1 27221 Hs.1 17037 

AL1 17452 Hs.44155 

AA135606 Hs.189384 

AI056548 Hs.72116 

AA219691 Hs.73625 

AW956580 Hs.42699 

F06838 Hs.14763 

AW151660 Hs.31444 

AA035211 Hs.17404 

AW613287 Hs.80120 

BE387014 Hs.166146 

AI287912 Hs.3628 

AW580939 Hs.97199 

BE551929 Hs.268754 

N39342 Hs.103042 

H83265 Hs.8881 

AW953484 Hs.3849 

AW139036 Hs.108957 

AI751438 Hs.41271 

AI683069 Hs.175319 

AA740907 Hs.88297 

AA486620 Hs.41135 

R47479 Hs.94761 

AI799104 Hs.178705 

AK000290 Hs.44033 

AK001531 Hs.66048 

AI557212 Hs.17132 

AL157424 Hs.61289 

AW070211 Hs.102415 

H98988 Hs.42612 

AF055634 Hs.44553 

N52090 Hs.47420 
N66845 
N68905 
BE245360 
R61715 
R71234 
C14322 
T97186 

AA496205 Hs.193700 
AA404418 

AW088642 Hs.97984 

AA488687 Hs.284235 

AA599143 

AA608588 

AA608751 

C13961 

AI147155 Hs.270016 
NM 005402HS.288757 

AI680737 Hs.289068 

AI571594 Hs.102943 

W07701 Hs.304177 

AI887664 Ks.285814 
T91518 

AA570056 Hs.122730 

R60547 Hs.170098 
R20840 

R23858 Hs.143375 

T92143 Hs.57958 

AA516391 Hs.181368 

AA001860 Hs.279531 
AW450979 

AA358869 Hs.227949 

AA382523 Hs.105689 

AI369384 Hs.292441 

AA379621 Hs. 105547 

H04150 Hs.107708 

AB029290 Hs.108258 



Hs.279477 
Hs.310598 

Hs.250700 



nuclear factor l/B 

hypothetical protein FLJ23221 

Rho guanine exchange factor (GEF) 15 

ESTs 

ESTs 

melanoma cell adhesion molecule 

UDP-N-acetyl-a!pha-D-galactosamine:polyp 

podocalyxin-fike 

ESTs 

DKFZP586G1517 protein 

gb:zl10a05.s1 Soares_pregnanLuterus_NbH 

hypothetical protein FU20992 similar to 

RAB6 interacting, kinesin-like (rabkines 

ESTs 

ESTs 

ESTs 

ESTs 

UDP-N-acetyl-alpha-D-galactosamine:poryp 
Homer, neuronal immediate early gene, 3 
mitogen-activated protein kinase kinase 
complement component C1q receptor . 
Homo sapiens cDNA FU1 1949 fis, clone HE 
microtubule-associated protein 1B 
ESTs, Weakly similar to S41044 chromosom 
hypothetical protein FLJ22041 similar to 
40S ribosomal protein S27 isoform 
Homo sapiens mRNA full length insert cDN 
ESTs 
ESTs 

endomucin-2 
KIAA1691 protein 

Homo sapiens cDNA FLJ11333 fis, clone PL 

dipeptidyl peptidase 8 

hypothetical protein FLJ10669 

ESTs, Moderately similar to I54374 gene 

synaptojanin 2 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs, Weakly similar to ALU1.HUMAN ALU S 
unc5 (C.elegans homolog) c 
EST 

gb:za46c11.s1 Soares fetal liver spleen 
gb:za69b09.s1 Soares_fetalJung_NbHL1 9W 
ESTs 

ESTs, Moderately similar to ALU1.HUMAN A 
gb:yi54c08.s1 Soares placenta Nb2HP Homo 
tryptese beta 1 

gb:ye50h09.s1 Soares fetal iiver spleen 
Homo sapiens mRNA; cDNA DKFZp586l0324 (f 
gb:zw37e02.s1 Soares_total_fetus__Nb2HF8_ 
hypothetical protein FLJ22252 similar to 
ESTs, Weakly similar to 138022 hypotheti 
gb:ae52d04.s1 Stratagene lung carcinoma 
gb:ae54e06.s1 Stratagene lung carcinoma 
gb:ae56h07.s1 Stratagene lung carcinoma 
gb:C13981 Clontech human aorta po!yA+ mR 
ESTs 

v-ral simian leukemia viral oncogene horn 

Homo sapiens cDNA FLJ11918 fis, clone HE 

hypothetical protein MGC12916 

Homo sapiens clone FLB8503 PR02286 mRNA, 

sprouty (Drosophila) homofog 4 

gb:ye20f05.s1 Stratagene lung (937210) H 

ESTs, Moderately similar to KIAA1215 pro 

KIAA0372 gene product 

gb:yg05c08.r1 Soares infant brain 1N1B H 

Homo sapiens, clone IMAGE;3840937, mRNA, 

EGF-TM7-latrophiiin-reIated protein 

U5 snRNP-specific protein (220 kD), orth 

ESTs 

gb:UI-H-BI3-a!a-a-12-0-Ul.s1 NCLCGAP.Su 

SEC13(S.cerevisiae)-like1 

MSTP031 protein 

ESTs 

neural proliferation, differentiation an 
ESTs 

actin binding protein; macrophin (microf 
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129188 

129314 

129371 

129468 

129765 

129805 

129884 

130495 

130639 

130657 

130828 

130972 

131080 

131137 

131182 

131486 

131573 

131647 

131756 

131859 

131881 

132050 

132083 

132164 

132358 

132413 

132456 

132490 

132676 

132687 

132718 

132736 

132760 

132933 

132968 

132994 

133061 

133147 

133161 

133200 

133260 

133363 

133491 

133517 

133550 

133607 

133614 

133627 

133691 

133696 

133913 

133975 

133985 

134039 

134088 

134161 

134299 

134416 

134453 

134656 

134989 

135051 

135073 

135349 

100114 

100130 

100143 

100168 

100208 

100224 

100405 

100420 

100455 

100529 

100618 

100619 



M30257 

AA028131 

M10321 

J03040 

M86933 

AA012933 

AA286710 

AA243278 

059711 

T94452 

AA053400 

AA370302 

J05008 

U85193 

AA256153 

X83107 

AA046593 

AA410480 

D45304 

M90657 

AA010163 

AA1 36353 

Y07867 

U84573 

X60486 

AA1 32969 

AA1 14250 

F13782 

AA283035 

AB002301 

AA056731 

U68019 

K99198 

AA598702 

N77151 

AA505133 

AB000584 

D12763 

AA253193 

AA432248 

AA083572 

AA479713 

L40395 

X52947 

W80846 

M34539 

D67029 

U09587 

M85289 

D10522 

W84712 

D29992 

L34657 

S78569 

043636 

U97188 

AA487558 

M28882 

X70683 

X14787 

AA236324 

C15324 

AA452000 

D83174 

D00596 

D11428 

D13640 

D14874 

D26129 

D28476 

D86425 

086983 

D87953 

HG1862-HT1897 
HG2614-HT2710 
HG2639-HT2735 



NM_001078Hs.109225 
BE622768 Hs.290356 
X06828 Hs.1 10802 
AW410538 Hs.111779 
M86933 Hs.1238 
AA012848 Hs.12570 
AF055581 Hs.13131 
AW250380 Hs.109059 
AI557212 Hs.17132 
AW337575 Hs.201591 
AW631469 Hs.203213 
D81866 Hs.21739 
NM 001955HS.2271 
W27392 Hs.33287 
AI824144 Hs.23912 
F06972 Hs.27372 
AA040311 Hs.28959 
AA359615 Hs.30089 
AA443966 Hs.31595 
AW960564 
AW361018 Hs.3383 
AI267615 Hs.38022 
BE386490 Hs.279663 
AI752235 Hs.41270 
NM_Q03542Hs.46423 
AW361383 Hs.260116 
AB011084 Hs.48924 
NJVL001290HS.4980 
N92589 Hs.261038 
AB002301 Hs.54985 
NMJ)04600Hs.554 
AW081883 Hs.211578 
AA125985 Hs.56145 
BE263252 Hs.6101 
AF234532 Hs.61638 
AA1 12748 Hs.279905 
AI186431 Hs.296638 
AA026533 Hs.66 
AW021103 Hs.6631 
AB037715 Hs.183639 
AA403045 Hs.6906 
A1866286 Hs.71962 
BE619053 Hs.1 70001 
NM_000165Hs.74471 
AI129903 Hs.74669 
BE273749 

NM_003003Hs.75232 
NM_002047Hs.75280 
M85289 Hs.211573 
AI878921 Hs.75607 
AU076964 Hs.7753 
C18356 Hs.295944 
L34657 Hs.78146 
NM_002290Hs.78672 
AI379954 Hs.79025 
AA634543 Hs.79440 
AW580939 Hs.97199 
X68264 Hs.211579 
A1272141 Hs.83484 
AI750878 Hs.87409 
AW968058 Hs.92381 
AI272141 Hs.83484 
W55956 Hs.94030 
AA1 14212 Hs.9930 
X02308 Hs.82962 
NM_000304Hs.103724 
AU076465 Hs.278441 
H73444 Hs.394 
NML002933HS.78224 
AL121516 Hs.138617 
AW291587 Hs.82733 
086983 Hs.1 18893 
AW888941 Hs.75789 
BE313693 Hs.334330 
AI752163 Hs.1 14599 
N24433 Hs.241567 



vascular cell adhesion molecule 1 

mesoderm development candidate 1 

von Willebrand factor 

secreted protein, acidic, cysteine-rich 

amelogenin (Y chromosome) 

tubulin-specificchaperone d 

lysosomal 

mitochondrial ribosomal protein L12 
ESTs, Moderately similar to (54374 gene 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586M518 (f 
endotheiin 1 
nuclear factor l/B 
ESTs 

BMX non-receptor tyrosine kinase 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamily member 1 
upstream regulatory element binding prot 
ESTs 
Pirin 

procollagen-lysine, 2-oxoglutarate 5-dio 
H4 histone family, member G 
metalloprotease 1 (pitrilysin family) 
KIAA0512 gene product; ALEX2 
UM domain binding 2 
ESTs, Weakly similar to 138022 hypotheti 
KIAA0303 protein 

Sjogren syndrome antigen A2 {60kD, ribon 
Homo sapiens cDNA: FLJ23037 (is, done L 
thymosin, beta, identified in neuroblast 
hypothetical protein MGC3178 
myosin X 

clone HQ0310PRO0310p1 
prostate differentiation factor 
interleukin 1 receptor-like 1 
hypothetical protein FLJ20373 
hypothetical protein FLJ10210 
Homo sapiens cDNA: FLJ23197 fis, clone R 
ESTs, Weakly similar to B36298 proline-r 
eukaryotic translation initiation factor 
gap junction protein, alpha 1, 43kD (con 
vesicle-associated membrane protein 5 (m 
FK506-binding protein 1A (12kD) 
SEC14(S. cerevisiae)-like 1 
glycyl-tRNA synthetase 
heparan sulfate proteoglycan 2 (perlecan 
myristoylated alanine-rich protein kinas 
calumenin 

tissue factor palhway inhibitor 2 
platelet/endothelial cell adhesion molec 
laminin, alpha 4 
KIAA0096 protein 
IGF-II mRNA-binding protein 3 
complement component C1q receptor 
melanoma cell adhesion molecule 
SRY (sex determining region Y)-box 4 
thrombospondin 1 

nudix (nucleoside diphosphate linked moi 

SRY (sex determining region Y)-box 4 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

serine (or cysteine) proteinase inhibito 

thymidylate synthetase 

peripheral myelin protein 22 

KIAA0015 gene product 

adrenomedullin 

ribonuciease, RNase A family, 1 (pancrea 
thyroid hormone receptor interactor 12 
nidogen2 

Melanoma associated gene 

N-myc downstream regulated 

calmodulin 2 (phosphorylase kinase, delt 

collagen, type VIII, alpha 1 

RNA binding motif, single stranded inter 
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102300 

102378 

102395 

102460 

102491 

102499 

102523 

102560 

102564 

102589 

102600 

102645 

102687 

102693 

102709 

102759 

102804 

102882 

102907 

102915 

102927 

102960 

103011 

103020 

103029 

103036 

103056 



HG2855-HT2995 

HG3044-HT3742 

HG3342-HT3519 

HG3543-HT3739 

HG4069-HT4339 

HG417-HT417 

J03764 

L06797 

L0S246 

L12711 

L13977 

L15388 

L19871 

L20859 

U2176 

L49169 

L76380 

M15990 

M23254 

M24736 

M26576 

M27396 

M31166 

M31994 

M32334 

M35878 

M36429 

M57730 

M57731 

M60858 

M62994 

M68874 

M69043 

M74719 

M75126 

M84349 

M92843 

M92934 

M93056 

M94856 

M95787 

S76965 

S81914 

U03057 

U03100 

U03877 

U08021 

U14391 

U31384 

U32944 

U40369 

U41767 

U48959 

U51010 

U51478 

U53445 

U59289 

U59423 

U62015 

U63825 

U67963 

U73379 

U73824 

U77604 

U81607 

U89942 

X04412 

X06985 

X07820 

X12876 

X15729 

X52541 

X53416 

X54489 

X54925 

X57206 



U56725 Hs.180414 
X02761 Hs.287820 
BE295928 Hs.75424 
T81309 

AL048753 Hs.303649 
AA836472 Hs.297939 
J03836 Hs.82085 
BE245301 Hs.89414 
AI439011 Hs.86386 
L12711 Hs.89643 
AA340987 Hs.75693 
NMJ)05308Hs.21 1569 
NM_001674Hs.460 
BE247295 Hs.78452 
L42176 Hs.8302 
NM_006732Hs.75678 
NMJX)5795Hs. 152175 
M15990 Hs.194148 
BE410405 Hs.76288 
AA296520 Hs.89546 
X12784 Hs.1 19129 
AA307680 Hs.75692 
M31166 Hs.2050 
BE293116 Hs.76392 
AW958272 Hs.347326 
AI752416 Hs.77326 
AF064853 Hs.91299 
NM.004428Hs.1624 
AV650262 Hs.75765 
NIVL005381 
AF043045 Hs.81008 
M68874 Hs.211587 
M69043 Hs.81328 
NM_003199Hs.326198 
AI879352 Hs.1 18625 
W01076 Hs.278573 
M92843 Hs.343586 
BE243845 Hs.75511 
AA236291 Hs.1 83583 
BE550723 Hs.153179 
BE392588 Hs.75777 
NMJ06823HS.75209 
X96438 Hs.76095 
BE259035 Hs.1 18400 
BE616287 Hs.178452 
AA301867 Hs.76224 
AI752666 Hs.76669 

NMJ)04998Hs.82251 
AW161552 Hs.83381 
AI929721 Hs.5120 
AU076887 Hs.28491 
AU077005 Hs.92208 
U48959 Hs.211582 
U51010 

BE243877 Hs.76941 
U53445 Hs.1 5432 
R97457 Hs.63984 
U59423 Hs.79067 
AU076728 Hs.8867 
AI984144 Hs.66713 
AL1 19566 Hs.6721 
NM_007019Hs.93002 
AA532780 Hs.183684 
AA122237 Hs.81874 
NIVL005100HS.788 
NIVL002318HS.83354 
AI767736 Hs.290070 
BE409861 Hs.202833 
X07820 Hs.2258 
BE512730 Hs.65114 
AI904738 Hs.76053 
AJ243425 Hs.326035 
X53416 Hs.195464 
AW800726 Hs.789 
M13509 Hs.83169 
Y18024 Hs.78877 



heat shock 70kD protein 2 
fibronectin 1 

inhibitor of DNA binding 1 , dominant neg 
insulin-like growth factor 2 (somatomedi 
small inducible cytokine A2 (monocyte ch 
cathepsin B 

serine (or cysteine) proteinase inhibito 
chemokine (C-X-C motif), receptor 4 (fus 
myeloid cell leukemia sequence 1 (BCL2-r 
transketolase (Wernicke-Korsakoff syndro 
prolylcarboxypeptidase (angiotensinase C 
G protein-coupled receptor kinase 5 
activating transcription factor 3 
solute carrier family 20 (phosphate tran 
four and a half LIM domains 2 
FBJ murine osteosarcoma viral oncogene h 
calcitonin receptor-like 
v-yes-1 Yamaguchi sarcoma viral oncogene 
calpain 2, (m/ll) large sub-unit 
setectin E (endothelial adhesion molecul 
collagen, type IV, alpha 1 
asparagine synthetase 
pentaxin-related gene, rapidly induced b 
aldehyde dehydrogenase 1 family, member 
intercellular adhesion molecule 2 
insulin-like growth factor binding prate 
guanine nucleotide binding protein (G pr 
ephrin-A1 
GR02 oncogene 
nucleolin 

filamin B, beta (actin-binding protein-2 
phospholipase A2, group IVA (cytosoiic, 
nuclear factor of kappa light polypeptid 
transcription factor 4 
hexokinase 1 

CD59 antigen p18-20 (antigen identified 
zinc finger protein homologous to Zfp-36 
connective tissue growth factor 
serine (or cysteine) proteinase inhibito 
fatty acid binding protein 5 (psoriasis- 
transgelin 

protein kinase (cAMP-dependent, catalyti 
immediate early response 3 
singed (Drosophila)-Iike (sea urchin fas 
catenin (cadherin-associated protein), a 
EGF-containing fibulin-like extracellula 
nicotinamide N-methyltransferase 
myosin IE 

guanine nucleotide binding protein 11 
dynein, cytoplasmic, light polypeptide 
spermidine/spermine N 1 -acetyltransferase 
a dislntegrin and metalloproteinase doma 
myosin, light polypeptide kinase 
gb:Human nicotinamide N-methyitransferas 
ATPase, Ua+IK+ transporting, beta 3 poly 
downregulated in ovarian cancer 1 
cadherin 13, H-cadherin (heart) 
MAD (mothers against decapentaplegic, Dr 
cysteine : rich, angiogenic inducer, 61 
hepatitis delta antigen-interacting pro! 
lysosomal 

ubiquitin carrier protein E2-C 
eukaryotic translation initiation factor 
microsomal glutathione S-transferase 2 
A kinase (PRKA) anchor protein (g ravin) 
lysy) oxidase-like 2 
gelsolin (amyloidosis, Finnish type) 
heme oxygenase (decycltng) 1 
matrix metalloproteinase 10 (stromelysin 
keratin 18 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
early growth response 1 
filamin A, alpha (actin-binding protein- 
GR01 oncogene (melanoma growth stimulati 
matrix metalloproteinase 1 (interstitial 
inositol 1,4,5-trisphosphate3-kinase B 
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103080 X59798 
103095 X60957 
103138 X65965 
103176 X69111 
103195 X70940 
103347 X87838 
103371 X91247 
103432 X97748 
103471 Y00815 
103967 M303711 
104447 L44538 
104764 AA025351 
104783 AA027050 
104798 AA029462 
104865 AA045136 
104877 AA047437 
104894 AA054087 
104952 AA071089 
105113 AA156450 
105178 AA187490 
105196 AA195031 
105215 AA205724 
105263 AA227926 
105271 AA227986 
105330 AA234743 
105461 AA253216 

105492 AA256210 

105493 AA256268 
105594 AA279397 
105727 AA292379 
105732 AA292717 
105767 AA346551 
105882 AA400292 
105936 AA404338 
106031 AA412284 
106124 AA423987 
106222 AA428594 
106241 AA430108 

106263 M431462 

106264 AA431470 
106366 AA443756 
106454 AA449479 
106634 AA459916 
106724 AA465226 
106793 AA478778 
106799 AA479037 
106842 AA482597 
106868 AA487561 
106890 AA489245 
106961 AA504110 
106974 AA520989 
107030 AA599434 
107061 AA608649 
107086 AA609519 
107216 D51069 
107385 1)97519 
107444 W28391 
107985 AA035638 
108507 AA083514 
108695 AA121315 
108931 AA147186 
109001 AA156125 
109195 AA188932 
109390 AA219653 
109456 AA232645 
109737 F10078 
110411 H48032 
110660 H82117 
110906 N39584 
111018 N54067 
111091 N59858 
111356 N90933 
111378 N93764 
111741 R26124 
111769 R27957 
112318 R55470 



AU077231 Hs.82932 
NM_Q05424Hs.78824 
X65965 

AL021154 Hs.76884 
AA351647 Hs.2642 
AU077309 Hs.171271 
X91247 Hs.13046 
X97748 

Y00815 Hs.75216 
AL120051 Hs.144700 
AW204145 Hs.156044 
AI039243 Hs.278585 
AA533513 Hs.93659 
AW952619 Hs.17235 
T79340 Hs.22575 
AI138635 Hs.22968 
AF065214 Hs.18858 
AW076098 Hs.345588 
AB037816 Hs.8982 
AA313825 Hs.21941 
W84893 Hs.9305 
AA205759 Hs.10119 
AW388633 Hs.6682 
AA807881 Hs.25329 
AW338625 Hs.22120 
BE539071 Hs.69388 
AI805717 Hs.289112 
AL047586 Hs.10283 
AB024334 Hs.25001 
AL135159 Hs.20340 
AW504170 Hs.274344 
AW370946 Hs.23457 
W46802 Hs.81988 
A1678765 Hs.21812 
X64116 Hs.171844 
H93366 Hs.7567 
AA356392 Hs.21321 
BE019681 Hs.6019 
W21493 Hs.28329 
AL046859 Hs.3407 
AA186715 Hs.336429 
NM 014038HS.5216 
W25491 Hs.288909 
N48670 Hs.28631 
H94997 Hs.16450 
BE313412 Hs.7961 
AF1 24251 Hs.26054 
BE1 85536 Hs.301183 
AA489245 Hs.88500 
AW243614 Hs.18063 
AI817130 Hs.9195 
AL1 17424 Hs.25035 
BE147611 Hs.6354 
NM_012331Hs.26458 
D51069 Hs.211579 
NM_005397Hs.16426 
W28391 Hs.343258 
T40064 Hs.71968 
AI554545 Hs.68301 
AB029000 Hs.70823 
AA147186 

AI056548 Hs.72116 
AF047033 Hs.132904 
AW007485 Hs.87125 
AW956560 Hs.42699 
AA055415 Hs.13233 
AW001579 Hs.9645 
AA782114 Hs.28043 
AA035211 Hs.17404 
AI287912 Hs.3628 
AA300067 Hs.33032 
BE301871 Hs.4867 
AW1 60993 Hs.326292 
AB020653 Hs.24024 
AW629414 Hs.24230 
AW083384 Hs.11067 



cyclin D1 (PRAD1: parathyroid adenomatos 

tyrosine kinase with immunoglobulin and 

gb:H.sapiens SOD-2 gene for manganese su 

inhibitor of DMA binding 3, dominant neg 

eukaryob'c translation elongation factor 

catenin (cadherin-associated protein), b 

thioredoxin reductase 1 

gb;H.sap(ens PTX3gene promotor region. 

protein tyrosine phosphatase, receptor t 

ephrin-B1 

ESTs 

ESTs 

protein disulfide isomerase related prot 

Homo sapiens clone TCCCIA00176 mRNA sequ 

B-cell CLL/lymphoma 6, member B (zinc fi 

Homo sapiens clone IMAGE:451939, mRNA se 

phosphollpaseA2, group IVC (cytosolic, 

desmoplakin (DPI, DPII) 

Homo sapiens, clone IMAGE:3506202, mRNA, 

AD036 protein 

angiotensin receptor-like 1 

hypothetical protein FU 14957 

solute carrier family 7, (cationic amino 

ESTs 

ESTs 

hypothetical protein FLJ20505 

CGI-43 protein 

RNA binding motif protein 8B 

tyrosine 3-monooxygenase/tryptophan 5-mo 

KIAA1002 protein 

hypothetical protein MGC12942 

ESTs 

disabled (Drosophila) homolog 2 (mitogen 
ESTs 

Homo sapiens cDNA: FLJ22296 fis, clone H 

Homo sapiens cDNA: FLJ21962 fis, clone H 

Homo sapiens clone FLB9213 PR02474 mRNA, 

Homo sapiens cDNA: FLJ21288 fis, clone C 

hypothetical protein FU 14005 

protein kinase (cAMP-dependent, catalyti 

RIKEN cDNA9130422N19gene 

HSPC028 protein 

hypothetical protein FLJ22471 

Homo sapiens cDNA: FLJ22141 fis, done H 

ESTs 

Homo sapiens clone 25012 mRNA sequence 
novel SH2-containing protein 3 
molecule possessing ankyrin repeats indu 
mitogen-activated protein kinase 8 inter 
Homo sapiens cDNA FU10768 fis, clone NT 
Homo sapiens cDNA FU13698 fis, clone PL 
chloride intracellular channel 4 
stromal cell derived factor receptor 1 
methionine sulfoxide reductase A 
melanoma cell adhes/on molecule 
podocalyxin-like 

proliferation-associated 2G4, 38kD 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 

ESTs 

KIAA1077 protein 

gb:zo38d01.s1 Stratagene endothelial eel 
hypothetical protein FLJ20992 similar to 
solute carrier family 4, sodium bicarbon 
EH-domain containing 3 
ESTs 

ESTs, Moderately similar to A47582 B-cel 
Homo sapiens mRNA forKIAA1741 protein, 
ESTs 
ESTs 

mitogen-activated protein kinase kinase 
hypothetical protein DKFZp434N185 
mannosyl (alpha-1,3-)-g!ycoprotein beta- 
hypothetical gene DKFZp434A1 1 14 
KIAA0846 protein 
ESTs 

ESTs, Highly similar to T46395 hypotheti 
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112951 
113057 
113195 
113490 
113542 
113803 
113847 
113910 
113947 
114047 
115061 
115819 
115870 
115964 
1 16228 
116264 
116314 
116589 
1 17023 
117112 
117156 
117176 
117280 
119559 
1 19866 
120655 
121314 
121335 
121822 
121835 
122331 
122577 
123160 
123486 
124059 
124339 
124358 
124364 
124726 
124763 
125167 
125304 
125307 
125329 
125598 
125609 
418245 
127435 
127566 
127619 
128453 
12B495 
128515 
128580 
128623 
128642 
128669 
128903 
128914 
129087 
129188 
129226 
129265 
129345 
129468 
129488 
129498 
129557 
129619 
129627 
129762 
129884 
130018 
130147 
130178 
130282 



T16550 

T26674 

T57112 

T88700 

T90527 

W42789 

W60002 

W78175 

W84768 

W94427 

AA253217 

AA426573 

AA432374 

AA446622 

AA478771 

AA482594 

AA490588 

D59570 

H88157 

H94648 

H97538 

H98670 

N22107 

W38197 

W80814 

AA287347 

AA402799 

AA404418 

AA425107 

AA425435 

AA442872 

AA452860 

AA488687 

AA599674 

F13673 

H99093 

N22495 

N23031 

R15740 

R39610 

W45560 

Z39833 

Z40583 

AA825437 

R66613 

AA868063 

AA128075 

N66570 

AI051390 

AA627122 

X02761 

AF010193 

AA149044 

U82108 

D78676 

L35240 

M598737 

R69417 

AA232837 

N72695 

M30257 

M96843 

X68277 

AA292440 

J03040 

AA228107 

AA449789 

W01367 

AA610116 

AA258308 

AA460273 

AA286710 

T68873 

D63476 

M62403 

X55740 



AA307634 Hs.6650 
AW1 94301 Hs.339283 
H83265 Hs.8881 
BE178110 Hs.173374 
H43374 Hs.7890 
AW880709 Hs.283683 
NM_005032Hs.4114 
AA1 13262 Hs.17901 
W84768 

AL035858 Hs.3807 
AI751438 Hs.41271 
AA486620 Hs.41135 
NM_005985Hs.48029 
AA987568 Hs.74313 
AI767947 Hs.50841 
D51174 Hs.272239 
AI799104 Hs.178705 
AI557212 Hs.17132 
AW070211 Hs.102415 
AW969999 Hs.293658 
W73853 

H45100 Hs.49753 
M18217 Hs.172129 
W38197 

AA496205 Hs.1 93700 
AA305599 Hs.238205 
W07343 Hs.1 82538 
AA404418 
AI743860 

AB033030 Hs.300670 
AL133437 Hs.1 10771 
AA829725 Hs.334437 
AA488687 Hs.284235 
BE019072 Hs.334802 
BE387335 Hs.283713 
H99093 Hs.343411 
AW070211 Hs.102415 
AF265555 Hs.250646 
NM_003654Hs.1O4576 
BE410405 Hs.76288 
AL137540 Hs.102541 
AL359573 Hs.1 24940 
AW580945 Hs.330466 
AA825437 Hs.58875 



T40064 

AA868063 

AA088767 

X69086 

AI051390 

AA627122 

X02761 



Hs.71968 

Hs.1 04576 

Hs.83883 

Hs.286161 

Hs.116731 

Hs.1 63787 

Hs.287820 



NM_005904Hs.100602 
BE395085 Hs.10086 
U82108 Hs.101813 
BE076608 Hs.105509 
Z28913 Hs.102948 
W28493 Hs.180414 
AW150717 Hs.345728 
AW867491 Hs.107125 
AI348027 Hs.1 08557 
NM_001078Hs.109225 
BE222494 Hs.1 80919 
AA530892 Hs.171695 
R22497 Hs.1 10571 
AW410538 Hs.1 11779 
AW966728 Hs.54642 
AA449789 Hs.75511 
AL045404 Hs.46366 
AA209534 Hs.284243 
T40064 Hs.71968 
AA453694 Hs.1 2372 
AF055581 Hs.1 31 31 
AA353093 

D63476 Hs.172813 
U20982 Hs.1516 
BE245380 Hs.153952 



vacuolar protein sorting 45B (yeast homo 
Human DNA sequence from clone RP1-187J1 1 
ESTs, Weakly similar to S41044 chromosom 
Homo sapiens cDNA FLJ10500 fis, clone NT 
Homo sapiens mRNA for KIAA1671 protein, 
chromosome 8 open reading frame 4 
plastin 3 (T isoform) 

Homo sapiens, clone IMAGE:3937015, mRNA, 
gb:zh53d03.s1 Soares_feta|Jiver_spleen_ 
FXYD domain-containing ion transport reg 
Homo sapiens mRNA full length insert cDN 
endomucin-2 

snail 1 (drosophila homolog), zinc finge 

K1AA1265 protein 

ESTs 

lysosomal 

Homo sapiens cDNA FLJ11333 fis, clone PL 
ESTs, Moderately similar to I54374 gene 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs 
ESTs 

uveal autoantigen with coiled coil domai 
Homo sapiens cDNA: FLJ21409 fis, done C 
Empirically selected from AFFX single pr 
Homo sapiens mRNA; cDNA DKFZp586l0324 (f 
hypothetical protein PRO2013 
phospholipid scrambJase 4 
gb:zw37e02.s1 SoaresJata(Jetus_Nb2HF8_ 
metallothionein 1E (functional) 
KIAA1204 protein 

Homo sapiens cDNA: FU21904 fis, clone H 
hypothetical protein MGC4248 
ESTs, Weakly similar to 138022 hypotheti 
Homo sapiens cDNA FLJ14680 fis, clone NT 
ESTs, Weakly similar to S64054 hypotheti 
DEAD/H (Asp-Glu-Ala-Asp/His) box pofypep 
Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
baculoviral IAP repeat-containing 6 
carbohydrate (keratan sulfate Gal-6) sul 
calpain 2, (m/ll) large subunit 
netrin 4 

GTP-binding protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
carbohydrate (keratan sulfate Gal-6) sul 
transmembrane, prostate androgen induced 
Homo sapiens cDNA FLJ13613 fis, clone PL 
ESTs 
ESTs 

fibronectin 1 

MAD (mothers against decapentaplegic, Dr 
type I transmembrane protein Fn14 
solute carrier family 9 (sodium/hydrogen 
CTL2 gene 

enigma (LIM domain protein) 
heat shock 70kD protein 8 
STAT induced STAT inhibitor 3 
plasmalemma vesicle associated protein 
hypothetical protein PP1057 
vascular cell adhesion molecule 1 
inhibitor of DNA binding 2, dominant neg 
dual specificity phosphatase 1 
growth arrest and DNA-damage-inducible, 
secreted protein, acidic, cysteine-rich 
methionine adenosyltransferase II, beta 
connective tissue growth factor 
KIAA0948 protein 
tetraspan NET-6 protein 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
tripartite motif protein TRIM2 
lysosomal 
metallothionein 1L 

PAK-interacting exchange factor beta 
insulin-like growth factor-binding prate 
5* nucleotidase (CD73) 
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130431 L10284 
130495 AA243278 
130553 AA430032 

130638 H16402 

130639 D59711 
130657 T94452 
130666 AA431571 
130776 R79356 
130818 AA280375 
130840 Z49269 
130899 Z41740 
131002 AA121543 
131080 J05008 
131084 AA101878 
131091 T35341 
131107 N87590 
131182 AA256153 
131207 W74533 
131319 U25997 
131328 V01512 
131328 V01512 
131328 V01512 
131328 V01512 
131509 X56681 
131555 AA161292 
131564 AA491465 
131573 AA046593 
131692 D50914 
131756 D45304 
131859 M90657 
131909 W69127 
131915 AA316186 
132046 AA384503 
132050 AA136353 
132151 AA044755 
132164 U84573 
132187 AA058911 
132303 AA620962 
132314 AA285290 
132358 X60486 
132398 R31641 
132421 AA489190 
132490 F13782 
132520 AA257993 
132546 M24283 
132610 AA443114 
132716 T35289 
132840 N23817 
132883 AA047151 
132968 N77151 
132989 M480074 
132999 Y00787 
133071 T99789 
133076 W84341 
133099 L09209 
133147 D12763 
133149 T16484 
133161 AA253193 
133200 AA432248 
133220 X82200 
133260 AA083572 
133295 L00352 
133349 N75791 
133391 X57579 
133398 X02612 
133436 H44631 
133454 AA090257 
133478 X83703 
133491 L40395 
133510 AA227913 
133517 X52947 
133526 M11313 
133538 L14837 
133562 M60721 
133584 D90209 
133590 T67986 



AW505214 Hs.155560 
AW250380 Hs.109059 
AF062649 Hs.252587 
AW021276 Hs.17121 
AI557212 Hs.17132 
AW337575 Hs.201591 
BE548267 Hs.337986 
AF167706 Hs.19280 
AW190920 Hs.19928 
BE048821 Hs.20144 
AI077288 Hs.296323 
AL050295 Hs.22039 

NA/L001955HS.2271 
NMJ)17413Hs.303084 
AJ271216 Hs.22880 
BE620886 Hs.75354 
AI824144 Hs.23912 
AF1 04266 Hs.24212 
NM_003155Hs.25590 
AW939251 Hs.25647 
AW939251 Hs.25647 
AW939251 Hs.25647 
AW939251 Hs.25647 
X56681 Hs.2780 
T47364 Hs.278613 
T93500 Hs.28792 
AA040311 Hs.28959 
BE559681 Hs.30736 
AA443966 Hs.31595 
AW960564 

NM_016558Hs.274411 
AI161383 Hs.34549 
AI359214 Hs.179260 
AI267615 Hs.38022 
BE379499 Hs.1 73705 
AI752235 Hs.41270 
AA235709 Hs.4193 
BE177330 Hs.325093 
AF1 12222 Hs.323806 
NM_003542Hs.46423 
AA876616 Hs.16979 
AW1 63483 Hs.48320 
NM.001290Hs.4980 
AA257992 Hs.50651 
M24283 Hs. 168383 
AA1 60511 Hs.5326 
BE379595 Hs.283738 
BE218319 Hs.5807 
AA373314 Hs.5897 
AF234532 Hs.61638 
AA480074 Hs.331328 
Y00787 Hs.624 
BE384932 Hs.64313 
AW946276 Hs.6441 
W16518 Hs.279518 
AA026533 Hs.66 
AA370045 Hs.6607 
AW021103 Hs.6631 
AB037715 Hs.183639 
NM_006074Hs,318501 
AA403045 Hs.6906 
AI147861 Hs.213289 
AW631255 Hs.8110 
AW103364 Hs.727 
NWL000499HS.72912 
BE294068 Hs.737 
BE547647 Hs.177781 
X83703 Hs.31432 
BE619053 Hs.1 70001 
AW880841 Hs.96908 
NMJ)0O165Hs.74471 
AU077051 Hs.74561 
NM 003257HsJ4614 
M60721 Hs.74870 
D90209 Hs.181243 
T70956 Hs.75106 



calnexin 

mitochondrial ribosomal protein L12 
pituitary tumor-transforming 1 
ESTs 

ESTs. Moderately similar to I54374 gene 
ESTs 

Homo sapiens cDNA FLJ10934 lis, clone OV 

cysteine-rich motor neuron 1 

hypothetical protein SP329 

small inducible cytokine subfamily A (Cy 

serum/glucocorticoid regulated kinase 

KIAA0758 protein 

endothelin 1 

apelin; peptide Ggand forAPJ receptor 

dipeptidytpeptidaselll 

GCN1 (general control of amino-acid synt 

ESTs 

latrophilin 

stanniocalcin 1 

v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
v-fos FBJ murine osteosarcoma viral onco 
jun D proto-oncogene 
interferon, afpha-induc/ble protein 27 
Homo sapiens cDNA FLJ1 1041 fis, clone PI 
ESTs 

KIAA01 24 protein 
ESTs 

transmembrane 4 superfamiiy member 1 

SCAN domain-containing 1 

ESTs, Highly similar to S94541 1 clone 4 

chromosome 14 open reading frame 4 

ESTs 

Homo sapiens cDNA: FLJ22050 fis, clone H 

procollagen-lysine, 2-oxoglutarate 5-dio 

DKFZP58601624 protein 

Homo sapiens cDNA: FLJ21210 fis, clone C 

pinin, desmosome associated protein 

H4 histone family, member G 

ESTs, Weakly similar to A43932 mucin 2 p 

double ring-finger protein, Dorfin 

L1M domain binding 2 

Janus kinase 1 (a protein tyrosine kinas 

intercellular adhesion molecule 1 (CD54) 

amino acid system N transporter 2; porcu 

casein kinase 1 , alpha 1 

GTPaseRab14 

Homo sapiens mRNA; cDNA DKFZp586P1622 (f 
myosin X 

hypothetical protein FLJ13213 
interieukin 8 

ESTs, Weakly similar to AF257182 1 G-pro 

Homo sapiens mRNA; cDNA DKFZp586J021 (fr 

amyloid beta (A4) precursor-like protein 

interieukin 1 receptor-like 1 

AXIN1 up-regulated 

hypothetical protein FLJ20373 

hypothetical protein FU10210 

Homo sapiens mRNA full length insert cDN 

Homo sapiens cDNA: FU23197 fis, clone R 

low density lipoprotein receptor (famili 

L-3-hydroxyacyl-Coenzyme A dehydrogenase 

inhibin, beta A (activin A, activin AB a 

cytochrome P450, subfamily I {aromatic c 

immediate early protein 

hypothetical protein MGC5618 

cardiac ankyrin repeat protein 

eukaryotic translation Initiation factor 

p53-induced protein 

gap junction protein, alpha 1, 43kD (con 

alpha-2-macrogbbulin 

tight junction protein 1 (zona occludens 

H2.0 (Drosophila)-like homeo box 1 

activating transcription factor 4 (tax-r 

clusterin (complement lysis inhibitor, S 
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133617 


AA148318 


BE244334 Hs.75249 


133651 


U97105 


AI301740 Hs.1 73381 


133671 


T25747 


AW503116 Hs.301819 


133678 


K02574 


AW247252 


133681 


D78577 


AI352558 


133722 


X53331 


AW969976 Hs.279009 


133730 


S73591 


BE242779 Hs.179526 


133750 


X95735 


BE410769 Hs.75873 


133802 


L16862 


AW239400 Hs.76297 


133825 


U44975 


BE616902 Hs.285313 


133838 


M97796 


BE222494 Hs.180919 


133859 


U86782 


U86782 Hs.178761 


133889 


AA099391 


U48959 Hs.211582 


133960 


M19267 


M19267 Hs.77899 


133975 


D29992 


C18356 Hs.295944 


133977 


L19314 


AI125639 Hs.250666 


134039 


S78569 


NIVL002290HS.78672 


134075 


U28811 


NNL012201HS.78979 


134081 


L77886 


AL034349 Hs.79005 


134164 


C14407 


AW245540 Hs.79516 


134203 


M60278 


AA161219 Hs.799 


134238 


R81509 


AA102179 Hs.160726 


134299 


AA487558 


AW580939 Hs.97199 


134332 


D86962 


D86962 Hs.81875 


134339 


AA478971 


R70429 Hs.81988 


134343 


D50683 


D50683 Hs.82028 


134381 


U56637 


AI557280 Hs.184270 


134403 


M61199 


AA334551 


134416 


M28882 


X68264 Hs.211579 


134493 


X15183 


M30627 Hs.289088 


134558 


S53911 


NM 001773HS.85289 


134817 


U20734 


AU076592 Hs.198951 


134983 


D28235 


028235 Hs.196384 


134989 


AA236324 


AW968058 Hs.92381 


135052 


AA148923 


AL1 36653 Hs.93675 


135062 


AA174183 


AK000967 Hs.93872 


135069 


AA456311 


AA876372 Hs.93961 


135071 


L08069 


W27190 Hs.94 


135073 


AA452000 


W55956 Hs.94030 


135170 


AA282140 


T53169 HS.95B7 


135196 


J02854 


C03577 Hs.9615 


135348 


AA442054 


U80983 Hs.268177 


134404 


AB000450 


AB000450 Hs.82771 


439561 


AB002380 


AF1 80681 Hs.6582 


100082 


AB003103 


AA1 30080 Hs.4295 


132817 


AB004884 


N27852 Hs.57553 


130150 


AF000573 


BE094848 Hs.15113 


100104 


AF008937 


AF008937 


447973 


AF009301 


AB011169 Hs.20141 


332613 


AF009368 


AF029674 Hs.173422 


100113 


D00591 


NM 001269HS.84746 


133980 


D00760 


AA294921 Hs.348024 


100129 


D11139 


AA469369 Hs.5831 


100154 


D14657 


H60720 Hs.81892 


100169 


D14878 


AL037228 Hs.82043 


129718 


D17716 


NM 002410HS.121502 


100190 


D21090 


M91401 Hs.178658 


134742 


D26135 


NM 001346HS.89462 


100211 


D26528 


D26528 Hs.1 23058 


100238 


D30742 


L24959 Hs.348 


130283 


D31762 


NM 01 2288HS.1 53954 


134237 


D31765 


D31765 Hs.170114 


100248 


D31888 


NM 015156HS.78398 


100256 


D38128 


D25418 Hs.393 


100262 


D38500 


D38500 Hs.278468 


134329 


D38551 


N92036 Hs.81848 


100281 


D42087 


AF091035 Hs.184627 


100294 


D49396 


AA331881 Hs.75454 


100327 


055640 


D55640 


100335 


D63391 


AW247529 Hs.6793 


134495 


D63477 


D63477 Hs.84087 


100338 


D63483 


D86864 Hs.57735 


135152 


D64015 


M96954 Hs.182741 


.134269 


D79990 


NM_014737Hs.80905 


100372 


D79997 


NM_014791Hs.184339 


134304 


D80010 


BE613486 Hs.81412 



ADP-ribosylation factor-like 6 interacts 

dihydropyrimidinase-Iike 2 

zinc finger protein 146 

nucleoside phosphorylase 

tyrosine 3-monooxygenase/tryptophan 5-mo 

matrix Gla protein 

upregulated by 1,25-dihydroxyvitamin D-3 
zyxin 

G protein-coupled receptor kinase 6 

core promoter element binding protein 

inhibitor of DNA binding 2, dominant neg 

26S proteasome-associated pad1 homoiog 

myosin, light polypeptide kinase 

tropomyosin 1 (alpha) 

tissue factor pathway inhibitor 2 

hairy (Drosophila)-homolog 

laminin, alpha 4 

Golgi apparatus protein 1 

protein tyrosine phosphatase, receptor t 

brain abundant membrane attached signal 

diphtheria toxin receptor (heparin-bindi 

Homo sapiens cDNA FLJ1 1680 fis, clone HE 

complement component C1q receptor 

growth factor receptor-bound protein 10 

disabled (Drosophila) homoiog 2 (mitogen 

transforming growth factor, beta recepto 

capping protein (actin filament) muscle 

sperm specific antigen 2 

melanoma cell adhesion molecule 

heat shock 90kD protein 1, alpha 

CD34 antigen 

jun B proto-oncogene 

prostaglandin-endoperoxide synthase 2 (p 

nudix (nucleoside diphosphate linked moi 

decidual protein induced by progesterone 

KIAA1682 protein 

Homo sapiens mRNA; cDNA DKFZp667D095 (fr 
DnaJ (Hsp40) homoiog, subfamily A, membe 
Homo sapiens mRNA; cDNA DKFZp586E1624 (f 
Homo sapiens cDNA: FLJ22290 fis, clone H 
myosin regulatory light chain 2, smooth 
phospholipase C, gamma 1 (formerly subty 
vaccinia related kinase 2 
Rho guanine exchange factor (GEF) 12 
proteasome (prosome, macropain) 26S subu 
tousled-like kinase 2 

homogentisate 1,2-dioxygenase (homogenti 
syntaxin 16 

similar to S. cerevisiae SSM4 

KIAA1605 protein 

chromosome condensation 1 

v-ral simian leukemia viral oncogene horn 

tissue inhibitor of metalloproteinase 1 

KIAA0101 gene product 

D 123 gene product 

mannosyl (a1pha-1,6-)-glycoprotein beta- 
RAD23 (S. cerevisiae) homoiog B 
diacylglycerol kinase, gamma (90kD) 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
calcium/calmodulin-dependent protein kin 
TRAM-like protein 
KIAA0061 protein 
KIAA0071 protein 

prostaglandin 12 (prostacyclin) receptor 
postmeiotic segregation increased 2-Iike 
RAD21 (S. pombe) homoiog 
KIAA01 18 protein 
peroxiredoxin 3 

gb:Human monocyte PABL (pseudoautosomal 

platelet-activating factor acetylhydroia 

KIAA0143 protein 

acetyl LDL receptor; SREC 

TIA1 cytotoxic granule-associated RNA-bi 

Ras association (RalGDS/AF-6) domain fam 

KIAA0175 gene product 

(ipin 1 
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100394 


D84276 


D84284 


Hs.66052 


100405 


D86425 


AW291587 Hs.82733 


100418 


D86978 


D86978 


Hs.84790 


133154 


D87012 


D87012 


Hs.194685 


134347 


D87075 


AF164142 


Hs.82042 


444099 


D87432 


D87432 


Hs.10315 


100438 


D87448 


AA013051 


Hs.91417 


134593 


D87845 


NM_000437Hs.234392 


100481 


HG1098-HT1098 X70377 


Hs.121489 


100552 


HG2167-HT2237 AA019521 


Hs.301946 


100591 


HG2415-HT2511 NM_004091Hs,231444 


100652 


HG2825-HT2949 BE613608 


Hs.142653 


100662 


HG2887-HT3031 AI368680 


Hs.816 


100899 


HG4660-HT5073 AL039123 


Hs.103042 


100905 


HG4704-HT5146 L12260 


Hs.172816 


100945 


HG884-HT884 


AF002225 


Hs.180686 


100950 


HG919-HT919 


AF128542 


Hs. 166846 


100964 


J00212 


J00212 




135407 


J04029 


J04029 


Hs,99936 


130149 


J04031 


AW067805 


Hs.172665 


131877 


J04088 


J04088 


Hs.156346 


101016 


J04543 


J04543 


Hs.78637 


134786 


L06139 


T29618 


Hs.89640 


134100 


L07540 


AA460085 


Hs.171075 


134078 


L08895 


L08895 


Hs.78995 


101132 


L11239 


L11239 


Hs.36993 


134849 


L11353 


BE409525 


Hs.902 


332736 


L13773 


Z83689 


Hs.114765 


101152 


L13800 


AI984625 


Hs.9884 


135397 


L14922 


L14922 


Hs.166563 


432642 


L15189 


BE297635 


Hs.3069 


101168 


L15388 


NM.005308HS.211569 


421155 


L16895 


H87879 


Hs.102267 


101226 


L27476 


AF083892 


Hs.75608 


415138 


L27624 


C18356 


Hs.295944 


134739 


L32976 


NM_002419Hs.89449 


130155 


L33404 


AA101043 


Hs.151254 


440538 


L35263 

\m W V ^ V«* V 


W76332 


Hs.79107 


409916 


L37347 


BE313625 


Hs.57435 


101294 


L40371 


AF168418 


Hs. 11 6784 


101300 


L40391 


BE535511 




101310 


L41607 


L41607 


Hs.934 


130344 


L77556 


AW250122 Hs.154879 


101381 


M13928 


AW675039 Hs.1227 


101381 


M13928 


AW675039 Hs.1227 


415678 


M14016 


AW005903 Hs.78601 


133780 


M14219 


AA557660 


Hs.76152 


101396 


M15796 


BE267931 


Hs.78996 


101447 


M21305 


M21305 




101458 


M22092 


M22092 




101470 


M22898 


NM 000546HS.1846 


134604 


M22995 


NM 002884Hs.865 


101478 


M23379 


NM 002890HS.758 


133519 


M24400 


AW583062 


Hs.74502 


131185 


M25753 


BE280074 


Hs.23960 


134116 


M27691 


R84694 


Hs.79194 


133999 


M23213 


AA535244 


Hs.78305 


130174 


M29550 


M29551 


Hs.151531 


129963 


M29971 


M29971 


Hs.1384 


132983 


M30269 


M30269 




133900 


M31158 


M31158 


HsJ7439 


101543 


M31166 


M31166 


Hs.2050 


101545 


M31210 


BE246154 


Hs.154210 


101620 


M55420 


S55271 


Hs.247930 


134691 


M59979 


AW382987 


Hs.88474 


133595 


M62810 


AA393273 


Hs.75133 


101700 


M64710 


D90337 


Hs.247916 


101714 


M68874 


M68874 


Hs.211587 


134246 


M74524 


D28459 


Hs.80612 


101760 


M80254 


M80254 


Hs.173125 


415022 


M81780 


X59960 


Hs.77813 


415022 


M81780 


X59960 


Hs.77813 


415022 


M81780 


X59960 


Hs.77813 


415022 


M81780 


X59960 


Hs.77813 


415022 


M81780 


X59960 


Hs.77813 


101791 


M83822 


M83822 


Hs.62354 



CD38 antigen (p45) 

nldogen 2 

KIAA0225 protein 

topoisomerase (DNA) 111 beta 

solute earner family 23 (nucleobase tra 

solute carrier family 7 (catlonic amino 

topoisomerase (DNA) II binding protein 

platelet-activating factor acetylhydrola 

cystatin D 

lysosomal 

Homo sapiens, Similar to hypothetical pr 
ret finger protein 

SRY (sex determining region Y)-box 2 
microtubule-associated protein 1B 
neuregulin 1 

ubiquitin protein Eigase E3A (human papi 
polymerase (DNA directed), epsilon 
Empirically selected from AFFX single pr 
keratin 10 (epiderrnolytic hyperkeratosis 
methylenetetrahydrofolale dehydrogenase 
topoisomerase (DNA) ll alpha (170kD) 
annexin A7 

TEK tyrosine kinase, endothelial (venous 
replication factor C (activator 1) 5 (36 
MADS box transcription enhancer factor 2 
gastrulation brain homeo box 1 
neurofibromin 2 (bilateral acoustic neur 
myeloid/lymphoid or mixed-lineage leukem 
spindle pote body protein 
replication factor C (activator 1) 1 (14 
heat shock 70kD protein 9B (mortalin-2) 
G protein-coupled receptor kinase 5 
lysyl oxidase 

tight junction protein 2 (zona occludens 
tissue factor pathway inhibitor 2 
mitogen-activated protein kinase kinase 
kaiiikrein 7 (chymotryptic, stratum com 
mitogen-activated protein kinase 14 
solute carrier family 11 (proton-coupled 
thyroid hormone receptor interactor 4 
transmembrane trafficking protein 
glucosaminyl (N-acetyl) transferase 2, 1 
DiGeorge syndrome critical region gene D 
aminolevulinate, delta-, dehydratase 
aminolevulinate, delta-, dehydratase 
uroporphyrinogen decarboxylase 
decorin 

proliferating cell nuclear antigen 
gb:Human alpha satellite and satellite 3 
gb:Human neural ceil adhesion molecule ( 
tumor protein p53 (Li-Fraumeni syndrome) 
RAP1 A, member of RAS oncogene family 
RAS p21 protein activator (GTPase activa 
chymotrypsinogen B1 
cyclin B1 

cAMP responsive element binding protein 
RAB2, member RAS oncogene family 
protein phosphatase 3 (formerly 2B), cat 
O-6-methylguanine-DNA methyltransferase 
nidogen (enactin) 

protein kinase, cAMP-dependent, reguiato 
pentaxin-related gene, rapidly induced'b 
endothelial differentiation, sphingolipi 
Epsilon , IgE 

prostaglandin-endoperoxide synthase 1 (p 
transcription factor 6-like 1 (mitochond 
natriuretic peptide precursor C 
phospholipase A2, group IVA (cytosolic, 
ubiquitin-conjugating enzyme E2A (RAD6 h 
peptidylprolyl isomerase F (cyclophilin 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
sphingomyelin phosphodiesterase 1, acid 
cell division cycle 4-like 
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101812 


M86934 


BE439894 


Hs.78991 


101813 


M87338 


NM_002914Hs.139226 


133396 


M96326 


M96326 


Hs.72885 


428161 


M96954 


M96954 


Hs.182741 


129026 


M98833 


AL120297 


Hs.108043 


101901 


S66793 


H38026 


Hs.308 


134831 


S72370 


AA853479 


Hs.89890 


134039 


S78569 


NM_002290Hs.78672 


442355 


S79873 


AA456539 


Hs.8262 


101975 


S83325 


AA079717 


Hs.283664 


101977 


S83364 


AF1 12213 


Hs.184062 


101978 


S83365 


BE561610 


Hs.5809 


101998 


U01212 


U01212 


Hs.248153 


102003 


U01922 


U01922 


Hs.125565 


102007 


U02556 


U02556 


Hs.75307 


102009 


U02680 


BE245149 


Hs.82643 


416658 


U03272 


U03272 


Hs.79432 


132951 


U04209 


AW821182 Hs.61418 


135389 


U05237 


U05237 


Hs.99872 


102048 


U07225 


U07225 


Hs.339 


130145 


U07620 


U34820 


Hs.151051 


303153 


U09759 


U09759 


Hs.246857 


420269 


U09820 


U72937 


Hs.96264 


102095 


U11313 


U11313 


Hs.75760 


102123 


U14518 


NIVL001809HS.1594 


102126 


U14575 


AW950870 Hs.78961 


102133 


U15173 


AU076845 Hs.155596 


102139 


U15932 


NM 004419HS.2128 


102162 


U18291 


AA450274 


Hs.1592 


102164 


U18300 


NM_000107Hs.77602 


4-27653 


U18383 


AA159001 


Hs.180069 


131817 


U20536 


U20536 


Hs.3280 


102200 


U21551 


AA232362 


Hs.157205 


102210 


U23028 


BE619413 


Hs.2437 


102214 


U23752 


U23752 


Hs.32964 


132811 


U25435 


U25435 


Hs.57419 


131319 


U25997 


NM_003155Hs.25590 


102256 


U28251 


U28251 


Hs.53237 


132316 


U28831 


U28831 


Hs.44566 


102269 


U30245 


U30245 




417526 


U32315 


AA568906 


Hs.82240 


102293 


U32439 


AF090116 


Hs.79348 


102298 


U32849 


AA382169 


Hs.54483 


102325 


U35139 


AI815867 


Hs.50130 


428734 


U36764 


BE303044 


Hs.192023 


102361 


U39400 


AA223616 


Hs.75859 


102367 


U39657 


U39656 


Hs.1 18825 


102388 


U41344 


AA362907 


Hs.76494 


102394 


U41766 


NM_003816Hs.2442 


129829 


U41813 


AF010258 


Hs. 127428 


102409 


U43286 


BE300330 


Hs. 11 8725 


133746 


U44378 


AW410035 Hs.75862 


102423 


U44754 


Z47542 


Hs.179312 


132828 


U47011 


AB014615 


Hs.57710 


132828 


U47011 


AB014615 


Hs.57710 


132828 


U47011 


AB014615 


Hs.57710 


132828 


U47011 


AB014615 


Hs.57710 


425322 


U47077 


U63630 


Hs.155637 


102450 


U48251 


U48251 


Hs.75871 


129350 


U50535 


U50535 


Hs.1 10630 


102534 


U56833 


U96759 


Hs. 198307 


130457 


U58091 


AB014595 


Hs.155976 


135065 


U58837 


AA019401 


Hs.93909 


102560 


U59289 


R97457 


Hs.63984 


102567 


U59863 


U63830 


Hs.146847 


417173 


U67122 


U61397 


Hs.81424 


102638 


U67319 


U67319 


Hs.9216 


132736 


U68019 


AW081883 Hs.211578 


133070 


U69611 


U92649 


Hs.64311 


102663 


U70322 


NM_002270Hs.168075 


134660 


U73524 


U73524 


Hs.87465 


102735 


U79267 


AF111106 


Hs.3382 


102741 


U79291 


AW959829 Hs.83572 


130564 


U82671 


U82671 


Hs.36980 


130564 


U82671 


U82671 


Hs.36980 


132164 


U84573 


AI752235 


Hs.41270 



DNA segment, numerous copies, expressed 
replication factor C (activator 1) 2 (40 
azurocidin 1 (cationic antimicrobial pro 
TIA1 cytotoxic granule-associated RNA-bi 
Friend leukemia virus integration 1 
arrestin 3, retinal (X-arrestin) 
pyruvate carboxylase 
lamlnin, alpha 4 

lysosomal-associated membrane protein 2 

aspartate beta-hydroxylase 

putative RabS-interacting protein 

putative transmembrane protein; homolog 

olfactory marker protein 

translocase of inner mitochondrial membr 

t-complex-associated-testis-expressed 1- 

protein tyrosine kinase 9 

fibrillin 2 (congenital contractural ara 

microfibrillar-associated protein 1 

fetal Alzheimer antigen 

purinergic receptor P2Y, G-protein coupl 

mitogen-aclivated protein kinase 10 

mitogen-activated protein kinase 9 

alpha thaiassemia/menta! retardation syn 

sterol carrier protein 2 

centromere protein A (17kD) 

protein phosphatase 1, regulatory (inhib 

BCL2/adenovirus E1B 19kD-interacting pro 

dual specificity phosphatase 5 

CDC16 (cell division cycle 16, S. cerevi 

damage-specific DNA binding protein 2 (4 

nuclear respiratory factor 1 

caspase 6, apoptosis-related cysteine pr 

branched chain aminotransferase 1, cytos 

eukaryotic translation initiation factor 

SRY (sex determining region Y)-box 1 1 

CCCTC-binding factor (zinc finger protei 

stanniocalcin 1 

ESTs, Highly similar to Z169.HUMAN ZINC 
KIAA1641 protein 

gb:Human myelomonocytic specific protein 
syntaxin 3A 

regulator of G-protein signalling 7 
N-myc(and STAT) interactor 
necdin (mouse) homolog 
eukaryotic translation initiation factor 
chromosome 11 open reading frame 4 
mitogen-activated protein kinase kinase 
proline arginine-rich end leucine-rich r 
a disintegrin and metalloproteinase doma 
homeo box A9 

selenophosphate synthetase 2 
MAD (mothers against decapentaplegic, Dr 
small nuclear RNA activating complex, po 
fibroblast growth factor 8 (androgen-ind 
fibroblast growth factor 8 (androgen-ind 
fibroblast growth factor 8 (androgen-ind 
fibroblast growth factor 8 (androgen-ind 
protein kinase, DNA-activated, catalytic 
protein kinase C binding protein 1 
Human BRCA2 region, mRNA sequence CG006 
von Hippel-Lindau binding protein 1 
cullin 4B 

cyclic nucleotide gated channel beta 1 

cadherin 13, H-cadherin (heart) 

TRAF family member-associated NFKB activ 

ubiquitin-like 1 (sentrin) 

caspase 7, apoptosis-related cysteine pr 

Homo sapiens cDNA: FU23037 fis, clone L 

a disintegrin and metalloproteinase doma 

karyopherin (importln) beta 2 

ATP/GTP-binding protein 

protein phosphatase 4, regulatory subuni 

hypothetical protein MGC 14433 

melanoma antigen, family A, 2 

melanoma antigen, family A, 2 

procollagen-lysine, 2-oxoglutarate 5-dto 
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102823 


U90914 


D85390 Hs.5057 


102826 


U91316 


NM 007274HS.8679 


102831 


U91932 


AA262170 Hs.80917 


102846 


U96131 


BE264974 Hs.6566 


129777 


U97018 


U97018 Hs.12451 


134161 


U97188 


AA634543 Hs.79440 


134854 


V00503 


J03464 Hs.179573 


429257 


X04327 


AW163799 Hs.198365 


413985 


X06389 


AI018666 Hs.75667 


419768 


X07496 


T72104 Hs.93194 


102915 


X07820 


X07820 Hs.2258 


134656 


X14787 


AI750878 Hs.87409 


413858 


X15525 


NM 001610HS.75589 


102968 


X16396 


AU076611 Hs.1 54672 


102971 


X16609 


X16609 Hs.183805 


134037 


X53586 


AI808780 Hs.227730 


134037 


X53586 


AI808780 Hs.227730 


103023 


X53793 


AW500470 Hs.1 17950 


103037 


X54936 


BE018302 Hs.2894 


130282 


X55740 


BE245380 Hs.153952 


134542 


X57025 


M14156 Hs.85112 


128568 


X60673 


H12912 Hs.274691 


128568 


X60673 


H12912 Hs.274691 


103093 


X60708 


S79876 Hs.44926 


413076 


X62048 


U10564 Hs.75188 


129063 


X63097 


X63094 Hs.283822 


424460 


X63563 


BE275979 Hs.296014 


411077 


X64037 


AW977263 Hs.68257 


103181 


X69636 


X69636 Hs.334731 


103184 


X69878 


U43143 Hs.74049 


103194 


X70649 


NMJ)04939Hs.78580 


103208 


X72841 


AW411340 Hs.31314 


129698 


X74987 


BE242144 Hs.12013 


131486 


X83107 


F06972 Hs.27372 


130729 


X84194 


AI963747 Hs.18573 


103334 


X85753 


NM 001260HS.25283 


132645 


X87870 


AI654712 Hs.54424 


135094 


X89066 


NM 003304HS.250687 


103352 


X89398 


H09366 Hs.78853 


103352 


X89398 


H09366 Hs.78853 


103353 


X89399 


X89399 Hs.1 19274 


132173 


X89426 


X89426 Hs.41716 


103371 


X91247 


X91247 Hs.13046 


131584 


X91648 


AA598509 Hs.29117 


103376 


X92098 


AL036166 Hs.323378 


103378 


X92110 


AL119690 Hs.153618 


128510 


X94703 


X94703 


103410 


X96506 


AA158294 Hs.295362 


133490 


X97230 


AF022044 Hs.274601 


332689 


X97230 


AF022044 Hs.274601 


103438 


X98263 


AW175781 Hs.152720 


103440 


X98296 


X98296 Hs.77578 


103452 


X99584 


NM 006936HS.85119 


133536 


Y00264 


W25797.comp 


420234 


Y07566 


AW404908 Hs.96038 


426502 


Y07759 


Y07759 Hs.170157 


134662 


Y07827 


NMJ)07048Hs.284283 


132083 


Y07867 


BE386490 Hs.279663 


103500 


Y09443 


AW408009 Hs.22580 


134389 


Y09858 


Y09858 Hs.82577 


132084 


Y12394 


NM 002267HS.3886 


103540 


Z11559 


NM 002197HS.154721 


133152 


Z11695 


211695 Hs.324473 


103548 


Z15005 


Z15005 Hs.75573 


103612 


Z46261 


BE336654 Hs.70937 


129092 


AA011243 


D56365 Hs.63525 


103692 


AA018418 


AW137912 Hs.227583 


103695 


AA018758 


' AW207152 Hs.186600 


129796 


AA018804 


BE218319 Hs.5807 


434993 


AA031993 


AA306325 Hs.4311 


132683 


AA044217 


BE264633 Hs.143638 


131887 


AA046548 


W17064 Hs.332848 


103723 


AA057447 


BE274312 Hs.214783 


453368 


AA058376 


W20296 Hs.288178 


133260 


AA083572 


AA403045 Hs.6906 


103765 


AA085696 


AA085696 Hs.169600 



carboxypeptidase D 

cytosoBc acyl coenzyme A thioester hydr 
adaptor-related protein complex 3, Sigma 
thyroid hormone receptor interactor 13 
echinoderm microtubule-associated protei 
IGF-II mRNA-binding protein 3 
collagen, type I, alpha 2 
2,3-bisphosphoglycerate mutase 
synaptophysin 
apolipoprotein A-l 

matrix metalloproteinase 10 (stromelysin 

thrombospondin 1 

acid phosphatase 2, lysosomal 

methylene tetrahydrofolate dehydrogenase 

ankyrin 1, erythrocytic 

integrin, alpha 6 

tntegrin, alpha 6 

multifunctional polypeptide similar to S 

placental growth factor, vascular endoth 

5' nucleotidase (CD73) 

insuiin-like growth factor 1 (somatomedi 

adenylate kinase 3 

adenylate kinase 3 

dipeptidylpeptidase IV (CD26, adenosine 

weel (S. pombe) homolog 

Rhesus blood group, D antigen 

polymerase (RNA) II (DNA directed) polyp 

general transcription factor IIF, polype 

Homo sapiens, clone IMAGE:3448306, mRNA, 

fms-related tyrosine kinase 4 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 

retinoblastoma-btnding protein 7 

ATP-binding cassette, sub-family E (OABP 

BMX non-receptor tyrosine kinase 

acylphosphatase 1 , erythrocyte (common) 

cyclin-dependent kinase 8 

hepatocyte nuclear factor 4, alpha 

transient receptor potential channel 1 

uracil-DNAglycosylase 

uracil-DNA glycosylase 

RAS p21 protein activator (GTPase activa 

endothelial cell-specific molecule 1 

thioredoxin reductase 1 

purine-rich element binding protein A 

coated vesicle membrane protein 

HCGVIll-1 protein 

RAB28, member RAS oncogene family 

DR1 -associated protein 1 (negative cofac 

killer cell immunoglobuiin-like receptor 

killer cell immunogtobulin-like receptor 

M-phase phosphoprotein 6 

ubiquitin specific protease 9, X chromos 

SMT3 (suppressor of mif two 3, yeast) ho 

Hs. 1 77486 amyloid beta (A4) precursor protein (pro 

Ric (Drosophila)-like, expressed in many 

myosin VA (heavy polypeptide 12, myoxin) 

butyrophilin, subfamily 3, member A1 

Pirin 

alkylglycerone phosphate synthase 
spindiin-like 

karyopherin alpha 3 (importin alpha 4) 

aconitasel, soluble 

mitogen-activated protein kinase 1 

centromere protein E (312kD) 

H3 histone family, member A 

poly(rC)-binding protein 2 

Homo sapiens chromosome X map Xp1 1 .23 L- 

ESTs 

GTPase Rab14 

SUMO-1 activating enzyme subunit 2 
WD repeat domain 4 
SWI/SNF related, matrix associated, acti 
Homo sapiens cDNA FLJ14041 fis, clone HE 
Homo sapiens cDNA FLJ1 1968 fis, clone HE 
Homo sapiens cDNA: FU23197 fis, clone R 
KIAA0826 protein 
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103766 


AA088744 


AI920783 


Hs.191435 


103767 


AA089688 


BE244667 




132051 


AA091284 


AA393968 


Hs.1 80145 


103773 


AA092700 


AI219323 


Hs.101077 


135289 


AA092968 


AW372569 


Hs.9788 


409659 


AA094800 


AW970843 


Hs.55682 


103794 


AA100219 


AF244135 


Hs.30670 


131471 


AA1 14885 


AA164842 


Hs.192619 


134319 


AA129547 


BE304999 


Hs.285754 


103807 


AA133016 


AW958264 


Hs.103832 


446392 


AA149507 


AF142419 


Hs.15020 


129863 


AA151005 


BE379765 


Hs.1 29872 


103850 


AA187101 


AA187101 


Hs.213194 


103855 


AA195179 


W02363 




103861 


AA206236 


AA206236 


Hs.4944 


130634 


AA227621 


AI769067 


Hs.1 27824 


447735 


AA248283 


AA775268 


Hs.6127 


103909 


AA249611 


AA249611 


Hs.47438 


458928 


AA282640 


AF043117 


Hs.24594 


415824 


AA287199 


D42039 


Hs.78871 


129013 


AA313990 


AA371156 


Hs.107942 


129435 


AA314256 


AF151852 


Hs.111449 


103988 


AA314389 


AA314389 


Hs.342849 


104000 


AA324364 


AI146527 


Hs.80475 


425284 


AA329211 


AF1 55568 


Hs.348043 


128629 


AA399187 


AL096748 


Hs.102708 


133281 


AA421079 


AK001601 


Hs.69594 


104104 


AA422029 


AA422029 


Hs.143640 


332455 


AA425230 


NM_005754Hs.220689 


132091 


AA447052 


AW954243 




135073 


AA452000 


W55956 


Hs.94030 


131367 


AA456687 


AI750575 


Hs.173933 


129593 


AA487015 


AI338247 


Hs.98314 


133505 


C01527 


AI630124 


Hs.324504 


132064 


C01714 


AA121098 


Hs.3838 


442351 


C01811 


W52642 


Hs.8261 


131427 


C02352 


AF151879 


Hs.26706 


433892 


C02375 


AI929357 


Hs.323966 


104282 


C14448 


C14448 


Hs.332338 


134827 


016611 


BE314037 


Hs.89866 


425330 


D25216 


D25216 


Hs.1 55650 


131742 


D31352 


AA961420 


Hs.31433 


456935 


D58024 


AA370362 


Hs.57958 


425218 


D80897 


NM_014909Hs.155182 


104334 


D82614 


D82614 


Hs.78771 


134593 


D87845 


NMJ)00437Hs.234392 


134731 


D89377 


D89377 


Hs.89404 


445776 


H06583 


NMJ01310HS.13313 


131670 


H40732 


H03514 


Hs.15589 


104394 


H46617 


AA1 29551 


Hs.1 721 29 


104402 


H56731 


H56731 


Hs.132956 


439130 


H75570 


AA306090 


Hs.124707 


129077 


H78886 


N74724 


Hs. 108479 


104417 


H81241 


AI819448 


Hs.320861 


134927 


L36531 


L36531 


Hs.91296 


129280 


M63154 


M63154 


Hs.110014 


134498 


M63180 


AW246273 


Hs.84131 


104460 


M91504 


AW955705 


Hs.62604 


104488 


N56191 


N56191 


Hs.106511 


131248 


N78483 


AI038989 


Hs.332633 


130017 


R14652 


AK000096 


Hs.143198 


104530 


R20459 


AK001676 


Hs.12457 


104534 


R22303 


R22303 




104544 


R33779 


AI091173 


Hs.222362 


133328 


R36553 


AW452738 


Hs.265327 


104567 


R64534 


AA040620 


Hs.5672 


129575 


R70621 


F08282 


Hs.278428 


130776 


R79356 


AF1 67706 


Hs.19280 


104599 


R84933 


AW815036 


Hs.151251 


104660 


AA007160 


BE298665 


Hs.14846 


104667 


AA007234 


AI239923 


Hs.63931 


104718 


AA018409 


AI143020 


Hs.36250 


104764 


M025351 


AI039243 


Hs.2785B5 


104786 


AA027168 


AA0271 67 


Hs.10031 


104787 


AA027317 


AA027317 




134079 


AA029423 


AK001751 


Hs.171835 



ESTs 

CGI-100 protein 
HSPC030 protein 

ESTs, Weakly similar to T22363 hypotheti 

hypothetical protein MGC 10924 similar to 

eukaryotic translation initiation factor 

hepatocellular carcinoma-associated anti 

KIAA1600 protein 

fumarate hydratase 

similar to yeast Upf3, variant B 

homolog of mouse quaking QKI (KH domain 

sperm associated antigen 9 

hypothetical protein MGC10895 

hypothetical protein FU10330 

hypothetical protein FU 12783 

ESTs, Weakly similar to T28770 hypotheti 

Homo sapiens cDNA: FU23020 fis, clone L 

SH3 domain binding glutamic acid-rich pr 

ubiquitination factor E4B (homologous to 

mesoderm development candidate 2 

DKFZP564M112 protein 

CGl-94 protein 

ADP-ribosylation factor-like 5 

polymerase (RNA) II (DNA directed) polyp 

NS1 -associated protein 1 

DKFZP434A043 protein 

high-mobiiity group 20A 

ESTs, Weakly similar to hyperpolarizatio 

Ras-GTPase-activating protein SH3-domain 

KIAA0251 protein 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 
nuclear factor l/A 

Homo sapiens mRNA; cDNA DKFZp586L0120 (f 
Homo sapiens mRNA; cDNA DKFZp586J0720 (f 
serum-inducible kinase 
hypothetical protein FLJ22393 
CGI-121 protein 

Homo sapiens clone H63 unknown mRNA 
EST 

coproporphyrinogen oxidase (coproporphyr 

KIAA0014 gene product 

ESTs 

EGF-TM7-latrophilin-related protein 
KIAA1036 protein 
phosphoglycerate kinase 1 
platelet-activating factor acetylhydrola 
msh (Drosophila) homeo box homolog 2 
cAMP responsive element binding protein- 
ESTs 

Homo sapiens cDNA: FLJ21409 fis, clone C 

ESTs 

ESTs 

ESTs 

KruppeHike factor 8 
integrin, alpha 8 

gastric intrinsic factor (vitamin B synt 

threonyl-tRNA synthetase 

Homo sapiens, clone IMAGE:4299322, mRNA, 

protocadherin 17 

Bardet-Biedl syndrome 2 

inhibitor of growth family, member 3 

hypothetical protein FLJ10814 

gb:yh26b09.r1 Soares placenta Nb2HP Homo 

ESTs, Weakly similar to p40 [H.sapiens] 

hypothetical protein DKFZp761l141 

hypothetical protein AF140225 

progestin induced protein 

cysteine-rich motor neuron 1 

ESTs 

Homo sapiens mRNA; cDNA DKFZp564D016 (fr 
ESTs 

ESTs, Weakly similar to I38022 hypotheti 
ESTs 

KIAA0955 protein 

gb:ze97d11.s1 Soares_fetaLhearLNbHH19W 
hypothetical protein FLJ 10889 
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104804 


AA031357 


AI858702 


Hs.31803 


104865 


AA045136 


T79340 


Hs.22575 


130828 


AA053400 


AW631469 Hs.203213 


104907 


AA055829 


AA055829 


Hs.196701 


104943 


AA065217 


AF072873 


Hs.1 14218 


105013 


AA1 16054 


H63789 


Hs.296288 


105024 


AA126311 


AA126311 


Hs.9879 


132592 


AA129390 


AW803564 Hs.288650 


105038 


AA130273 


AW503733 Hs.9414 


105077 


AA142919 


W55946 


Hs.234863 


105096 


AA150205 


AL042506 


Hs.21599 


129215 


AA1 76867 


AB040930 


Hs.1 26085 


105169 


AA180321 


BE245294 


Hs.180789 


132796 


AA180487 


NM_006283Hs.173159 


427210 


AA187634 


BE396283 


Hs.173987 


105200 


AA195399 


AA328102 


Hs.24641 


130114 


AA234717 


AA233393 


Hs.14992 


105330 


AA234743 


AW338625 Hs.22120 


105337 


M234957 


AI468789 


Hs.347187 


422040 


AA235604 


AA172106 


Hs.1 10950 


105376 


AA236559 


AW994032 Hs.8768 


105397 


AA242868 


AA814807 


Hs.7395 


431679 


AA251776 


AK000046 


Hs.343877 


131991 


AA251909 


AF053306 


Hs.36708 


421305 


AA252672 


BE397354 


Hs,324830 


105489 


AA256157 


AA256157 


Hs.24115 


105508 


AA256680 


AA173942 


Hs.326416 


105539 


AA258873 


AB040884 


Hs.109694 


135172 


AA262727 


AB028956 


Hs.12144 


131569 


AA281451 


AL389951 


Hs.271623 


431129 


AA281545 


AL137751 


Hs.263671 


105643 


AA282069 


BE621719 


Hs.173802 


105659 


AA283044 


AA283044 


Hs.25625 


105666 


AA283930 


AA426234 


Hs.34906 


105674 


AA284755 


AI609530 


Hs.279789 


105709 


AA291268 


AI928962 


Hs.26761 


105722 


AA291927 


AI922821 


Hs.32433 


105765 


AA343514 


AA299688 


Hs.24183 


115951 


AA398109 


BE546245 


Hs.301048 


130884 


AA398109 


BE546245 


Hs.301048 


105962 


AA405737 


AW880358 Hs.339808 


105985 


M406610 


AA4Q6610 




106008 


AA411465 


AB033888 


Hs.8619 


457322 


AA416886 


AI815486 


Hs.243901 


134222 


AA424013 


AW855861 Hs.8025 


446954 


AA424148 


AB037850 


Hs.16621 


106141 


AA424558 


AF031463 


Hs.9302 


447973 


AA424961 


AB011169 


Hs.20141 


106157 


AA425367 


W37943 


Hs.34892 


428314 


AA425921 


A W1 35049 Hs.26285 


446727 


AA426220 


AB011095 


Hs.16032 


106196 


AA427735 


AA525993 


Hs.173699 


457714 


AA430673 


AA083764 




133200 


AA432248 


AB037715 


Hs.183639 


106302 


M435896 


AA398859 


Hs.18397 


106328 


AA436705 


AL079559 


Hs.28020 


450534 


AA446561 


AI570189 


Hs.25132 


106423 


AA448238 


AB020722 


Hs.16714 


439608 


AA449756 


AW864696 Hs.301732 


106477 


M450303 


R23324 


Hs.41693 


106503 


M452411 


AB033042 


Hs.29679 


446999 


AA454566 


AA151520 




106543 


AA454667 


AA676939 


Hs.69285 


442007 


M456437 


AA301116 


Hs.142838 


106589 


AA456646 


AK000933 


Hs.28661 


106593 


AA456826 


AW296451 Hs.24605 


106596 


AA456981 


AA452379 




423064 


AA458959 


AF265203 


Hs.8740 


106636 


AA459950 


AW958037 Hs.286 


106654 


AA460449 


AW075485 Hs.286049 


131353 


AA463910 


AW754182 




106707 


AA464603 


AK000566 


Hs.98135 


452909 


AA464606 


NM_015368Hs.309B5 


106717 


M465093 


AA600357 


Hs.239489 


453141 


AA465692 


AB014548 


Hs.31921 


106747 


M476473 


NMJ»7118Hs.171957 



ESTs, Weakly similar to N-WASP [H.sapien 
B-cell CLL/lymphoma 6, member B (zinc fi 
ESTs 

ESTs, Weakly similar to ALU1_HUMAN ALU S 

frizzled (DrosophOa) homolog 6 

ESTs, Weakly similar to KIAA0638 protein 

ESTs 

Homo sapiens cONA: FU22528 fis, done H 
KIAA1488 protein 

Homo sapiens cDNA FU12082 fis, clone HE 
Kruppel-like factor 7 (ubiquitous) 
KIAA1497 protein 
S164 protein 

transforming, acidic coiled -coil contain 
eukaryotic translation initiation factor 
cytoskeleton associated protein 2 
hypothetical protein FU1 1 151 
ESTs 

myotubularin related protein 1 
Rag C protein 

hypothetical protein FU10849 
hypothetical protein FLJ23182 
hypothetical protein FLJ20039 
budding uninhibited by benzimidazoles 1 
diptheria toxin resistance protein requi 
Homo sapiens cDNA FLJ14178 fis, clone NT 
Homo sapiens mRNA; cDNA DKFZp564H1916 (f 
KIAA1451 protein 
KIAA1033 protein 
nucleoporin 50kD 

Homo sapiens mRNA; cDNA DKFZp434l0812 (f 

KIAA0603 gene product 

hypothetical protein FLJ11323 

ESTs, Weakly similar to T17210 hypotheti 

histone deacetyiase 3 

DKFZP586L0724 protein 

ESTs 

ESTs 

sec13-like protein 

sec13-like protein 

hypothetical protein FLJ10120 

gb:zv15b10.s1 Soares_NhHMPu_S1 Homosapi 

SRY (sex determining region Y)-box 18 

Homo sapiens cDNA FLJ20738 fis, clone HE 

Homo sapiens clone 23767 and 23782 mRNA 

DKFZP434I1 16 protein 

phosducin-like 

similar to S. cerevisiae SSM4 
KIAA1323 protein 

Homo sapiens cDNA FLJ10643 fis, clone NT 
KIAA0523 protein 

ESTs, Weakly similar to ALU1JHUMAN ALU S 

hypothetical protein MGC3178 

hypothetical protein FLJ10210 

hypothetical protein FLJ23221 

KIAA0766 gene product 

KIAA0470 gene product 

Rho guanine exchange factor (GEF) 15 

hypothetical protein MGC53G6 

DnaJ (Hsp40) homolog, subfamily B, membe 

cofactor required for Sp1 transcriptiona 

hypothetical protein MGC4485 

neuropilin 1 

nucleolar phosphoprotein Nopp34 

Homo sapiens cDNA aJ10071 fis, clone HE 

ESTs 

ESTs, Moderately similar to ALU7JHUMAN A 
SWI/SNF related, matrix associated, acti 
ribosomal protein L4 
phosphoserine aminotransferase 
gb:RC2-CT0321-131 199-01 1-c01 CT0321 Homo 
hypothetical protein FLJ20559 
pannexin 1 

TIA1 cytotoxic granule-associated RNA-bi 
KIAA0648 protein 

triple functional domain (PTPRF interact 
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106773 


AA478109 


AA478109 


Hs.188833 


106781 


AA478474 


AA330310 


Hs.24181 


106817 


AA480889 


D61216 


Hs.18672 


106846 


AA485223 


AB037744 


Hs.34892 


106848 


AA485254 


AA449014 


Hs.121025 


106856 


AA486183 


W58353 


Hs.285123 


418699 


AA496936 


BE539639 


Hs.173030 


107001 


AA598589 


AI926520 


Hs.31016 


442853 


AA598831 


AW021276 


Hs.17121 


107054 


AA600150 


AI076459 


Hs.15978 


107059 


AA60B545 


BE614410 


Hs.23044 


107080 


AA609210 


AL122043 


Hs.19221 


107115 


AA610108 


BE379623 


Hs.27693 


107130 


AA620582 


AB033105 


Hs.12913 


107156 


AA621239 


AA137043 


Hs.9663 


107174 


AA621714 


BE122762 


Hs.25338 


130621 


AA621718 


AW513087 


Hs.16803 


107190 


D19673 


AA836401 


Hs.87860 


132626 


D25755 


AW504732 


Hs,21275 


107217 


D51095 


AL080235 


Hs.35861 


332584 


D60272 


AA357879 


Hs.29423 


444655 


T08879 


AF0B8886 


Hs.11590 


107295 


T34527 


AA1 86629 


Hs.80120 


107299 


T40327 


BE277457 


Hs.30661 


107315 


T62771 


AA316241 


Hs.90691 


107316 


T63174 


T63174 


Hs.193700 


107328 


T83444 


AW959891 


Hs.76591 


107334 


T93641 


T93597 


Hs.187429 


456340 


U48263 


U48263 


Hs.89040 


128636 


U49065 


U49065 


Hs.102865 


129938 


U79300 


AW003668 


Hs.135587 


107375 


U88573 


BE011845 


Hs.251064 


130074 


U93867 


AL038596 


Hs.250745 


107387 


W01094 


D86983 


Hs.1 18893 


132036 


W01568 


AL1 57433 


Hs.37706 


107426 


W26853 


W26853 


Hs.291003 


135388 


W27965 


W27965 


Hs.99865 


130419 


W36280 


AF037448 


Hs.155489 


107469 


W47063 


W47063 


Hs.94668 


434203 


W79060 


BE262677 


Hs.283558 


107506 


W38550 


AB028981 


Hs.8021 


132358 


X60486 


NM_003542Hs.46423 


107522 


X78931 


X78931 


Hs.99971 


456495 


Z14077 


NMJ)03403Hs.97496 


107582 


AA002147 


AA002147 


Hs.59952 


107609 


AA004711 


R75654 


Hs. 164797 


107661 


AA010383 


AAO10383 


Hs.60389 


107714 


AA015761 


AA015761 


Hs.60642 


107775 


AA018772 


AW008846 


Hs.60857 


107832 


AA021473 


AA021473 




107859 


M024835 


AW732573 


Hs.47584 


107914 


AA027229 


AA027229 


Hs.61329 


107935 


AA029428 


AA029428 


Hs.61555 


410196 


AA035143 


AI936442 


Hs.59838 


131461 


AA035237 


AA992841 


Hs,27263 


108007 


AA039347 


AA039347 


Hs.61916 


108029 


AA040740 


AA040740 


Hs.62007 


108040 


AA041551 


AL1 21031 


Hs.159971 


108084 


AA045513 


AA058944 


Hs.116602 


108088 


AA045745 


AA045745 


Hs.62886 


108168 


AA055348 


AI453137 


Hs.63176 


130719 


AA056582 


AA679262 


Hs.14235 


108189 


M056697 


AW376061 


Hs.63335 


108190 


AA056746 


AA056746 


Hs.63338 


108203 


AA057678 


AW847814 


Hs.289005 


108216 


AA058681 


AA524743 


Hs.44883 


108217 


AA058686 


AA058686 


Hs.62588 


108245 


AA062840 


BE410285 


Hs.89545 


108277 


AA064859 


AA064859 




108280 


AA065069 


AA065069 




108309 


AA069923 


AA069818 




108340 


AA070815 


AA069820 


Hs.180909 


108403 


AA075374 


AA075374 




108427 


AA076382 


AA076382 




108435 


AA078787 


T82427 


Hs.194101 


108439 


AA078986 


AA078986 





ESTs 
ESTs 
ESTs 

K1AA1323 protein 

chromosome 1 1 open reading frame 5 
Homo sapiens mRNA full length insert cDN 
ESTs, Weakly similar to ALU8_HUMAN ALU S 
putative DNA binding protein 
ESTs 

KIAA1272 protein 

RAD51 (S. cerevisiae) homolog (E coli Re 
hypothetical protein DKFZp566G1424 
peptidylprolyl isomerase (cyclophilinH 
KIAA1280 protein 

programmed cell death 6-interacting prot 
ESTs 

LUC7 (S. cerevisiae)-like 
ESTs 

hypothetical protein FLJ1101 1 
DKFZP586E1621 protein 
ESTs; Weakly similar to macrophage lecti 
cathepsin F 

UDP-N-acetyl-aipha-D-gaIacto$amine:polyp 

hypothetical protein MGC4606 

nucleophosmin/nucleoplasmin 3 

Homo sapiens mRNA; cDNA DKFZp586l0324 (f 

KIAA0887 protein 

ESTs 

prepronociceptin 
interleukin 1 receptor-like 2 
Human clone 23629 mRNA sequence 
high-mobility group (nonhistone chromoso 
polymerase (RNA) III (DNA directed) (62k 
Melanoma associated gene 
hypothetical protein DKFZp434E2220 
hypothetical protein MGC4707 
epimorphin 

NS1 -associated protein 1 
ESTs 

hypothetical protein PR01855 

KIAA1058 protein 

H4 histone family, member G 

zinc finger protein 272 

YY1 transcription factor 

EST 

hypothetical protein FLJ13693 

ESTs 

ESTs 

ESTs 

gb:ze66c11.s1 Soares retina N2b4HR Homo 
potassium voltage-gated channel, delayed 
ESTs, Weakly similar to T1 6370 hypotheti 
ESTs 

hypothetical protein FLJ10808 

KIAA1458 protein 

EST 

ESTs 

SWI/SNF related, matrix associated, acti 
Homo sapiens, clone IMAGE:41 54008, mRNA, 
ESTs 
ESTs 

hypothetical protein FU20008; KIAA1839 
ESTs, Moderately similar to A46010 X-!in 
EST 

Homo sapiens cDNA: FLJ21532 fis, clone C 

ESTs 

ESTs 

proteasome (prosome, macropain) subunit, 
gb:zm50f03.s1 Stratagene fibroblast (937 
gb;zm12e11.s1 Stratagene pancreas (93720 
gb:zm67e03.r1 Stratagene neuroepithelium 
peroxiredoxin 1 

gb:zm87a01.s1 Stratagene ovarian cancer 
gb:zm91g08.s1 Stratagene ovarian cancer 
Homo sapiens cDNA: FLJ20869 fis, clone A 
gb:zm92h01.s1 Stratagene ovarian cancer 



133 



WO 02/079492 



108465 


AA079393 


AA079393 


Hs.3462 


108469 


AA079487 


AA079487 




108500 


AA083207 


AA083207 


Hs.68270 


108501 


AA083256 


AA083256 




108533 AA084415 


AA084415 




108562 AA085274 


AA1 00796 




108589 AA088678 


AI732404 


Hs.68846 


130890 M100925 


AI907537 


Hs.76698 


432645 AA101255 


D14041 


Hs.347340 


130385 AA126474 


AW067800 Hs.155223 


108749 


AA127017 


AA127017 


Hs.71052 


108807 


AA1 29968 


AI652236 


Hs.49376 


108808 


AA130240 


AA045088 


Hs.62738 


108833 


AA131866 


AF1 88527 


Hs.61661 


108846 


AA1 32983 


AL1 17452 


Hs.44155 


108857 


AA1 33250 


AK001468 


Hs.62180 


131474 


AA133583 


L46353 


Hs.2726 


108894 


AA135941 


AK001431 


Hs.5105 


108941 


AA148650 


AA148650 




108968 


AA151110 


AI304870 


Hs.188680 


108996 


AA1 55754 


AW995610 Hs.332436 


109001 


AA156125 


AI056548 


Hs.72116 


131183 


AA156289 


AI611807 


Hs.285107 


109019 M156997 


AA156755 


Hs.72150 


109022 AA157291 


AA157291 


Hs.21479 


109023 AA157293 


AA1 57293 


Hs.72168 


109068 AA164293 


AA164293 Hs72545 


109072 


AA1 64676 


AI732585 


Hs.22394 


426981 


AA1 67375 


AL044675 


Hs.173081 


130346 


AA1 67550 


H05769 


Hs.188757 


109146 


AA176589 


AA176589 


Hs.142078 


109172 


AA180448 


AA180448 


Hs.144300 


428438 


AA187144 


NM_001955Hs.2271 


129208 


AA189170 


AI587376 


Hs.109441 


109222 


AA192757 


AA1 92833 


Hs.333512 


109300 


AA205650 


AA418276 


Hs.170142 


109481 


AA233342 


AA878923 


Hs.289069 


109485 


AA233472 


BE619092 


Hs.28465 


109516 


AA234110 


AI471639 


Hs.71913 


109537 


D80981 


AI858695 


Hs.34898 


109556 


F01660 


AI925294 


Hs.87385 


109577 


F02206 


F02206 


Hs.296639 


109578 


F02208 


F02208 


Hs.27214 


109595 


F02544 


AA078629 


Hs.27301 


109625 


F03918 


H29490 


Hs.22697 


428376 


F04258 


AF119665 


Hs.184011 


109648 


F04600 


H17800 


Hs.7154 


109671 


F08998 


R59210 


Hs.26634 


109699 


F09605 


H18013 


Hs.167483 


109820 


F11115 


AW016809 


Hs.1 19021 


109933 


H06371 


R52417 


Hs.20945 


110014 


H10995 


AL109666 


Hs.7242 


110039 


H11938 


H11938 


Hs.21907 


110099 


H16568 


R44557 


Hs.23748 


110107 


H16772 


AW151660 


Hs.31444 


110155 


H18951 


AI559626 


Hs.93522 


110197 


H20659 


AW090386 


Hs.1 12278 


110223 


H23747 


H19836 


Hs.31697 


1 10306 


H38087 


H38087 


Hs.105509 


110335 


H40331 


H65490 


Hs.18845 


110342 


H40567 


H40961 


Hs.33008 


110395 


H46966 


AA025116 


Hs.33333 


110511 


H56640 


H56640 


Hs.221460 


110523 


H57154 


AI040384 


Hs.19102 


110715 


H96712 


H96712 


Hs.269029 


110754 


N20814 


AW302200 


Hs.6336 


428454 


N25249 


U55936 


Hs.184376 


431663 


N27100 


NM_016569Hs.267182 


134263 


N39616 


AW973443 Hs.8086 


110938 N48982 


N48982 


Hs.38034 


110983 


N51957 


NM_015367Hs.1O267 


111081 


N59435 


AI146349 


Hs.271614 


111128 


N64139 


AW505364 Hs.19074 


431548 


N66981 


AI834273 


Hs.9711 


111216 


N68640 


AW139408 Hs.152940 


437562 


N69352 


AB001636 


Hs.5683 



cytochrome c oxidase subunit Vile 
gb:zm97f08.s1 Stratagene colon HT29 (937 
EST 

gb:zn08g12.s1 Stratagene hNT neuron (937 
gb:zn06g09.s1 Stratagene hNT neuron (937 
gb:zm26c06.s1 Stratagene pancreas (93720 
ESTs 

stress-associated endoplasmic reticulum 
H-2K binding factor-2 
stanniocalcln 2 
ESTs 

hypothetical protein FLJ20644 
ESTs 

ESTs, Weakly similar to AF174605 1 F-box 

DKFZP586G1517 protein 

anillin (Drosophila Scraps homolog), act 

high-mobility group (nonhistone chromoso 

hypothetical protein FU10569 

gb:zo09e06.s1 Stratagene neuroepHhelium 

ESTs 

EST 

hypothetical protein FLJ20992 similar to 
hypothetical protein FLJ13397 
ESTs 

ubinuclein 1 

ESTs 

ESTs 

hypothetical protein FLJ10893 
KIAA0530 protein 

Homo sapiens, clone MGC:5564, mRNA, comp 

EST 

EST 

endothelin 1 
MSTP033 protein 
similar to rat myomegalin 
ESTs 

hypothetical protein FLJ21016 

Homo sapiens cDNA: FU21869 fis, clone H 

ESTs 

ESTs 

ESTs 

Homo sapiens potassium channel subunit ( 

ESTs 

ESTs 

ESTs 

pyrophosphatase (inorganic) 

ESTs 

ESTs 

ESTs 

ESTs 

Homo sapiens clone 24993 mRNA sequence 

Homo sapiens mRNA full length insert cDN 

histone acetyifransferase 

ESTs 

ESTs 

Homo sapiens mRNA for KIAA1647 protein, 

arrestin, beta 1 

ESTs 

CTL2gene 

ESTs 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to organic anion tr 
ESTs 

KIAA0672gene product 
synaptosomal-assoctated protein, 23kD 
TBX3-iso protein 

RNA (guanine-7-) methyltransferase 
Homo sapiens cDNA FLJ12924 fis, clone NT 
MIL1 protein 
CGI-1 12 protein 

LATS (large tumor suppressor, Drosophila 

novel protein 

ESTs 

DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
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111399 


R00138 


AW270776 Hs. 18857 


111514 


R07998 


R07998 




428744 


R08929 


BE267033 


Hs.192853 


111574 


R10307 


AI024145 


Hs.188526 


111804 


R33354 


AA482478 


Hs. 181785 


111831 


R36083 


R36095 


Hs.268695 


426773 


R37938 


NMJ)15556Hs.172180 


111904 


R39330 


Z41572 




428371 


R40816 


AB012193 


Hs.183874 


112033 


R43162 


R49031 


Hs.22627 


130987 


R45698 


BE613269 


Hs.21893 


1 12300 


R54554 


H24334 


Hs.26125 


112513 


R68425 


R68425 


Hs.13809 


112514 


R68568 


R68568 


Hs.183373 


112522 


R68763 


R68857 


Hs.265499 


112540 


R70467 


R69751 




428655 


R73565 


H05769 


Hs.188757 


129534 


R73640 


AK002126 


Hs.11260 


112597 


R78376 


R78376 


Hs.29733 


112732 


R92453 


R92453 


Hs.34590 


451798 


T03865 


BE297567 


Hs.27047 


1 12888 


T03872 


AW195317 Hs.107716 


131863 


T10072 


AI656378 


Hs.33461 


112911 


T10080 


AW732747 


Hs.13493 


132215 


T10132 


AL035703 


Hs.4236 


112931 


T15343 


T02966 


Hs.167428 


112984 


T23457 


T16971 


Hs.289014 


112998 


T23555 


H11257 


Hs.22968 


133376 


T23670 


BE618768 


Hs.7232 


113026 


T23948 


AA376654 




113070 


T33464 


AB032977 


Hs.6298 


410781 


T34413 


AI375672 


Hs.165028 


113074 


T34611 


AK001335 


Hs.31137 


113095 


T40920 


AA828380 


Hs.126733 


113179 


T55182 


BE622021 


Hs.152571 


113337 


T77453 


T77453 


Hs.302234 


113421 


T84039 


AI769400 


Hs.189729 


113454 


T86458 


AI022166 


Hs.16188 


113481 


T87693 


T87693 


Hs.204327 


453345 


T89350 


AA302862 


Hs.90063 


113557 


T90945 


H66470 


Hs.16004 


113559 


T90987 


T79763 


Hs.14514 


113589 


T91863 


AI078554 


Hs.15682 


113591 


T91881 


T91881 


Hs.200597 


113619 


T93783 


R08665 


Hs.17244 


113683 


T96687 


AB035335 


Hs.144519 


113692 


T96944 


AL360143 


Hs.17936 


113702 


T97307 


T97307 




113717 


T97764 


T99513 


Hs.187447 


113824 


W48817 


AI631964 


Hs.34447 


113840 


W58343 


R72137 


Hs.7949 


113844 


W59949 


AI369275 


Hs.243010 


113902 


W74644 


AA340111 


Hs.100009 


113904 


W74761 


AF1 25044 


Hs.19196 


113905 


W74802 


R81733 


Hs.33106 


113931 


W81205 


BE255499 


Hs.3495 


113932 


W81237 


AA256444 


Hs.126485 


131965 


W90146 


W79283 


Hs.35962 


114035 


W92798 


W92798 


Hs.269181 


114106 


Z38412 


AW602528 




457308 


Z38709 


AI416988 


Hs.238272 


114161 


Z38904 


BE548222 


Hs.299883 


424949 


Z39103 


AF052212 


Hs.153934 


457548 


Z39930 


AW069534 Hs.279583 


128937 


Z39939 


AA251380 


Hs.10726 


432554 


Z40012 


AI479813 


Hs.278411 


114277 


Z40377 


AI052229 


Hs.25373 


114304 


Z40820 


AI934204 


Hs.16129 


114364 


Z41680 


AL1 17427 


Hs.172778 


432620 


AA005112 


AA777749 


Hs.5978 


129034 


AA005432 


AA481 157 


Hs.108110 


131881 


AA010163 


AW361018 Hs.3383 


332421 


AA026356 


AI909968 


Hs.108106 


114465 


AA026901 


BE621056 


Hs.131731 


451271 


AA036867 


AK001644 


Hs.26156 


332498 


AA044644 


AA303661 





ESTs 

gb:yf16g11.s1 Soares fetal liver spleen 

ubiquitin-conjugating enzyme E2G 2 (homo 

ESTs 

ESTs 

ESTs 

KIAA0440 protein 

gb:HSCZYB122 normalized infant brain cDN 

cullin 4A 

ESTs 

hypothetical protein DKFZp761N0624 
ESTs 

hypothetical protein FLJ10648 

src homology 3 domain-containing protein 

ESTs 

gb:yi40a10.s1 Soares placenta Nb2HP Homo 

Homo sapiens, clone MGC:5564, mRNA, comp 

hypothetical protein FLJ1 1264 

EST 

ESTs 

hypothetical protein FLJ20392 
hypothetical protein FLJ22344 
ESTs 

like mouse brain protein E46 
K1AA0478 gene product 
ESTs 

ESTs, Weakly similar to A43932 mucin 2 p 
Homo sapiens clone IMAGE:451939, mRNA se 
acetyl-Coenzyme A carboxylase alpha 
eukaryotjc translation initiation factor 
KIAA1151 protein 
ESTs 

protein tyrosine phosphatase, receptor t 
ESTs 

ESTs, Highly similar to IGF-II mRNA-bind 

ESTs 

ESTs 

ESTs 

EST 

neurocalcin delta 

ESTs 

ESTs 

ESTs 

KIAA0563 gene product 
hypothetical protein FLJ 13605 
T-cell leukemia/lymphoma 6 
DKFZP434H132 protein 
gb:ye53h05.s1 Soares fetal liver spleen 
ESTs 
ESTs 

DKFZP586B2420 protein 

Homo sapiens cDNA FLJ14445 fis, clone HE 

acyl-Coenzyme A oxidase 1, palmitoyl 

ubiquitin-conjugating enzyme HBUCE1 

ESTs 

hypothetical protein MGC15749 
hypothetical protein FLJ12604; KIAA1692 
ESTs 
ESTs 

gb:RC5-BT0562-260100-011-A02 BT0562 Homo 
inositol 1,4,5-triphosphate receptor, ty 
hypothetical protein FLJ23399 
core-binding factor, runt domain, alpha 
CGI-81 protein 

ESTs, Weakly similar to ALU INHUMAN ALU S 

NCK-associated protein 1 

ESTs, Weakly similar to T2041 0 hypotheti 

ESTs 

Homo sapiens mRNA; cDNA DKFZp566P013 (fr 
LIM domain only 7 
DKFZP547E2110 protein 
upstream regulatory element binding prot 
transcription factor 
hypothetical protein FLJ11099 
hypothetical protein FLJ10782 
lymphocyte-specific protein 1 
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431555 AA046426 
132944 AA054515 
114618 AA084162 
332509 AA085749 
114648 AA101056 
114658 AA102746 
132456 AA1 14250 
450847 AA126561 
132225 AA128980 
437197 AA129757 
114709 AA129921 
456926 AA133331 
114750 AA135958 
426806 AA136524 
114763 AA147044 
114767 AA148885 
114774 AA150043 
129388 AA151621 
457742 AA155743 
456200 AA156335 
130207 AA1 56336 
114798 AA159181 
114800 AA159825 
114828 AA234185 
114846 AA234929 
114848 AA234935 
114902 AA236359 
132271 AA236466 
114907 AA236535 
420170 AA236935 
132204 AA236942 
114928 AA237018 
132481 AA237025 
114932 AA242751 
314162 AA242760 
131006 AA242763 
114935 AA242809 
408908 AA243133 
437754 AA243495 
114957 AA243706 
114974 AA250848 
114977 AA250868 
114995 AA251152 
115005 AA251544 
417177 AA251792 
115026 AA252144 
115045 AA252524 
115068 AA253461 
133138 AA255522 
332668 AA255522 
115114 AA256468 
129584 AA256528 
115137 AA257976 
417187 AA258296 

115166 AA258409 

115167 AA258421 
436719 AA262077 
115239 AA278650 
115243 AA278766 
428419 AA280791 
115322 AA280819 
413303 AA280828 
115372 AA282195 
409962 AA283127 
130269 AA284694 
456570 AA291137 
332675 AA291708 
407864 AA293495 
115536 AA347193 
408799 AA398474 
115575 AA398512 
115601 AA400277 
434428 AA400896 
115683 AA410345 
115715 AA416733 
132952 AA425154 



AI815470 Hs.260024 
T96641 Hs.6127 
AW979261 Hs.291993 
AA128376 Hs.153884 
AA101056 

AA1 02383 Hs.249190 
AB011084 Hs.48924 
NM_003155Hs.25590 
AA1 28980 
W38586 

AA397651 Hs.301959 
AB018284 Hs.158688 
AA887211 Hs.129467 
T19228 Hs.172572 
AA810755 Hs.102500 
AJ859865 Hs. 154443 
AV656017 Hs.184325 
AA662477 Hs.1 10964 
BE561824 Hs.273369 
AA768242 Hs.80618 
AF044209 Hs.144904 
AA159181 Hs.54900 
Z19448 Hs.131887 
AA252937 Hs.283522 
BE018682 Hs.166196 
BE614347 Hs.169615 
AW275480 Hs.39504 
AB030034 Hs.1 15175 
N29390 Hs.13804 
U43374 Hs.95631 
AA235827 Hs.42265 
AA237018 Hs.94869 
W93378 Hs.49614 
AA971436 Hs.16218 
BE041820 Hs.38516 
AF064104 Hs.22116 
H23329 Hs.290880 
BE296227 Hs.250822 
R60366 Hs.5822 
AW170425 Hs.87680 
AW966931 Hs.302649 
AW296978 Hs.d7787 
AA769266 Hs.193657 
AI760825 Hs.153042 
NM_004458Hs.81452 
AA251972 Hs.188718 
AW014549 Hs.58373 
AW512260 Hs.87767 
AV657594 Hs.181161 
AV657594 Hs.181161 
AA527548 Hs.7527 
AV656017 Hs.184325 
AW968304 Hs.56155 
AB011151 Hs.334659 
AF095727 Hs.287832 
AA749209 Hs.43728 
Y11192 Hs.5299 
BE251328 Hs.73291 
AA806600 Hs.1 16665 
U49436 

L08895 Hs.78995 
AW836130 Hs.75277 
AW014385 Hs.88678 
U82671 Hs.57698 
F05422 Hs.1 68352 
AA286914 Hs.183299 
BE439944 

AF069291 Hs.40539 
AK001468 Hs.62180 
AA059412 Hs.47986 
AA393254 Hs.43619 
AA1 48984 Hs.48849 
D14540 Hs.199160 
AF255910 Hs.54650 
8E395161 Hs.1390 
AI658580 Hs.61426 



Cdc42 effector protein 3 

Homo sapiens cDNA: FLJ23020 fis, clone L 

ESTs 

ATP binding protein associated with ceil 
gb:zn25b03.s1 Stratagene neuroepitheiium 
tumor necrosis factor receptor superfami 
KIAA0512 gene product; ALEX2 
stanniocaicin 1 

gb:zo09a11.s1 Stratagene neuroepitheiium 
guanine nucleotide binding protein (G pr 
proline synthetase co-transcribed (bacte 
KIAA0741 gene product 
ESTs 

hypothetical protein FLJ20093 
hypothetical protein dJ511E16.2 
minichromosome maintenance deficient (S. 
CGI-76 protein 

hypothetical protein FU23471 

uncharacterized hematopoietic stem/proge 

hypothetical protein 

nuclear receptor co-repressor 1 

serologically defined colon cancer antig 

ESTs, Weakly similar to T24396 hypotheti 

Homo sapiens mRNA; cDNA DKFZp434J1912 (f 

ATPase, Class I, type 8B, member 1 

hypothetical protein FU20989 

hypothetical protein MGC4308 

sterile-alpha motif and leucine zipper c 

hypothetical protein dJ462023.2 

Human norma! keratinocyte mRNA 

ESTs 

ESTs 

ESTs 

KIAA0903 protein 

Homo sapiens, clone MGC: 15887, mRNA, com 

CDC14 (cell division cycle 14, S. cerevi 

ESTs, Weakly similar to ALU INHUMAN ALU S 

serine/threonine kinase 15 

Homo sapiens cDNA: FLJ22120 fis, clone H 

ESTs 

nucleosome assembly protein 1-like 1 

ESTs 

ESTs 

ESTs 

fatty-acid-Coenzyme A ligase, long-chain 

ESTs 

ESTs 

ESTs 

Homo sapiens cDNA FLJ14643 fis, clone NT 
ESTs 

small fragment nuclease 
CGI-76 protein 
ESTs 

hypothetical protein MGC14139 
myelin protein zero-like 1 
hypothetical protein 

aldehyde dehydrogenase 5 family, member 
hypothetical protein FLJ 10881 
KIAA1842 protein 
KIAA1856 protein 

MADS box transcription enhancer factor 2 
hypothetical protein FLJ13910 
ESTs, Weakly similar to Unknown [Ksapie 
Target CAT 

nucleoporin-like protein 1 

ESTs 

ESTs 

chromosome 8 open reading frame 1 
anillin (Drosophila Scraps homolog), act 
hypothetical protein MGC10940 
ESTs 

ESTs, Weakly similar to ALU4_HUMAN ALU S 
myeloid/lymphoid or mixed-lineage leukem 
junctional adhesion molecule 2 
proteasome (prosome, macropain) subunit 
Homo sapiens mesenchymal stem cell prate 
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115819 


AA426573 


AA466620 


Hs.41135 


409124 


AA431418 


AW292809 Hs.50727 


115895 


AA436182 


AB033035 


Hs.51965 


458073 


AA437099 


AA192669 


Hs.45032 


115962 


AA446585 


AI636361 


Hs.179520 


115967 


AA446887 


AI745379 


Hs.42911 


115974 


AA447224 


BE513442 


Hs.238944 


115985 


AA447709 


M447709 


Hs.268115 


129254 


AA453624 


AA252468 


Hs.1098 


446730 


AA455044 


BE384932 


Hs.64313 


116095 


AA456045 


AA043429 


Hs.62618 


426856 


AA460454 


R19768 


Hs.172788 


116210 


AA476494 


BE622792 


Hs.1 72788 


116213 


AA476738 


M292105 


Hs.326740 


432645 


AA481422 


D14041 


Hs.347340 


116265 


AA482595 


BE297412 


Hs.55189 


129334 


AA485084 


AW157022 Hs.343551 


116274 


AA485431 


AI129767 


Hs.182874 


426002 


AA489638 


BE514376 


Hs.165998 


116331 


AA491000 


N41300 


Hs.71616 


116333 


AA491250 


AF1 55827 


Hs.203953 


132994 


AA505133 


AA1 12748 


Hs.279905 


418538 


AA598447 


BE244323 


Hs.85951 


116391 


AA599243 


T86558 


Hs.75113 


116394 


AA599574 


NMJH)6033Hs.65370 


134531 


AA600153 


AI742845 


Hs.1 10713 


116417 


AA609309 


AW499664 




116429 


AA609710 


AF191018 


Hs.279923 


116439 


AA610068 


AA251594 


Hs.43913 


116459 


AA621399 


R80137 


Hs.302738 


427505 


AA621752 


AA361562 


Hs.1 78761 


409633 


C21523 


AW449822 Hs.55200 


116541 


D12160 


D12160 


Hs.249212 


132557 


D19706 


AA1 14926 


Hs.1 69531 


414964 


D25801 


AA337548 


Hs.333402 


116571 


D45652 


D45652 


Hs.211604 


451522 


D60208 


BE565817 


Hs.26498 


421919 


D80504 


AJ224901 


Hs.109526 


116643 


F03010 


A1367044 


Hs.1 53638 


116661 


F04247 


R61504 




116715 


F10966 


AL1 17440 


Hs.170263 


116729 


F13700 


BE549407 


Hs.1 15823 


318709 


H05063 


R52576 


Hs.285280 


418999 


H16758 


NM_000121Hs.89548 


116773 


H17315 


AI823410 


Hs.343581 


116780 


H22566 


H22566 


Hs.63931 


453884 


H48459 


AA355925 


Hs.36232 


116819 


H53073 


H53073 


Hs.93698 


427278 


H56559 


AL031428 


Hs.1 74174 


407833 


H57957 


AW955632 Hs.66666 


116844 


H64938 


H64938 


Hs.337434 


116845 


H64973 


AA649530 


Hs.348148 


116892 


H69535 


AI573283 


Hs.38458 


116925 


H73110 


H73110 


Hs.260603 


116981 


H81783 


N29218 


Hs.40290 


453133 


H86259 


AC005757 


Hs.31809 


117031 


H88353 


H88353 


Hs.347265 


117034 


H88639 


U72209 




431129 


H88675 


^AL137751 


Hs.263671 


417861 


H93708 


AA334551 




117280 


N22107 


M18217 


Hs.172129 


117344 


N24046 


R19085 


Hs.210706 


117422 


N27028 


AI355562 


Hs.43880 


117475 


N30205 


N30205 


Hs.93740 


117487 


N30621 


N30621 


Hs.44203 


117937 


N33258 


AF044209 


Hs.144904 


130207 


N33258 


AF044209 


Hs.144904 


117549 


N33390 


N33390 


Hs.44483 


117683 


N40180 


N40180 




117710 


N45198 


N45198 


Hs.47248 


117791 


N48325 


N48325 


Hs.93956 


117822 


N48913 


AA706282 


Hs.93963 


422544 


N49394 


AB018259 


Hs.118140 


1 17895 


N50656 


AW450348 Hs.93996 


452259 


N50721 


AA317439 


Hs.28707 


133057 


N53143 


AA465131 


Hs.64001 



endomucin-2 

N-acetylglucosaminidase, alpha- {Sanfili 

KIAA1209 protein 

ESTs 

hypothetical protein MGC10702 
ESTs 

hypothetical protein FLJ10631 

ESTs, Weakly similar to T08599 probable 

DKFZp434J1813 protein 

ESTs, Weakly similar to AF257182 1 G-pro 

ESTs 

ALEX3 protein 
ALEX3 protein 

hypothetical protein MGC10947 

H-2K binding factor-2 

hypothetical protein 

hypothetical protein FU22584 

guanine nucleotide binding protein (G pr 

PAI-1 mRNA-binding protein 

Homo sapiens mRNA; cDNA DKFZp586N1720 (f 

hypothetical protein FU10339 

clone HQ0310PRO0310p1 

exportin, tRNA (nuclear export receptor 

general transcription factor I1IA 

lipase, endothelial 

DEK oncogene (DNA binding) 

Human clone 23826 mRNA sequence 

putative nucleotide binding protein, est 

PIBF1 gene product 

Homo sapiens cDNA: FU21425 fis, done C 
26S proteasome-associated pad1 homotog 
ESTs 

polymerase (RNA) 111 (DNA directed) (155 
ESTs 

hypothetical protein MGC12760 
gb:HUMGS02848 Human adult lung 3' direct 
hypothetical protein FLJ21657 
zinc finger protein 198 
myeloid/lymphoid or mixed-lineage leukem 
gb:yh16a03.s1 Soares infant brain 1NIB H 
tumor protein p53-binding protein, 1 
ribonuclease P, 40kD subunit 
Homo sapiens cDNA: FLJ22096 fis, clone H 
erythropoietin receptor 
karyopherin alpha 1 (importin alpha 5) 
ESTs 

KIAA0186 gene product 
EST 

KIAA0601 protein 

ESTs, Weakly similar to S19560 proline-r 
ESTs, Weakly similar to A46010 X-linked 
gb:ns44f05.s1 NCI_CGAPJMv1 Homo sapiens 
ESTs 

ESTs, Moderately similar to A47582 B-cel 
ESTs 

hypothetical protein 

gb:yw21a02.s1 Morton Fetal Cochlea Homo 

YY1 -associated factor 2 

Homo sapiens mRNA; cDNA DKFZp434l0812 (f 

sperm specific antigen 2 

Homo sapiens cDNA: FLJ21409 fis, clone C 

Homo sapiens cDNA FU 13182 fis, clone NT 

ESTs, Weakly similar to A46010 X-linked 

ESTs, Weakly similar to I38022 hypotheti 

ESTs 

nuclear receptor co-repressor 1 
nuclear receptor co-repressor 1 
EST 

gb;yy44d02.s1 Soares_mulfiple_sc[erosis_ 
ESTs, Highly similar to similar to Cdc14 
EST 
ESTs 

KIAA0716 gene product 
ESTs. Highly similar to SORL.HUMAN SORTI 
signal sequence receptor, gamma (translo 
Homo sapiens clone 25218 mRNA sequence 
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118103 


N55326 


AA401733 


Hs.184134 


ESTs 


118111 


N55493 


N55493 




gb:yv50c02.s1 Soares fetal liver spleen 


118129 


N57493 


N57493 




gb:yy54cQ8.s1 Soares_mu ltipIe_sderosis_ 


118278 


N62955 


N62955 


Hs.316433 


Homo sapiens cDNA FLJ11375 fis, clone HE 


118329 


N63520 


N63520 




gb:yy62f01 ,s1 Soares_multiple_sclerosis_ 


118336 


N63604 


BE327311 


Hs.47166 


HT021 


417098 


N64166 


AB017365 


Hs.173859 


frizzled (Drosophila) homolog 7 


118363 


N64168 


AI183838 


Hs.48938 


hypothetical protein FLJ21802 


118364 


N64191 


N46114 


Hs.29169 


hypothetical protein FU22623 


118475 


N66845 


N66845 




gb:za46c11.s1 Soares fetal liver spleen 


118491 


N67135 


AV647908 


Hs.90424 


Homo sapiens cDNA: FLJ23285 fis, clone H 


118500 


N67295 


W32889 


Hs.154329 


ESTs 


118584 


N68963 


AW136928 




gb:UI-H-BI1-adp-d-O&0-Ul.s1 NCI_CGAP_Su 


456647 


N69331 


AI252640 


Hs.1 10364 


peptidylprolyl isomerase C (cyclophilin 


118661 


N70777 


AL137554 


Hs.49927 


protein kinase NYD-SP15 


118684 


N71364 


N71313 


Hs.163986 


Homo sapiens cDNA: FLJ22765 fis, clone K 


118689 


N71545 


AW390601 


Hs.184544 


Homo sapiens, clone IMAGE:3355383, mRNA, 


118690 


N71571 


N71571 


Hs.269142 


ESTs 


118766 


N74456 


N74456 


Hs.50499 


EST 


118793 


N75594 


N75594 


Hs.285921 


ESTs, Moderately similar to T47135 hypot 


118817 


N79035 


AI668658 


Hs.50797 


ESTs 


118844 


N80279 


AL035364 


Hs.50891 


hypothetical protein 


118919 


N91797 


AW452696 Hs.130760 


myosin phosphatase, target subunit 2 


129558 


N92454 


AW580922 Hs.180446 


karyopherin (impora'n) beta 1 


407604 


N94581 - 


AW191962 Hs.288061 


collagen, type VIII, alpha 2 


118996 


N94746 


N94746 


Hs.274248 


hypothetical protein FLJ20758 


119021 


N98238 


N98238 


Hs.55185 


ESTs 


119039 


R02384 


AI160570 


Hs.252097 


pregnancy specific beta-1 -glycoprotein 6 


119063 


R16833 


R16833 


Hs.53106 


ESTs, Moderately similar to ALU1.HUMAN A 


332622 


R41828 


R10674 




CSR1 protein 


119111 


R43203 


T02855 


Hs.328321 


EST 


415115 


R46395 • 


AA214228 


Hs.127751 


hypothetical protein 


119146 


R58863 


R58863 


Hs.91815 


ESTs 


449224 


R78248 


AW995911 Hs.299883 


hypothetical protein FLJ23399 


119239 


T11483 


T11483 




gb:CHR90049 Chromosome 9 exon Homo sapie 


119281 


T16896 


AI692322 


Hs.65373 


ESTs, Weakly similar to T02345 hypotheti 


119298 


T23820 


NM_001241Hs.155478 


cyclin T2 


126502 


T30222 


T10077 


Hs.13453 


hypothetical protein FLJ14753 


419983 


W15275 


W55956 . 


Hs.94030 


Homo sapiens mRNA; cDNA DKFZp586E1624 (f 


119558 


W38194 


W38194 




Empirically selected from AFFX single pr 


42S641 


W42414 


AW081883 Hs.211578 


Homo sapiens cDNA; FLJ23037 fis, clone L 


419445 


W49632 


AA884471 


Hs.90449 


Human clone 23908 mRNA sequence 


119650 


W57613 


RS2342 


HSJ9856 


ESTs, Weakly similar to S65657 alpha-1C- 


119654 


W57759 


W57759 




gb:zd20g11.s1 Soares fetal hearLNbHH19W 


119683 


W61118 


W65379 


Hs.57835 


ESTs 


119694 


W65344 


AA041350 


Hs.57847 


ESTs, Moderately similar to ICE4_HUMAN C 


119718 


W69216 


W69216 


Hs.92848 


ESTs 


410365 


W69379 


AI287518 




Homo sapiens mRNA; cDNA DKFZp586D0923 (f 


119938 


W86728 


AW014862 Hs.58885 


ESTs 


120128 


238499 


BE379320 


Hs.91448 


MKP-1 like protein tyrosine phosphatase 


120130 


Z38630 


AA045767 


Hs.5300 


bladder cancer associated protein 


120148 


Z39494 


F02806 


Hs.65765 


ESTs 


120155 


Z39623 


Z39623 


Hs.65783 


ESTs 


451979 


Z40071 


F06972 


Hs.27372 


BMX non-receptor tyrosine kinase 


120183 


Z40174 


AW082866 Hs.65882 


ESTs 


120184 


Z40182 


Z40182 


Hs.65885 


EST 


120211 


Z40904 


Z40904 


Hs.66012 


EST 


120245 


AA166965 


AW959615 Hs.111045 


ESTs 


120247 


AA167500 


AA1 67500 


Hs.103939 


EST 


120254 


AA1 69599 


W90403 


Hs.111054 


ESTs 


120259 


AA171724 


AW014786 Hs. 192742 


hypothetical protein FLJ 12785 


120260 


AA171739 


AK000061 


Hs.101590 


hypothetical protein 


120275 


AA177105 


AA177105 


Hs.78457 


solute carrier family 25 (mitochondrial 


120284 


AA182626 


AA179656 




gb:zp54e11.s1 Stratagene NT2 neuronal pr 


417735 


AA186324 


AA188175 


Hs.82506 


KIAA1254 protein 


422137 


AA192099 


AJ236885 




zinc finger protein 148 (pHZ-52) 


120302 


AA192173 


AA837098 


Hs.269933 


ESTs 


120303 


AA192415 


AI216292 


Hs.96184 


ESTs 


120305 


M192553 


AW295096 Hs.101337 


uncoupling protein 3 (mitochondrial, pro 


120319 


AA194851 


T57776 


Hs.191094 


ESTs 


408729 


AA1 95520 


AA1 95764 


HS72639 


ESTs 


120326 


AA1 96300 


AA196300 


Hs.21145 


hypothetical protein RG083M05.2 


133145 


AA1 96549 


H94227 


Hs.6592 


Homo sapiens, clone IMAGE:2961368, mRNA, 


120327 


AA196721 


AK000292 


Hs.130732 


hypothetical protein FLJ20285 


120328 


AA196979 


AA923278 


Hs.290905 


ESTs, Weakly similar to protease [Ksapi 


120340 


AA206828 


AA206828 




gb:zq80b08.s1 Stratagene hNT neuron (937 
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417122 


M207123 


AI906291 


Hs.81234 


131522 


AA214539 


AI380040 


Hs.239489 


421787 


AA226914 


AA227068 


Hs.108301 


120375 


AA227260 


AF028706 


Hs.1 11227 


120376 


AA227469 


AA227469 




120390 


AA233122 


AA837093 


Hs.1 11460 


410804 


AA233334 


U64820 


Hs.66521 


434223 


AA233347 


AI825842 


Hs.3776 


312771 


AA233714 


AA018515 


Hs.264482 


120396 


AA233796 


AA134006 


Hs.79305 


120409 


AA235050 


AA235050 




120414 


AA235704 


AW137156 Hs.181202 


120420 


AA236031 


AI128114 


Hs.1 12885 


120422 


AA236352 


AL133097 


Hs.301717 


419326 


AA236390 


W94915 


Hs.42419 


120423 


AA236453 


AA236453 


Hs.1 8978 


120435 


AA243370 


AA243370 


Hs.96450 


120453 


AA250947 


AA250947 


Hs.170263 


120455 


M251083 


AA251720 


Hs.104347 


120456 


AA251113 


AA488750 


Hs.88414 


120473 


AA251973 


M251973 


Hs.269988 


128922 


AA252023 


AI244901 


Hs.9589 


120477 


AA252414 


AA252414 


Hs.43141 


120479 


AA252650 


AF006689 


Hs.110299 


120488 


AA255523 


AW952916 Hs.63510 


120510 


AA258128 


AI796395 


Hs.1 11377 


120527 


AA262105 


AA262105 


Hs.4094 


120528 


AA262107 


AI923511 


Hs.104413 


120529 


AA262235 


AI434823 


Hs.104415 


120541 


AA278298 


W07318 


Hs.240 


120544 


AA278721 


BE548277 


Hs.103104 


120562 


AA280036 


BE244580 


Hs.342307 


120569 


AA280648 


AA807544 


Hs.24970 


120571 


AA280738 


AB037744 


Hs.34892 


'120572 


AA280794 


H39599 


Hs.294008 


129434 


AA280837 


AW967495 Hs.186644 


130529 


AA280886 


AA178953 


Hs.309648 


120575 


AA280934 


AW978022 Hs.238911 


409339 


AA281535 


AB020686 


Hs.54037 


120591 


AA281797 


AF078847 


Hs.191356 


120593 


AA282047 


AA748355 


Hs.193522 


430275 


AA283002 


Z11773 


Hs.237786 


440303 


AA283709 


AA306166 


Hs.7145 


120609 


AA283902 


AW978721 Hs.266076 


409702 


AA284108 


AI752244 




456870 


AA284109 


AI241084 


Hs.154353 


132614 


AA284371 


AA284371 


Hs.1 18064 


458750 


AA284744 


AA1 15496 


Hs.336898 


135376 


AA284784 


BE617856 


Hs.99756 


120621 


AA284840 


AW961294 Hs.143818 


452279 


AA286844 


AA286844 


Hs.61260 


332484 


AA287032 


AW172431 


Hs.13012 


120644 


AA287038 


A1869129 


Hs.96616 


120660 


AA287546 


AA286785 


Hs.99677 


135370 


AA287553 


BE622187 


Hs.99670 


120661 


AA287556 


AA287556 


Hs.263412 


429828 


AA287564 


AB019494 


Hs.225767 


452291 


AA291015 


AF015592 


Hs.28853 


120699 


AA291716 


AI683243 


Hs.97258 


100690 


AA291749 


AA383256 


Hs.1657 


120726 


AA293656 


AA293655 


Hs.21198 


120737 


AA302430 


AL049176 


Hs.82223 


120745 


AA302809 


AA302809 




443574 


AA302820 


U83993 


Hs.321709 


120750 


AA310499 


AI191410 


Hs.96693 


120761 


AA321890 


AA321890 




120768 


AA340589 


AA340589 


Hs.104560 


120769 


AA340622 


AI769467 


Hs.9475 


135232 


AA342457 


AL038812 


Hs.96800 


120793 


AA342864 


AA342864 


Hs.96812 


120796 


AA342973 


AI247356 


Hs.96820 


120809 


AA346495 


AA346495 




332633 


AA347573 


AL1 20071 


Hs.48998 


120825 


AA347614 


AI280215 


Hs.96885 


120827 


AA347717 


AA382525 


Hs.132967 


120839 


AA348913 


AA348913 





immunoglobulin superfamily, member 3 
TIA1 cytotoxic granule-associated RNA-bi 
nuclear receptor subfamily 2, group C, m 
Zic family member 3 (odd-paired Drosophi 
gb:zr18a07.s1 Stratagene NT2 neuronal pr 
caldum/calmoduHn-dependent protein kin 
Machado-Joseph disease (spinocerebellar 
zinc finger protein 216 

Homo sapiens mRNA; cDNA DKFZp761A0411 (f 
eukaryotic translation initiation factor 
gb:zs38e04.s1 Soares_NhHMPu__S1 Homo sap! 
hypothetical protein FLJ10038 
spinal cord-derived growth factor-B 
hypothetical protein DKFZp434N1928 
ESTs 

Homo sapiens cDNA: FLJ22822 fis, clone K 
EST 

tumor protein p53-binding protein, 1 
ESTs, Weakly similar to ALUC^HUMAN !!!! 
BTB and CNC homology 1, basic leucine zi 
ESTs 
ubiquilin 1 

DKFZP727C091 protein 
mitogen-activated protein kinase kinase 
KIAA0141 gene product 
ESTs 

Homo sapiens cDNA FU 14208 fis, done NT 

ESTs 

ESTs 

M-phase phosphoprotein 1 
ESTs 

hypothetical protein RJ10330 

ESTs, Weakly similar to B34323 GTP-bindi 

KIAA1323 protein 

ESTs 

ESTs 

gb:zp39e03.s1 Stratagene muscle 937209 H 
hypothetical protein DKFZp762E1511; KIAA 
ectonucleotide pyrophosphatase/phosphodi 
general transcription factor IIH, pofype 
ESTs 

zinc finger protein 187 
calpain 7 

ESTs, Weakly similar to A46010 X-Iinked 

eukaryotic translation elongation factor 

nonselective sodium potassium/proton exc 

similar to rat nuclear ubiquitous casein 

Homo sapiens, Similar to RIKEN cDNA 1810 

mitochondrial ribosome recycling factor 

hypothetical protein FLJ23459 

hypothetical protein FLJ13164 

ESTs 

ESTs 

ESTs 

ESTs, Weakly similar to I38022 hypotheti 
ESTs, Weakly similar to ALUB_HUMAN III! 
IDN3 protein 

CDC7 (cell division cycle 7, S. cerevisi 
ESTs, Moderately similar to S29539 ribos 
estrogen receptor 1 
ESTs 

chordin-like 

gb:EST10426 Adipose tissue, white I Homo 
purinergic receptor P2X, ligand-gated io 
ESTs, Moderately similar to 2109260A B c 
branched chain keto acid dehydrogenase E 
EST 
ESTs 

ESTs, Moderately similar to ALU7.HUMAN A 

ESTs 

ESTs 

gb:EST52657 Fetal heart II Homo sapiens 
fibronectin leucine rich transmembrane p 
ESTs 

Human EST clone 122887 mariner transposo 
gb:EST55442 Infant adrenal gland II Homo 
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120850 


AA349647 


AA349647 


Hs.96927 


Homo sapiens cDNA FU12573 fis, clone NT 


120852 


AA349773 


AA349773 


Hs.191564 


ESTs 


128852 


AA350541 


R40622 


Hs.1 06601 


ESTs 


135240 


AA357159 


AA357159 


Hs.96986 


EST 


120870 


AA357172 


AA357172 


Hs.292581 


ESTs, Moderately similar to ALU1 JHUMAN A 


120894 


AA370132 


AA370132 


Hs.97063 


ESTs 


435737 


AA370472 


AF229839 


Hs.173202 


l-kappa-B-interacting Ras-like protein 1 


120897 


AA370867 


AA370867 


Hs.97079 


ESTs, Moderately similar to AF174605 1 F 


120915 


AA377296 


AL1 35556 


Hs.97104 


ESTs 


120935 


AA383902 


AL048409 


Hs.97177 


ESTs, Weakly similar to ALU1JHUMAN ALU S 


120936 


AA385934 


AA385934 


Hs.97184 


EST, Highly similar to (defline not avai 


120937 


AA386255 


AA386255 


Hs.97186 


EST 


120938 


AA386260 


AA386260 


Hs.104632 


EST 


417632 


AA386266 


R20855 


Hs.5422 


glycoprotein M6B 


120960 


AA398014 


AA398014 


Hs.104684 


EST 


120985 


AA398222 


AI219896 


Hs.97592 


ESTs 


120988 


AA398235 


AA398235 


Hs.97631 


ESTs 


121008 


AA398348 


AA398348 


Hs.130546 


Human DNA sequence from clone RP11-251J8 


121029 


AA398482 


AA398482 


Hs.97641 


EST 


121032 


AA398504 


AA393037 


Hs.161798 


ESTs 


121033 


AA398505 


AA398505 


Hs.97360 


ESTs 


121034 


AA398507 


AL389951 


Hs.271623 


nucleoporin 50kD 


121035 


AA398523 


AA398523 


Hs.210579 


ESTs 


121058 


AA398625 


AA398625 


Hs.97391 


ESTs 


121060 


AA398632 


AA398632 


Hs.97395 


ESTs 


121061 


AA398633 


AA393288 


Hs.97396 


ESTs 


121091 


AA398894 


AA398894 


Hs.97657 


ESTs, Moderately similar to ALU8_HUMAN A 


121092 


AA398895 


AA398895 


Hs.97658 


EST 


121094 


AA398900 


AA402505 




gb:zt62h10.rl SoaresJestis.NHT Homo sap 


121096 


AA398904 


AA398904 


Hs.332690 


ESTs 


121115 


AA399122 


AA398187 


Hs.104682 


ESTs, Weakly similar to mitochondrial ci 


121121 


AA399371 


AA399371 


Hs.189095 


similar to SALL1 (sal (Drosophila)-like 


121122 


AA399373 


AI126713 


Hs.192233 


ESTs, Highly similar to T00337 hypotheti 


121125 


AA399441 


AL042981 


Hs.251278 


KIAA1201 protein 


121151 


AA399636 


AA399636 


Hs.143629 


ESTs 


121153 


AA399640 


AA399640 


Hs.97694 


ESTs 


121163 


AA399680 


AI676062 


Hs.111902 


ESTs 


121176 


AA400080 


AL121523 


Hs.97774 


ESTs 


121192 


AA400262 


AA400262 


Hs.190093 


ESTs 


121223 


AA400725 


AI002110 


Hs.97169 


ESTs, Weakly similar to dJ667H1 2.2. 1 [H. 


121227 


AA400748 


AA400748 


Hs.97823 


Homo sapiens mRNA; cDNA DKFZp434D024 (fr 


121231 


AA400780 


AA814948 


Hs.96343 


ESTs, Weakly similar to ALUC.HUMAN !!!! 


121278 


AA401631 


AA037121 


Hs.98518 


Homo sapiens cDNA FU11490 fis, clone HE 


121279 


AA401688 


AA292873 


Hs.177996 


ESTs 


121282 


AA401695 


AA401695 


Hs.97334 


ESTs 


121299 


AA402227 


AA402227 


Hs.22826 


tropomodulin 3 (ubiquitous) 


121301 


AA402329 


NM_006202Hs.89901 


phosphodiesterase 4A, cAMP-specific (dun 


121302 


AA402398 


AA402587 


Hs.325520 


LAT1-3TM protein 


121304 


AA402449 


AA293863 


Hs.97316 


EST 


121305 


AA402468 


AA402468 


Hs.291557 


ESTs 


134721 


AA403268 


AK000112 


Hs.89306 


hypothetical protein FU20105 


121323 


AA403314 


AA291411 


Hs.97247 


ESTs 


121324 


AA404229 


AA404229 


Hs.97842 


EST 


444422 


AA404260 


AI768623 


Hs. 108264 


ESTs 


131074 


AA404271 


U16125 


Hs.181581 


glutamate receptor, ionotropic, kainate 


121344 


AA405026 


AA405026 


Hs.193754 


ESTs 


121348 


AA405182 


AA405182 


Hs.97973 


ESTs 


121350 


AA405237 


AA405237 




gb:zt06e10.s1 NCLCGAP_GCB1 Homo sapiens 


121400 


AA406061 


AA406061 


Hs.98001 


EST 


121402 


AA406063 


AA406063 


Hs.98003 


ESTs 


121403 


AA406O7O 


AA406070 


Hs.98004 


EST 


121408 


AA406137 


AA406137 


Hs.98019 


EST 


121431 


AA406335 


AA035279 


Hs.176731 


ESTs 


121471 


AA411804 


AA411804 


Hs.261575 


ESTs 


121474 


AA411833 


AA402335 


Hs.188760 


ESTs, Highly similarto Trad [H.sapiens] 


121526 


AA412219 


AW665325 


Hs.98120 


ESTs 


121530 


AA412259 


AA778658 


Hs.98122 


ESTs 


121558 


AA412497 


AA412497 




gb:zt95g12.s1 Soares testis_NHT Homo sap 


121559 


AA412498 


AI192044 


Hs.104778 


ESTs 


121584 


AA416586 


AI024471 


Hs.98232 


ESTs 


121609 


AA416867 


AA416867 


Hs.98185 


EST 


121612 


AA416874 


AA416874 


Hs.98168 


ESTs 


121737 


AA421133 


AA421133 


Hs.104671 


erythrocyte transmembrane protein 


121740 


AA421138 


AA421138 


Hs.143835 


EST 


436032 


AA422079 


AA150797 


Hs.109276 


latexin protein 


121784 


AA423837 


T90789 


Hs.94308 


RAB35, member RAS oncogene family 
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121802 AA424328 

121803 AA424339 
135286 AA424469 
332778 AA424469 
121806 AA424502 
129517 AA425004 
121845 AA425734 
121853 AA425887 
121891 AA426456 
121895 AA427396 
121899 AA427555 

121917 AA428218 

121918 AA428242 

121919 AA428281 

121941 AA428865 

121942 AA428994 
121970 AA429666 
121993 AA430181 
418706 AA430184 
122022 AA431293 

122050 AA431478 

122051 AA431492 
122055 AA431732 
122105 AA432278 

• 122125 AA434411 
135235 AA435512 
122162 AA435698 
422072 AA435711 
415106 AA435815 
122186 AA435842 
122235 AA436475 
412970 AA436489 
419288 AA442060 
122310 AA442079 
122334 AA443151 
122382 AA446133 
122425 AA447145 
122431 AA447398 
122450 AA447643 
426284 M447742 
122477 AA448226 
122500 AA448825 
122522 AA449444 
122536 AA450087 
122538 AA450211 
122540 AA450244 
122560 M452123 
421919 AA452155 
122562 AA452156 
122585 AA453036 
122608 AA453526 

122635 AA454085 

122636 AA454103 
122653 AA454642 
122660 AA454935 
122703 AA456323 
122724 AA457395 
122749 AA458850 
122772 AA459662 
430242 AA459668 
429838 AA459679 
122777 AA459702 
135362 AA460017 
122798 AA460324 
122837 AA461509 

122860 M464414 

122861 AA464428 
122910 AA470084 
132899 AA476606 
122967 AA478521 
422845 AA478523 
123009 AA479949 
128917 AA481252 
123081 AA485351 
123133 AA487264 
123184 AA489072 



AI251870 Hs,188898 
AI338371 Hs.157173 
AW023482 Hs.97849 
AW023482 Hs.97849 
AA424313 Hs.98402 
AW972853 Hs.1 12237 
AI732692 Hs.165066 
AA425887 Hs.98502 
AA426456 Hs.98469 
AA427396 

R55341 Hs.50421 
AA406397 Hs.139425 
BE274689 Hs.184175 
AA428281 Hs.98560 
M428865 Hs.98563 
AW452701 Hs.293237 
AA429666 Hs.98617 
AW297880 Hs.98661 
U73524 Hs.87465 
M431293 Hs,98716 
AI453076 

AA431492 Hs.98742 
AA431732 Hs.98747 
AW241685 Hs.98699 
AK000492 Hs.98806 
AW298244 Hs.266195 
M628233 Hs.79946 
AB018255 Hs.111138 
U40763 Hs.77965 
AA398811 Hs.104673 
AA436475 Hs.1 12227 
AB026436 Hs.177534 
AA256106 Hs.87507 
AW192803 Hs.98974 
BE465894 Hs.98365 
M446440 Hs.98643 
AB007859 Hs.100955 
AA447398 Hs.99104 
AA447643 Hs.112095 
AJ404468 Hs.284259 
AA448226 Hs.324123 
AA448825 Hs.99190 
AA299607 Hs.98969 
AF060877 Hs.99236 
AA450211 Hs.99239 
AA476741 Hs.98279 
AW392342 Hs.283077 
AJ224901 Hs.109526 
M452156 

AI681654 Hs.170737 
AA453525 Hs.1 43077 
AA454085 

AW651706 Hs.99519 
AW009166 Hs.99376 
AI816827 Hs.180069 
AA456323 Hs.269369 
AA457395 Hs.99457 
AA458850 Hs.293372 
AW1 17452 Hs.99489 
U66669 Hs.236642 
AW904907 Hs.30732 
AK001022 Hs.214397 
AA978128 Hs.99513 
AW366286 Hs.145696 
AA461509 Hs.293565 
AA464414 

AA335721 Hs.213628 
AA470084 Hs.98358 
AA476606 Hs.59666 
M806187 Hs.289101 
AA317841 Hs.7845 
AA535244 Hs.78305 
AI365215 Hs.206097 
AI815486. Hs.243901 
AA487264 Hs.154974 
BE247767 Hs.18166 



ESTs 
ESTs 
ESTs 
ESTs 
ESTs 
ESTs 

ESTs, Moderately similar to ALU2.HUMAN A 

hypothetical protein FLJ14303 

ESTs 

gb:zw33a02.s1 Soares ovary tumor NbHOT H 

KIAA0203 gene product 

ESTs 

chromosome 2 open reading frame 3 

EST 

ESTs 

ESTs 

EST 

ESTs 

ATP/GTP-binding protein 

ESTs, Moderately similar to T42650 hypot 

ELAV (embryonic lethal, abnormal vision, 

EST 

EST 

ESTs 

hypothetical protein 
ESTs 

cytochrome P450, subfamily XIX (aromatiz 
KIAA0712 gene product 
peptidyl-prolyl isomerase G (cyclophtlin 
ESTs 

membrane-associated nucleic add binding 
dual specificity phosphatase 10 
ESTs 

ESTs, Weakly similar to S65824 reverse t 
ESTs, Weakly similar to LB4D_HUMAN NADP- 
ESTs 

KIAA0399 protein 
ESTs 

hypothetical protein DKFZp434F1819 

dynein, axonemal, heavy polypeptide 9 

ESTs 

ESTs 

ESTs 

regulator of G-protein signalling 20 
ESTs 

ESTs, Weakly similar to A43932 mucin 2 p 
centrosomal P4.1 -associated protein; unc 
zinc finger protein 198 

gb:zx29c03.s1 Soares_totaL_fetus_Nb2HF8_ 

hypothetical protein FLJ23251 

ESTs 

gb:zx33a08.s 1 Soares_totaLfetus_Nb2H F8_ 

hypothetical protein FLJ14007 

ESTs 

nuclear respiratory factor 1 

ESTs 

ESTs 

ESTs, Weakly similar to 834087 hypotheti 
ESTs. 

3-hydroxyisobutyryl-Coenzyme A hydrolase 

hypothetical protein FU13409; KIAA1711 

hypothetical protein FU10160 similar to 

ESTs, Weakly similar to T17454 diaphanou 

splicing factor (CC1.3) 

ESTs, Weakly similar to putative p150 [H 

gb:zx78g01 ,s1 Soares ovary tumor NbHOT H 

ESTs 

ESTs 

SMAD in the anlisense orientation 
glucose regulated protein, 58kD 
hypothetical protein MGC2752 
RAB2, member RAS oncogene family 
oncogene TC21 

Homo sapiens cDNA FLJ20738 fis, clone HE 
Homo sapiens mRNA; cONA DKFZp667N064 (fr 
KIAA0870 protein 
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332467 


AA489630 


NM_O147OOHs,119004 


123233 


AA490225 


AW974175 Hs.151875 


123234 


AA490227 


NM_001938Hs.16697 


123236 


AA490255 


AW968504 Hs.123073 


123255 


AA490890 


AA830335 


Hs.105273 


430015 


AA490916 


AW768399 Hs.106357 


448892 


AA490925 


AF084535 


Hs.22464 


123259 


AA490955 


AI744152 


Hs.283374 


123284 


AA495812 


AA488988 


Hs.293796 


123286 


AA495824 


AA495824 


Hs.188822 


123315 


AA496369 


AA496369 




457397 


AA504125 


AW969025 Hs.109154 


433049 


AA521473 


AU076668 


Hs.334884 


123421 


AA598440 


AA598440 


Hs.291154 


123449 


AA598899 


AL049325 


Hs.1 12493 


426981 


AA599244 


AL044675 


Hs.1 73081 


409986 


AA599694 


NMJ)14777Hs.57730 


123497 


AA600037 


AA765256 


Hs.135191 


123604 


AA609135 


AA609135 


Hs.293076 


123712 


AA609684 


AA609684 




123731 


AA609839 


AA609839 


Hs.334437 


123800 


AA620423 


AA620423 


Hs.1 12862 


123841 


AA620747 


AA620747 


Hs.1 12896 


123929 


AA621364 


AA621364 


Hs.1 12981 


123978 


C20653 


T89832 


Hs.170278 


133184 


D200B5 


AA001021 


Hs.6685 


132835 


D20749 


Z83844 


Hs.5790 


435147 


D51285 


AL1 33731 


Hs.4774 


128695 


D59972 


NM_003478Hs.101299 


124029 


F04112 


F04112 


Hs.312553 


124057 


F13604 


AA902384 


Hs.73853 


449316 


H01662 


AI609045 


Hs.321775 


130973 


H05135 


AI638418 


Hs.1440 


124106 


H12245 


H12245 




124136 


H22842 


H22842 


Hs.101770 


124165 


H30894 


H30039 


Hs.1 07674 


429627 


H43442 


NM_015340Hs.2450 


124178 


H45996 


BE463721 


Hs.97101 


129948 


H69281 


AI537162 


Hs.263988 


452114 


H69485 


N22687 


Hs.8236 


12440826254 


H69899 


H69899 


129056 


H70627 


AI769958 


Hs.108336 


427580 


H73260 


AK001507 


Hs.44143 


426793 


H77531 


X89887 


Hs.1 72350 


124274 


H80552 


H80552 


Hs.102249 


129078 


H80737 


AI351010 


Hs.102267 


457658 


H93412 


AW952124 Hs. 13094 


124315 


H94892 


NMJJ05402HS.288757 


437712 


H95643 


X04588 


Hs.85844 


124324 


H96552 


H96552 


Hs.159472 


452933 


H97146 


AW391423 


Hs.288555 


132231 


H99131 


AA662910 


Hs.42635 


421877 


H99462 


AW250380 


Hs.109059 


443123 


H99837 


AA094538 


Hs.272808 


132963 


N22140 


AA099693 


Hs.34851 


420473 


N22197 


AL1 18782 


Hs.300208 


417381 


N23756 


AF164142 


Hs.82042 


130365 


N24134 


W56119 


Hs.155103 


456610 


N24195 


AF172066 


Hs.106346 


439311 


N26739 


BE270668 


Hs.151945 


124383 


N27098 


N27098 


Hs.102463 


124387 


N27637 


N27637 


Hs.109019 


129341 


N33090 


AI193519 


Hs.226396 


419793 


N35967 


AI364933 


Hs.168913 


124433 


N39069 


AA280319 


Hs.268840 


124441 


N46441 


AW450481 


Hs.161333 


132338 


N48270 


AA353868 


Hs.182982 


436575 


N48365 


AI473114 




124466 


N51316 


R10084 


Hs.113319 


408048 


N51499 


NMJJ07203HS.42322 


124483 


N53976 


A1821780 


Hs.179864 


124484 


N54157 


H66118 


Hs.285520 


124485 


N54300 


AB040933 


Hs.15420 


124494 


N54831 


N54831 


Hs.271381 


129200 


N59849 


N59849 


Hs.13565 


124527 


N62132 


N79264 


Hs.269104 



KIAA0665 gene product 

ESTs, Weakly similar to MAPB_HUMAN MICRO 

down-regulator of transcription 1,TBP-b 

CDC2-related protein kinase 7 

ESTs 

ESTs 

epilepsy, progressive myoclonus type 2, 
ESTs, Weakly similar to CA15 JWMAN COLLA 
ESTs 

ESTs, Weakly similar to A46010 X-linked 
gb:zv37d10.s1 Soares ovary tumor NbHOT H 
ESTs 

SEC10(S.cerevlsiae)-like1 

EST, Weakly similar to I38022 hypothetic 

Homo sapiens mRNA; cDNA DKFZp564D036 (fr 

KIAA0530 protein 

KIAA0133 gene product 

ESTs, Weakly similar to unnamed protein 

ESTs 

Homo sapiens cDNA: FU21543 fis, clone C 

gb:ae62f01.s1 Stratagene lung carcinoma 

EST 

ESTs 

ESTs 

ESTs 

thyroid hormone receptor interactor 8 
hypothetical protein dJ37E16.5 
Homo sapiens mRNA; cDNA DKFZp761C1712 (f 
cutlin 5 

gb:HSC2JH062 normalized infant brain cDN 
bone morphogenetic protein 2 
hypothetical protein DKFZp434D1428 
DEAD/H (Asp-Glu-Ala-Asp/His) box polypep 
gb:ym17a12.r1 Soares infant brain 1NIB H 
EST 
ESTs 

leucyl-tRNA synthetase, mitochondrial 
putative G protein-coupled receptor 
ESTs 
ESTs 

gb:yu70c12.s1 Weizmann Olfactory Epithel 
ESTs, Weakly similar to ALUEJHUMAN !!!! 
Homo sapiens clone FLB6914 PR01821 mRNA, 
HIR (histone cell cycle regulation defec 
EST 

lysosomal 

preseniiins associated rhomboid-like pro 
v-ra! simian leukemia viral oncogene horn 
neurotrophic tyrosine kinase, receptor, 
Homo sapiens cDNA: FLJ22224 fis, clone H 
Homo sapiens cDNA: FU22425 fis, clone H 
hypothetical protein DKFZp434K2435 
mitochondrial ribosomal protein L12 
putative transcription regulation nuclea 
epsilon-tubulin 

Sec23-interacting protein p125 
solute carrier family 23 (nucleobase tra 
eukaryotic translation initiation factor 
retinoic acid repressive protein 
mitochondrial ribosomal protein L43 
EST 
ESTs 

hypothetical protein FLJ11126 

serine/threonine kinase 24 (Ste20, yeast 

PR01575 protein 

ESTs 

goIgin-67 

ESTs 

kinesin heavy chain member 2 
A kinase (PRKA) anchor protein 2 
ESTs 

ESTs, Weakly similar to 2109260A B cell 
KIAA1500 protein 

ESTs, Weakly similar to I38022 hypotheti 
Sam68-like phosphotyrosine protein, T-ST 
ESTs 
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124532 N62375 
133213 N63138 
124539 N63172 
129196 N63787 
5 124575 N68168 

124576 N68201 

124577 N68300 

124578 N68321 
124593 N69575 

10 128501 N75007 
332434 N75542 
128473 N90066 
128639 N91246 
124652 N92751 

15 133137 N93214 
124671 N99148 
133054 R07876 
425266 R10865 
124720 R11056 

20 124722 R11488 
128944 R23930 
132965 R26589 
426504 R37588 
438828 R37613 

25 124757 R38398 
124762 R39179 
124773 R40923 
135266 R41179 
427961 R41294 

30 414303 R42307 
128540 R43189 
124785 R43306 

124792 R44357 

124793 R44519 
35 124799 R45088 

124812 R47948 
124821 R51524 
424123 R54950 
124835 R55241 

40 124845 R59585 
124847 R60044 
440630 R60872 
124861 R66690 
332503 R67266 

45 124879 R73588 
124892 R79403 
124906 R87647 
124922 R93622 
124940 R99599 

50 124941 R99612 
124943 T02888 
124947 T03170 
124954 T10465 
456862 T15418 

5 5 410653 T15597 
418133 T15652 
440014 T16898 
131082 T26644 
124980 T40841 

60 124984 T47566 
124991 T50116 
457222 T50145 
125000 T58615 
132932 T59940 

65 444484 T63595 

125008 T64891 

125009 T64924 
' 445384 T64933 

125017 T68875 
70 125018 T69027 
125020 T69924 
437871 T70353 
134204 T79780 
125050 T79951 
75 125052 T80174 
125054 T80622 



N62375 Hs.102731 
AA903424 Hs.6786 
D54120 Hs.146409 
BE296313 Hs.265592 
N68168 
N68201 

N68300 Hs.138485 
N68321 Hs.231500 
N69575 Hs.102788 
AL133572 Hs.199009 
AI680737 Hs.289068 
T78277 Hs.100293 
AW582962 Hs.102897 
W19407 Hs.3862 
AB002316 Hs.65746 
AK001357 Hs.102951 
AA464836 Hs.291079 
J00077 Hs.155421 
R05283 

T97733 Hs.185685 
AL137586 Hs.52763 
AI248173 Hs.191460 
AW162919 Hs.170160 
AL134275 Hs.6434 
H11368 Hs.141055 
M553722 Hs.92096 
R45154 Hs.338439 
R41179 Hs.97393 
AW293165 Hs.143134 
NMJ04427HS.165263 
AW297929 Hs.328317 
W38537 Hs.280740 
R44357 Hs.48712 
R44519 
R45088 

R47948 Hs.188732 
H87832 Hs.7388 
AW966158 Hs.58582 
R55241 Hs.101214 
R59585 Hs.101255 , 
W07701 Hs.304177 
BE561430 Hs.239388 
R67567 Hs.107110 
NM_004455Hs.150956 
R73588 Hs.101533 
AI970003 Hs.23756 
H75964 Hs.107815 
R93622 Hs. 12163 
AF068846 Hs.103804 
AI766661 Hs.27774 
AW963279 Hs.123373 
T03170 Hs.100165 
AW964237 Hs.6728 
U55184 Hs.154145 
BE383768 Hs.65238 . 
R43504 Hs.6181 
AW960782 Hs.6856 
AI091121 Hs.246218 
T40841 Hs.98681 
BE313210 Hs.334798 
T50116 

NMJ)04477Hs.203772 
T58615 Hs.235887 
AW1 18826 Hs.6093 
AK002126 Hs.11260 
T91251 

T64924 Hs.303046 
T79136 Hs.127243 
T68875 

T69027 Hs.269481 
T69981 

AI084813 Hs.1 14088 
AI873257 Hs,7994 
AW970209 Hs.1 11805 
T85104 Hs.222779 
T80622 Hs.268601 



EST 
ESTs 

cell division cycle 42 (GTP-binding prot 
ESTs, Weakly similar to I38022 hypotheti 
gb:za1 1 c01 .s1 Soares fetal liver spleen 
ESTs, Weakly similar to I38022 hypotheti 
gb:za12gQ7.s1 Soares fetal (iver spleen 
EST 
ESTs 

protein containing CXXC domain 2 
Homo sapiens cDNA FLJ1 1918 fis; clone HE 
O-linked N-acetylglucosamine (GlcNAc) tr 
CGI-47 protein 

regulator of nonsense transcripts 2; DKF 
KIAA031 8 protein 

Homo sapiens cDNA FLJ10495 fis, clone NT 
ESTs, Weakly similar to T27173 hypotheti 
alpha-fetoprotein 

gb:ye91c08.s1 Soares fetal liver spleen 
ESTs 

anaphase-promoting complex subunit 7 

hypothetical protein MGC12936 

RAB2, member RAS oncogene family-like 

hypothetical protein DKFZp761F2014 

Homo sapiens clone 23758 mRNA sequence 

ESTs, Moderately similar to A46010 X-iin 

ESTs 

K1AA0328 protein 
ESTs 

early development regulator 2 (homotog o 
EST 

hypothetical protein MGC3040 
hypothetical protein FLJ20736 
gb:yg24h04.s1 Soares infant brain 1NIB H 
gb:yg38g04.s1 Soares infant brain 1N1B H 
ESTs 

kelch (Drosophila)-like 3 

Homo sapiens cDNA FLJ 12789 fis, clone NT 

EST 

ESTs 

Homo sapiens clone FLB8503 PR02286 mRNA, 
Human DNA sequence from clone RP1-304B14 
ESTs 

exostoses (multiple)-like 1 
ESTs 

hypothetical protein similar to swine ac 
ESTs 

eukaryotic translation initiation factor 
heterogeneous nuclear ribonucleoprotein 
ESTs, Highly similar to AF161349 1 HSPC0 
ESTs, Weakly similar to ALU1JHUMAN ALU S 
ESTs 

KIAA1548 protein 

hypothetical protein FLJ11585 

95 kDa retinoblastoma protein binding pr 

ESTs 

ash2 (absent, small, or homeotic, Drosop 
Homo sapiens cDNA: FLJ21781 fis, done H 
ESTs 

eukaryotic translation elongation factor 
gb:yb77c10.s1 Stratagene ovary (937217) 
FSHD region gene 1 
ESTs 

Homo sapiens cDNA: FU22783 fis, clone K 
hypothetical protein FLJ11264 
gb:yd60a10.s1 Soares fetal liver spleen 
ESTs 

Homo sapiens mRNA for KIAA1724 protein, 
gb:yc30f05.s1 Stratagene liver (937224) 
sex comb on midleg homolog 1 
gb:yc19d03.r1 Stratagene lung (937210) H 
ESTs 

hypothetical protein FU20551 
ESTs 

ESTs, Moderately similar to similar to N 
ESTs, Weakly similar to envelope [H.sapi 
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125063 


T85352 


T85352 




125064 


T85373 


T85373 




125066 


T86284 


T86284 




416507 


T89579 


AL045364 


Hs.79353 


125080 


T90360 


T90360 


Hs.268620 


125097 


T94328 


AW576389 Hs.335774 


125104 


T95590 


T95590 




135107 


T97257 


T97257 


Hs.94560 


423122 


T97599 


AA845462 


Hs.124024 


125118 


T97620 


R10606 


Hs.269890 


125120 


T97775 


T97775 


Hs.100717 


134160 


T98152 


T98152 


Hs.79432 


125136 


W31479 


AW962364 Hs.129051 


125144 


W37999 


AB037742 


Hs.24336 


125150 


W38240 


W38240 




450142 


W40150 


AW207469 Hs.24485 


131987 


W45435 


AW453069 Hs.3657 


125178 


W58202 


W93127 


Hs.31845 


125180 


W58344 


W58469 


Hs.103120 


125182 


W58650 


AA451755 


Hs.263560 


446888 


W68736 


AL030996 


Hs.16411 


125197 


W69106 


AF086270 


Hs.278554 


133497 


W69111 


BE617303 


Hs.74266 


429922 


W69399 


Z97630 


Hs.226117 


129232 


W69459 


R98881 


Hs.109655 


422166 


W72424 


W72424 


Hs.1 12405 


125209 


W72724 


W72724 


Hs.103174 


125212 


W72834 


AA746225 


Hs.103173 


456631 


W73955 


BE383436 


Hs.108847 


125223 


W74701 


AI916269 


Hs.109057 


125225 


W76540 


W74169 


Hs. 16492 


125228 


W79397 


AA033982 


Hs.1 10059 


132393 


W85888 


AL1 35094 


Hs.47334 


125238 


W86038 


■ N99713 


Hs.109514 


125247 


W86881 


AA694191 


Hs.163914 


129296 


W87804 


AI051967 


Hs.1 10122 


125263 


W88942 


AA098878 




125266 


W90022 


W90022 


Hs.1 86809 


450862 


W92272 


1191 543 


Hs.25601 


452401 


W92764 


NM_007115Hs.29352 


428243 


W93040 


H05317 


Hs.283549 


125277 


W93227 


W93227 


Hs.103245 


125276 


W93523 


AI218439 


Hs.129998 


125280 


W93659 


AI123705 


Hs.106932 


448205 


W94003 


W93949 


Hs.33245 


131844 


W94401 


AI419294 


Hs.324342 


125284 


W94688 


NM 002666HS.103253 


417111 


W94787 


AW016321 Hs.82306 


445424 


Z38294 


AB028945 


Hs.12696 


125289 


Z38311 " 


T34530 


Hs.4210 


446313 


Z38465 


H06245 


Hs.1 06801 


431342 


Z38525 


AW971018 Hs.21659 


433227 


Z38538 


AB040923 


Hs.106808 


428306 


Z38551 


AB037715 


Hs.183639 


424624 


Z38783 


AB032947 


Hs.151301 


125295 


Z39113 


AB022317 


Hs.25887 


125298 


Z39255 


AW972542 Hs.289008 


125300 


Z39591 


Z39591 


Hs.101376 


448376 


Z39783 


BE622770 


Hs.264915 


444582 


Z39920 


R55344 


Hs.22142 


130882 


Z40166 


AA497044 


Hs.20687 


128888 


Z40388 


AI760853 


Hs.241558 


125310 


Z40646 


R59161 


Hs.124953 


125315 


Z41697 


R38110 


Hs.106296 


125317 


Z99349 


Z99348 


Hs.1 12461 


135096 


Z99394 


AA081258 





gb:yd82d01.s1 Soares fetal liver spleen 
gb:yd82f07,s1 Soares fetal liver spleen 
gb:yd77b07.s1 Soares fetal liver spleen 
transcription factor Dp-1 
ESTs, Highly similar to ALU6_HUMAN ALU S 
EST, Moderately similar to S65657 alpha- 
gb:ye40a03.s1 Soares fetal liver spleen 
ESTs, Moderately similar to 138022 hypot 
deltex (Drosophila) homclog 1 
gb:yf35f 1 1 .s1 Soares fetal liver spleen 
EST 

fibrillin 2 (congenital contractural ara 
ESTs 

KIAA1321 protein 

Empirically selected from AFFX single pr 

chondroiiin sulfate proteoglycan 6 (bama 

activity-dependent neuroprotective prate 

ESTs 

ESTs 

ESTs 

hypothetical protein LOC57187 

heterochromatin-iike protein 1 

hypothetical protein MGC4251 

H1 histone family, member 0 

sex comb on midleg (DrosophilaHike 1 

S100 calcium-binding protein A9 (calgran 

ESTs, Weakly similar to TSP2.HUMAN THROM 

ESTs 

hypothetical protein MGC2749 

ESTs, Weakly similar to ALU5.HUMAN ALU S 

OKFZP564G2022 protein 

ESTs, Weakly similar to 138022 hypotheti 

hypothetical protein FLJ14495 

ESTs 

ESTs 

ESTs 

gb:zn45g10.r1 Stratagene HeLa cell s3 93 
ESTs, Highly similar to LCT2„HUMAN LEUKO 
chromodomain helicase DNA binding protei 
tumor necrosis factor, alpha-induced pro 
ESTs 
EST 

enhancer of polycomb 1 

ESTs 

ESTs 

ESTs 

perilipin 

destrin (actin depolymerizing factor) 
cortactin SH3 domain-binding protein 
Homo sapiens cDNA FLJ13069 fis, clone NT 
ESTs, Weakly similar to PC4259 ferritin 
ESTs 

kelch (Drosophila)-like 1 
hypothetical protein FLJ10210 
Ca2-Klependent activator protein for seer 
sema domain, immunoglobulin domain (Ig), 
Homo sapiens cDNA: FLJ21814 fis, clone H 
EST 

Homo sapiens cDNA FLJ12908 fis, clone NT 

cytochrome b5 reductase b5R.2 

hypothetical protein FLJ10392 

ariadne (Drosophila) homolog 2 

ESTs 

ESTs 

ESTs, Weakly similar to 138022 hypotheti 
zinc finger protein 36 (KOX18) 
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Table 3A shows the accession numbers for those pkeys (acklng unlgenelD's for Table 3. The pkeys in Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 



Pkey: Unique Eos probeset Identifier number 

CAT number: Gene cluster number 
Accession: Genbank accession numbers 



Pkey 



CAT Number Accession 



108469 
124106 
108501 
108562 
101300 



116761.1 
125446 J 
13684.-12 
36375.1 
4669.1 



132091 94851J 



117034 
100752 



20113.2 
33207.21 



116417 5418.11 



AA079487 M128547 AA128291 AA079587 AA079600 

H12245AA094769R14576 

AA083256 

AA100796 AF020589 AA074629 AA075946 AA100849 AA085347 AA1 26309 AA079311 AA079323 AA085274 
BE535511 M62098 AA306787 AW891766 AA348998 AA338869 AA344013 AW956561 AW389343 AW403607 L40391 
AW408435 AA121738 AI568978 H13317 R20373 AW948724 AW948744 M335023 AA436722 AA448690 C21404 
AW884390 AA345454 AA303292 AA174174 BE092290 T90614 AA035104 R76028 AA126924 AA741086 AW022056 
AW1 18940 AA121666 AI832409 AA683475 AI140901 AI623576 AW519064 AW474125 AI953923 A1735349 AW150109 
AI436154 AW1 18130 AW270782 AI804073 N27434 AA876543 AA937815 AI051 166 AA505378 AI041975 A1335355 
AI089540 AA662243 AI127912 AI925604 AI250880 AI366874 AI564386 A1815196 AI683526 AI435885 AI160934 H79030 
AI801493 AA448691 AI673767 A1076042 AI804327 AA813438 AA680002 AI274492 T16177 AI287337 AI935050 
AA907805 AA911493 AI589411 AI371358 AW576236 AI078866 AW516168 AA346372 A1560185 M471009 R75857 
AA296025 AA523155 AA853168 AI696593 A1658482 AI566601 AW072797 AA128047 AA035502 AW243274 AA992517 
R43760 

AW954243 AA829930 AA412478 AA828434 AA814538 AI927418 A1192435 W52897 AA443666 AA031913 AI683306 
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1 13947 genbanLW84768 W84768 

122562 genbank_AA452156 AA452156 

122635 genbankJ\A454085 AA454085 

108277 genbank_AA064859 AA064859 

108403 genbankJ\A075374 AA075374 

122860 genbankJWi64414 AA464414 

108427 genbank_AA076382 AA076382 

108439 genbank_AA078986 AA078986 

131353 231290.1 AW4 11 259 H23555AW01 5049 AI684275AW01 5886 AW068953AW01 4085 AI027260 R52686 AA91 8278 Al 129462 

AA969360 N34869 AI948416 AA534205 AA702483 AA705292 

108533 genbanJeAA084415 AA084415 

1 24254 genbank_H69899 H69899 

101447 entrez_M21305 M21305 

101458 entrezjrf22092 M22092 

101667 13349 1 NMJM5381 M60858 AW373732 AW373724 AW373689 AW373629 AW373609 AW373776 AA187808 AW386946 



AW374207 T05235 AA216203 AW385556 AA306940 AA306526 AA315461 AL036757 AW373711 AW403124 AW403640 
AW377084 T27360 H62638 F06957 AW377051 AA554779 AA378568 AAO96007 AW352407 AW302637 F07929 H17433 
AW382712 H05665 F07292 N39875 AA089729 H62556 N42842 R12952 AW373735 AW364155 AA056183 W39185 
AW382708 N32488 AF114096 AW375993 AI133569 W52561 AA603040 AA133710 AI928796 AW176370 AA827519 
AW338437 AA521142 T29341 AI800461 AW317002 M703914 AA86083O AI859203 AI445772 AA714334 AI817066 
AI832027 AW510442 AI635802 AW088306 AW068672 AW408555 AW467542 AA552657 AA152367 W32081 AA582124 
AA074040 AA931657 AJ051154 AW410203 AI921644 H17434 AI832330 AW404836 AI925038 AA088423 AA954166 
AA580453 AW021292 AI267215 AW080082 AW383778 AI933053 AI919097 W31557 N90245 AA931591 AA563995 
F36352 AA056184 AA476294 AA641327 AA533550 AI749630 W58323 AA569119 AA508573 AI809050 AI378996 
AA41 1 362 AW407505 AA938104 AA074041 AA632876 AW193748 AA507873 AI2701 28 AI472365 AA41 1363 AI523216 
AI719965 AI816302 AA182681 AI707990 AA133588 AI758537 W60253 AI460308 AA135423 AI083904 F04188 N89693 
AW408776 AI678595 AI270568 AA722059 W58234 F33650 AA090547 AA285108 AA425981 N85079 D20218 AI273980 
AA159028 F03226 AW247914 N26918 AW272741 N90109 H05666 N23327 AW247953 R44748 AA952015 F03558 
AI752394 AW409913 AW248396 AI816463 AI752393 AA325370 AA263089 AI570130 AI971951 AI160658 AI357360 
AW168686 AL121075 AW050536 N21672 W67748 AA514242 AI127386 H14607 AI185752 W79364 AA088520 AA152476 
AW351940 AW373683 AI940524 AW374953 T56500 N24329 AI940720 AW374933 AW374947 AW391913 AL1 38337 
AW376241 AW062943 F26666 AW41 0202 AW062958 F34529 AW381807 AW393315 W17147 AW176359 AA664576 
AW380424 AA306040 AI745674 AW300951 AI188579 AI438973 AI305271 AA433818 AA612807 AI831809 AI940409 
AA158663 AI572988 



1 24720 144582 J R05283 R1 1056 

124793 genbankJR44519 R44519 

1 24799 genbanklR45088 R45088 

103138 entrez_X65965 X65965 

1 17683 genbankJJ40180 N40180 

124991 genbank JT50116 T50116 

103432 entrezj<97748 X97748 

119174 genbank_R71234 R71234 

119239 95573 2 T11483 T11472 

133678 ' 11235J AW247252 AA346143 NM_000270 AA381085 N91995 X00737 AA381079 AA296473 AA296110 AA315735 AA311617 



AA326750 AA376804 AW403290 T95231 M13953 T47963 H82039 AA279899 AA627997 N76320 N99527 H37842 
W20095 AA457308 AW469547 AA724143 H83220 AA319496 W86334 W30892 R89169 R99427 N41854 H47286 
AA348094 AA045089 R63016 AI922219 AI024906 AI096488 AI885005 M194872 N90489 AI452544 H7241 1 AA282427 
AA430735 R68963 R22453 H70385 AW129369 AW467320 AW5 19082 AA345018 AA582183 AI961789 R65918 N30611 
AI979189 AI280889 AW273191 R66531 AI285845 AI675927 A(421990 AW190879 H37794 AA699667 H68427 AA954388 
A1188757 AI140048 AA430382 AI204151 AW247864 AA559099 AI431420 M548276 AI149466 AA772669 AA694388 
AA724168 AA301651 AA281952 AA779925 AA234760 W86290 AA913603 AW51 1745 AI500697 AA814922 AA835040 



124576 
108931 
108941 



genbank JJ68201 N68201 
genbank J\A147186 
genbank_AA148650 



AA147186 
AA148650 
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T47964 H53998 AA975804 R98710 AI077604 N70252 R98084 AW250171 H69268 AI597614 AA970746 AA972548 
AI3771 16 R62962 H16737 R89070 AA731329 R66532 N54354 AI818832 H81944 N71567 T95122 W86463 AA437095 
AI431999 AI915724 N63851 AI674743 AA457307 AA211475 N64444 AI799146 H72853 R99335 H60413 AA770367 
AA156105 AI269937 H64029 H89728 R65819 AW470496 AI873318 AI735713 H82987 C02447 AI478666 T27651 
AI699770 AW025156 H69719 AI984717 N69225 AI459856 AA953577 AI424691 H13843 R22404 AI873796 AI336002 
N70898 AI420854 AA541792 AA346142 AI000814 AI828348 AA045090 T51257 N90434 H13890 N73184 A1708083 
AA781606 AA329050 AA339985 R6B954 H84795 W04186 H16845 



119416 


genbank_T97186 T97186 




119558 


NOLFOUND entrez W38194 


W38194 


119559 


NOT_FOUND_entrez_W38197 


W38197 


119654 


genbanLW57759W57759 




121350 


genbanK_AA405237 


AA405237 


121558 


genbank_AA412497 


AA412497 


105985 


genbank_AA406610 


AA406610 


114648 


genbankJ\A101056 


AA101056 


121895 


genbankJ\A427396 


AA427396 


100327 


entrezj)55640 D55640 




123315 


714071J AA496369 M496646 


123473 


genbank_AA599143 


AA599143 
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TABLE 4: 



Pkey: 
Accession: 
ExAccn: 
UnigenelD: 
Vnigene Title: 



Unique Eos probeset identifier number 
Accession number used for previous patent filings 
Exemplar Accession number, Genbank accession number 
Unigene number 
Unigene gene title 



Pkey Accession ExAccn UniGene UnigeneTitle 



100405 
100420 
100481 
100484 
100718 
100991 
101097 
101168 
101194 
101261 
101345 
101447 
101485 
101543 
101550 
101560 
101674 
101714 
101741 
101838 
101857 
102012 
102024 
102164 
102241 
102283 
102303 
102564 
102663 
102759 
102778 
102804 
102887 
102898 
102915 
103036 
103037 
103095 
103158 
103166 
103185 
103280 
103554 
103850 
104465 
104592 
104764 
104786 
104850 
104865 
104894 
104952 
104974 
105178 
105263 
105330 
105376 
105729 
105826 
105977 
106008 
106031 
106124 



D86425 
D86983 

HG1098-HT1098 

HG1103-HT1103 

HG3342-HT3519 

J03764 

L06797 

L15388 

L20971 

L35545 

L76380 

M21305 

M24736 

M31166 

M31551 

M32334 

M61916 

M68874 

M74719 

M92934 

M94856 

U03057 

U03877 

U18300 

U27109 

U31384 

U33053 

U59423 

U70322 

U81607 

U83463 

U89942 

X04729 

X06256 

X07820 

X54925 

X54936 

X60957 

X67235 

X67951 

X69910 

X79981 

Z18951 

AA187101 

N24990 

R81003 

AA025351 

AA027168 

AA040465 

AA045136 

AA054087 

AA071089 

AA085918 

AA187490 

AA227926 

AA234743 

AA236559 

AA292694 

AA393243 

AA406363 

AA411465 

AA412284 

AA423987 



AW291587 Hs.82733 
D86983 Hs.1 18893 
X70377 Hs.121489 
NIVL005402HS.288757 
BE295928 Hs.75424 
J03836 Hs.82085 
BE245301 Hs.89414 
NM 005308HS.211569 
L20971 Hs.188 
D30857 Hs.82353 
NM_005795Hs.152175 
M21305 

AA296520 Hs.89546 
M31166 Hs.2050 
Y00630 Hs.75716 
AW958272 Hs.347326 
NM_0O2291Hs.82124 
M68874 Hs.211587 
NM_003199Hs.326198 
BE243845 Hs.75511 
BE550723 Hs.1 53179 
BE259035 Hs.1 18400 
AA301867 Hs.76224 
NM 000107HS.77602 
NM_007351Hs.268107 
AW161552 Hs.83381 
U33053 Hs.2499 
U59423 Hs.79067 
NM_002270Hs.1 68075 
NM_005100Hs.788 
AF000652 Hs.8180 
NM_002318Hs.83354 
J03836 Hs.82085 
NM_002205Hs.149609 
X07820 Hs.2258 
M13509 Hs.83169 
BE018302 Hs.2894 
NM_005424Hs.78824 
BE242587 Hs.1 18651 
AA1 59248 Hs.1 80909 
NM 006825HS.74368 
U84722 Hs.76206 
AI878826 Hs.74034 
AA187101 Hs.213194 
Z44203 Hs.26418 
AW630488 Hs.25338 
AI039243 Hs.278585 
AA027167 Hs.10031 
AL133035 Hs.8728 
T79340 Hs.22575 
AF065214 Hs.18858 
AW076098 Hs.345588 
Y12059 Hs.278675 
AA313825 Hs.21941 
AW388633 Hs.6682 
AW338625 Hs.22120 
AW994032 Hs.8768 
H46612 Hs.293815 
AA478756 Hs.194477 
AK001972 Hs.30822 
AB033888 Hs.8619 
X64116 Hs.171844 
H93366 Hs.7567 



nidogen 2 

Melanoma associated gene 
cystatin D 

v-ral simian leukemia viral oncogene horn 
inhibitor of DNA binding 1, dominant neg 
serine (or cysteine) proteinase inhibito 
chemokine (C-X-C motif), receptor 4 (fus 
G protein-coupled receptor kinase 5 
phosphodiesterase 4B, cAMP-specific (dun 
protein C receptor, endothelial (EPCR) 
calcitonin receptor-like 
gb:Human alpha satellite and satellite 3 
selectin E (endothelial adhesion molecul 
pentaxin-related gene, rapidly induced b 
serine (or cysteine) proteinase inhibito 
intercellular adhesion molecule 2 
laminin, beta 1 

phospholipase A2, group IVA (cytosolic, 
transcription factor 4 
connective tissue growth factor 
fatty acid binding protein 5 (psoriasis- 
singed (Drosophila)-like (sea urchin fas 
EGF-containing fibulin-Iike extracellula 
damage-specific DNA binding protein 2 (4 
multimerin 

guanine nucleotide binding protein 11 

protein kinase (Mike 1 

MAD (mothers against decapentaplegic, Dr 

karyopherin (importin) beta 2 

A kinase (PRKA) anchor protein (gravin) 

syndecan binding protein (syntenin) 

lysyl oxidase-like 2 

serine (or cysteine) proteinase inhibito 
integrin, alpha 5 (fibronectin receptor, 
matrix metalloproteinase 10 (stromelysin 
matrix metalloproteinase 1 (interstitial 
placental growth factor, vascular endoth 
tyrosine kinase with immunoglobulin and 
hematopoietically expressed homeobox 
peroxiredoxin 1 

transmembrane protein (63kD), endoplasmi 
cadherin 5, type 2, VE-cadherin (vascula 
caveolin 1, caveolae protein, 22kD 
hypothetical protein MGC10895 
ESTs 

protease, serine, 23 
ESTs 

KIAA0955 protein 

hypothetical protein DKFZp434G171 
B-cell CLL/lymphoma 6, member B (zinc fi 
phospholipase A2, group IVC (cytosolic, 
desmoptakin (DPI, DPI!) 
bromodomain-containing 4 
AD036 protein 

solute carrier family 7, (cationic amino 
ESTs 

hypothetical protein FLJ10849 

Homo sapiens HSPC285 mRNA, partial cds 

E3 ubiquitin ligase SMURF2 

hypothetical protein FU11110 

SRY (sex determining region Y)-box 18 

Homo sapiens cDNA: FLJ22296 fis, clone H 

Homo sapiens cDNA: FLJ21962 fis, clone H 
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106155 AA425309 AA425414 Hs.33287 

106302 AA435896 AA398B59 Hs.18397 

106423 AA448238 AB020722 Hs.16714 

106793 AA478778 H94997 Hs.16450 

5 107174 AA621714 BE122762 Hs.25338 

107216 D51069 D51069 Hs.211579 

107295 T34527 AA186629 Hs.80120 

107385 U97519 NMJ)05397Hs.16426 

108756 AA127221 AA127221 Hs.117037 

10 108846 AA132983 AL117452 Hs.44155 

108888 AA135606 AA135606 Hs.189384 

109001 AA156125 AI056548 Hs.72116 

109166 AA179845 AA219691 Hs.73625 

109456 AA232645 AW956580 Hs.42699 

15 109768 F10399 F06838 Hs.14763 

110107 H16772 AW151660 Hs.31444 

110906 N39584 AA035211 Hs.17404 

110984 N52006 AW613287 Hs.80120 

111006 N53375 BE387014 Hs.166146 

20 111018 N54067 AI287912 Hs.3628 

111133 N64436 AW580939 Hs.97199 

111760 R26892 BE551929 Hs.268754 

113073 T33637 N39342 Hs.103042 

113195 T57112 H83265 Hs.8881 

25 113923 W80763 AW953484 Hs.3849 

114521 AA046808 AW139036 Hs.108957 

115061 AA253217 AI751438 Hs.41271 

115096 AA255991 A1683069 Hs.175319 

115145 AA258138 AA740907 Hs.88297 

30 115619 AA426573 AA486620 Hs.41135 

115947 AA443793 R47479 Hs.94761 

116314 AA490588 AI799104 Hs.178705 

116339 AA496257 AK000290 Hs.44033 

116430 AA609717 AK001531 Hs.66048 

35 116589 D59570 AI557212 Hs.17132 

116733 F13787 AL157424 Hs.61289 

117023 H88157 AW070211 Hs.102415 

117186 H98988 H98988 Hs.42612 

117563 N34287 AF055634 Hs.44553 

40 117997 N52090 N52090 Hs.47420 

118475 N66845 N66845 

118581 N68905 N68905 

119073 R32894 BE245360 Hs.279477 

119155 R61715 R61715 Hs.310598 

45 119174 R71234 R71234 

119221 R98105 C14322 Hs.250700 

119416 T97186 T97186 

119866 W80814 AA496205 Hs.193700 

121335 AA404418 AA404418 

50 121381 AA405747 AW088642 Hs.97984 

123160 AA488687 AA488687 Hs.284235 

123473 AA599143 AA599143 

123523 AA608588 AA608588 

123533 AA608751 AA608751 

55 123964 C13961 C13961 

124006 D60302 AI147155 Hs.270016 

124315 H94892 NM_005402Hs.288757 

124659 N93521 AI680737 Hs.289068 

124669 N95477 AI571594 Hs.102943 

60 124847 R60044 W07701 Hs.304177 

124875 R70506 AI887664 Hs.285814 

125091 T91518 T91518 

125103 T95333 AA570056 Hs.122730 

125355 R45630 R60547 Hs.170098 

65 125565 R20839 R20840 

125590 R23858 R23858 Hs. 143375 

423765 R23858 R23858 Hs.143375 

126511 AI024874 T92143 Hs.57958 

100286 W26247 BE247550 Hs.86859 

70 126563 W26247 AA516391 Hs.181368 

126649 AA856990 AA001860 Hs.279531 

449602 AA856990 AA001860 Hs.279531 

126872 AA136653 AW450979 

456000 AA136653 BE180876 Hs.11614 

75 414221 AA136653 AW450979 

127402 AA358869 AA358869 Hs.227949 



nuclear factor l/B 

hypothetical protein FLJ23221 

Rho guanine exchange factor (GEF) 15 

ESTs 

ESTs 

melanoma cell adhesion molecule 

UDP-N-acetyl-alpha-D-galactosamine:polyp 

podocalyxin-like 

ESTs 

DKFZP586G1517 protein 

gb:zl1 0aO5.s1 Soares_pregnantjjterus_NbH 

hypothetical protein FLJ20992 similar to 

RAB6 interacting, kinesin-like (rabkines 

ESTs 

ESTs 

ESTs 

ESTs 

UDP-N-acetyi-alpha-D-galactosamine:polyp 
Homer, neuronal immediate eariy gene, 3 
mitogen-adivated protein kinase kinase 
complement component C1q receptor 
Homo sapiens cDNA FU1 1949 fis, clone HE 
microtubule-associated protein 1B 
ESTs, Weakly similar to S41044 chromosom 
hypothetical protein FLJ22041 similar to 
40S ribosomal protein S27 isoform 
Homo sapiens mRNA full length msert cDN 
ESTs 
ESTs 

endomucin-2 
KIAA1691 protein 

Homo sapiens cDNA FLJ1 1333 fis, clone PL 

dipeptidyi peptidase 6 

hypothetical protein FLJ10669 

ESTs, Moderately similar to I54374 gene 

synaptojanin 2 

Homo sapiens mRNA; cDNA DKFZp586N0121 (f 
ESTs, Weakly similar to ALU1_HUMAN ALU S 
unc5 (C.elegans homolog) c 
EST 

gb:za46c11.s1 Soares fetal liver spleen 
gb:za69b09.s1 Soares JetalJung_NbHL19W 
ESTs 

ESTs, Moderately similar to ALU1_HUMAN A 
gb:yi54c08.s1 Soares placenta Nb2HP Homo 
uyptase beta 1 

gb:ye50h09.s1 Soares fetal liver spleen 
Homo sapiens mRNA; cDNA DKFZp586l0324 (f 
gb:zw37e02.s1 SoaresJotal_fetusJJb2HF8_ 
hypothetical protein FU22252 similar to 
ESTs, Weakly similar to I38022 hypotheti 
gb:ae52d04.s1 Stratagene lung carcinoma 
gb:ae54e06.s1 Stratagene lung carcinoma 
gb:ae56h07.s1 Stratagene lung carcinoma 
gb:C 13961 Clontech human aorta polyA+ mR 
ESTs 

v-ral simian leukemia viral oncogene horn 

Homo sapiens cDNA FLJ 11918 fis, clone HE 

hypothetical protein MGC12916 

Homo sapiens clone FLB8503 PR02286 mRNA, 

sprouty (Drosophtla) homolog 4 

gb:ye20f05.s1 Stratagene lung (937210) H 

ESTs, Moderately similar to KIAA1215 pro 

KIAA0372 gene product 

gb:yg05c08.r1 Soares infant brain 1NIB H 

Homo sapiens, done IMAGE:3840937, mRNA, 

Homo sapiens, clone IMAGE:3840937, mRNA, 

EGF-TM7-latrophiIin-retated protein 

growth factor receptor-bound protein 7 

U5 snRNP-specific protein (220 kD), orth 

ESTs 

ESTs 

gb:UI-H-BI3-ala-a-12-0-Ul.s1 NCLCGAP_Su 
HSPC065 protein 

gb:UI-H-BI3-ala-a-12-0-Ul.s1 NCLCGAP^Su 
SEC 13 (S. cerevisiae)-like 1 
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127651 AM23976 AA382523 Hs.1056B9 

424806 AI123976 AA382523 Hs.105689 

128062 AA379500 AA379621 Hs.105547 

128992 R49693 H04150 Hs.107708 

5 129046 AA195678 AB029290 Hs.108258 

129188 M30257 NM_001078Hs. 109225 

129314 AA028131 BE622768 Hs.290356 

129371 M10321 X06828 Hs.1 10802 

129468 J03040 AW410538 Hs.111779 

10 129765 M86933 M86933 Hs.1238 

129805 AA012933 AA012848 Hs.12570 

129884 AA286710 AF055581 Hs.13131 

130495 AA243278 AW250380 Hs.109059 

130639 D59711 AI557212 Hs.17132 

15 130657 T94452 AW337575 Hs.201591 

130828 AA053400 AW631469 Hs.203213 

130972 AA370302 D81866 Hs.21739 

131080 J05008 NM 001955HS.2271 

131137 U85193 W27392 Hs.33287 

20 131182 AA256153 AI824144 Hs.23912 

131486 X83107 F06972 Hs.27372 

131573 AA046593 AA040311 Hs.28959 

131647 AA410480 AA359615 Hs.30089 

131756 D45304 AA443966 Hs.31595 

25 131859 M90657 AW960564 

131881 AA010163 AW361018 Hs.3383 

132050 AA136353 AI267615 Hs.38022 

132083 Y07867 BE386490 Hs.279663 

132164 U84573 AI752235 Hs.41270 

30 132358 X60486 NM_003542Hs.46423 

132413 AA132969 AW361383 Hs.260116 

132456 AA114250 AB011084 Hs.48924 

132490 F13782 NM_001290Hs.4980 

132676 AA283035 N92589 Hs.261038 

35 132687 AB002301 AB002301 Hs.54985 

132718 AA056731 NM_004600Hs,554 

132736 U68019 AW0B1883 Hs.211578 

132760 H99198 AA125985 Hs.56145 

132933 AA598702 BE263252 Hs.6101 

40 132968 N77151 AF234532 Hs.61638 

132994 AA505133 AA1 12746 Hs.279905 

133061 AB000584 AI186431 Hs.296638 

133147 D12763 M026533 Hs.66 

133161 AA253193 AW021103 Hs.6631 

45 133200 AA432248 AB037715 Hs.183639 

133260 AA083572 M403045 Hs.6906 

133363 AA479713 AI866286 Hs.71962 

133491 L40395 BE619053 Hs.170001 

133517 X52947 NML000165HS.74471 

50 133550 W80846 AI129903 Hs.74669 

133607 M34539 BE273749 

133614 D67029 NMJ303003HS.75232 

133627 U09587 NM 002047HS.75280 

133691 M85289 M85289 Hs.211573 

55 133696 D10522 AI878921 Hs.75607 

133913 W84712 AU076964 Hs.7753 

133975 D29992 C18356 Hs.295944 

133985 L34657 L34657 Hs.78146 

134039 S78569 NM_002290Hs.78672 

60 134088 D43636 AI379954 Hs.79025 

134161 U97188 AA634543 Hs.79440 

134299 AA487558 AW580939 Hs.97199 

134416 M28882 X68264 Hs.211579 

116470 X70683 AI272141 Hs.83484 

65 134656 X14787 AI750878 Hs.87409 

134989 AA236324 AW968058 Hs.92381 

135051 C15324 AI272141 Hs.83484 

135073 AA452000 W55956 Hs.94030 

135349 D83174 AA1 14212 Hs.9930 

70 100114 D00596 X0230B Hs.82962 

100130 D11428 NM_000304Hs.103724 

100143 D13640 AU076465 Hs.278441 

100168 D14874 H73444 Hs.394 

100208 D26129 NMJ)02933Hs.78224 

75 100224 D28476 AL121516 Hs.138617 

100405 D86425 AW291587 Hs.82733 



MSTP031 protein 
MSTP031 protein 

neural proliferation, differentiation an 
ESTs 

actin binding protein; macrophin (microf 
vascular cell adhesion molecule 1 
mesoderm development candidate 1 
von Willebrand factor 
secreted protein, acidic, cysteine-rich 
amelogenln (Y chromosome) 
tubulin-specific chaperone d 
lysosomal 

mitochondrial ribosomal protein L12 
ESTs, Moderately similar to 154374 gene 
ESTs 
ESTs 

Homo sapiens mRNA; cDNA DKFZp586l1518 (f 
endothelin 1 
nuclear factor \IB 
ESTs 

BMX non-receptor tyrosine kinase 

ESTs 

ESTs 

ESTs 

transmembrane 4 superfamily member 1 
upstream regulatory element binding prot 
ESTs 
Pirin 

procollagen-lysine, 2-oxoglutarate 5-dio 
H4 histone family, member G 
metalloprotease 1 (pitrilysin family) 
KIAA0512gene product; ALEX2 
LIM domain binding 2 
ESTs, Weakly similar to I38022 hypotheti 
KIAA0303 protein 

Sjogren syndrome antigen A2 (60kD, ribon 
Homo sapiens cDNA: FU23037 fis, clone L 
thymosin, beta, identified in neuroblast 
hypothetical protein MGC3178 
myosin X 

clone HQ0310 PRO0310p1 
prostate differentiation factor 
interieukin 1 receptor-like 1 
hypothetical protein FLJ20373 
hypothetical protein FLJ10210 
Homo sapiens cDNA: FLJ23197 fis, done R 
ESTs, Weakly similar to B36298 proline-r 
eukaryotic translation initiation factor 
gap junction protein, alpha 1, 43kD (con 
vesicle-associated membrane protein 5 (m 
FK506-binding protein 1A (12kD) 
SEC14 (S. cerevisiae)-like 1 
glycyl-tRNA synthetase 
heparan sulfate proteoglycan 2 (perlecan 
myristoylated alanine-rich protein kinas 
calumenin 

tissue factor pathway inhibitor 2 
platelet/endothellal cell adhesion molec 
laminin, alpha 4 
KIAA0096 protein 
IGF-ll mRNA-binding protein 3 
complement component C1q receptor 
melanoma cell adhesion molecule 
SRY (sex determining region Y)-box 4 
thrombospondin 1 

nudix (nucleoside diphosphate linked moi 

SRY (sex determining region Y)-box 4 

Homo sapiens mRNA; cDNA DKFZp586E1624 (f 

serine (or cysteine) proteinase inhibito 

thymidylate synthetase 

peripheral myelin protein 22 ^ 

KIAA0015 gene product 

adrenomedullin 

ribonuclease, RNase A family, 1 (pancrea 
thyroid hormone receptor interactor 12 
nidogen 2 
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100420 D86983 

100455 D87953 

100529 HG1862-HT1897 

100618 HG2614-HT2710 

100619 HG2639-HT2735 
100658 HG2855-HT2995 
100676 HG3044-HT3742 
100718 HG3342-HT3519 
100752 HG3543-HT3739 
100828 HG4069-HT4339 
100850 HG417-HT417 
100991 J03764 
101097 L06797 
101110 L08246 
101142 L12711 
101156 L13977 
101168 L15388 
101184 L19871 
101192 L20859 
101317 L42176 
101336 L49169 
101345 L76380 
101400 M15990 
101475 M23254 
101485 M24736 
101496 M26576 
101505 M27396 
101543 M31166 
101557 M31994 
101560 M32334 
101587 M35878 
101592 M36429 

101633 M57730 

101634 M57731 
101667 M60858 
101682 M62994 
101714 M68874 
101720 M69043 
101741 M74719 
101744 M75126 
101793 M84349 

101837 M92843 

101838 M92934 
101840 M93056 
101857 M94856 
101864 M95787 
101931 S76965 
101966 S81914 

102012 U03057 

102013 U03100 
102024 U03877 
102059 U08021 
(02121 U14391 
102283 U31384 
102300 U32944 
102378 U40369 
102395 U41767 
102460 U48959 
102491 U51010 
102499 U51478 
102523 U53445 
102560 U59289 
102564 U59423 
102589 U62015 
102600 U63825 
102645 U67963 
102687 U73379 
102693 U73824 
102709 U77604 
102759 U81607 
102804 U89942 
102882 X04412 
102907 X06985 
102915 X07820 
102927 X12876 
102960 X15729 



D86983 Hs.1 18893 

AW888941 Hs.75789 

BE313693 Hs.334330 

AI752163 Hs.1 14599 

N24433 Hs.241567 

U56725 Hs.180414 

X02761 Hs.287820 

BE295928 Hs.75424 
T81309 

AL048753 Hs.303649 

AA836472 Hs.297939 

J03836 Hs.82085 

BE245301 Hs.89414 

AI439011 Hs.86386 

L12711 Hs.89643 

AA340987 Hs.75693 

NMJ)05308Hs.211569 
NM.001674Hs.460 
BE247295 Hs.78452 
L42176 Hs.8302 
NM_006732Hs.75678 
NM_005795Hs.1 52175 
M15990 Hs.194148 
BE410405 Hs.76288 
AA296520 Hs.89546 
X12784 Hs.1 19129 
AA307680 Hs.75692 
M31166 Hs.2050 
BE293116 HsJ6392 
AW958272 Hs.347326 
AI752416 Hs.77326 
AF064853 Hs.91299 
NM_004428Hs.1624 
AV650262 Hs.75765 
NMJ)05381 
AF043045 Hs.81008 
M68874 Hs.211587 
M69043 Hs.81328 
NM_003199Hs.326198 
AI879352 Hs.1 18625 
W01076 Hs.278573 
M92843 Hs.343586 
BE243845 Hs.75511 
AA236291 Hs.183583 
BE550723 Hs.153179 
BE392588 Hs.75777 
NM_006823Hs.75209 
X96438 Hs.76095 
BE259035 Hs.1 18400 
BE616287 Hs.178452 
AA301867 Hs.76224 
AI752666 Hs.76669 
NM__004998Hs.82251 
AW161552 Hs.83381 
AI929721 Hs.5120 
AU076887 Hs.28491 
AU077005 Hs.92208 
U48959 Hs.211582 
U51010 

BE243877 Hs.76941 
U53445 Hs.15432 
R97457 Hs.63984 
U59423 Hs.79067 
AU076728 Hs.8867 
A1984144 Hs.66713 
AL119566 Hs.6721 
NM_007019Hs.93002 
AA532780 Hs.183684 
AA122237 Hs.81874 
NM_Q05100Hs.788 
NM_0023l8Hs.83354 
A1767736 Hs.290070 
BE409861 H3.202833 
X07820 Hs.2258 
BE512730 Hs.65114 
AI904738 Hs.76053 



Melanoma associated gene 

N-myc downstream regulated 

calmodulin 2 (phosphorylase kinase, delt 

collagen, type VIII, alpha 1 

RNA binding motif, single stranded inter 

heat shock 70kD protein 2 

fibronectin 1 

inhibitor of DNA binding 1, dominant neg 
insulin-like growth factor 2 (somatomedi 
small inducible cytokine A2 (monocyte ch 
cathepsin B 

serine (or cysteine) proteinase mhibito 

chemokine (C-X-C motif), receptor 4 (fus 

myeloid cell leukemia sequence 1 (BCL2-r 

transketolase (Wernicke-Korsakoff syndro 

prolylcarboxypeptidase (angiotensinase C 

G protein-coupled receptor kinase 5 

activating transcription factor 3 

solute earner family 20 (phosphate tran 

four and a half LIM domains 2 

FBJ murine osteosarcoma viral oncogene h 

calcitonin receptor-like 

v-yes-1 Yamaguchi sarcoma viral oncogene 

calpain 2, (m/ll) large sub-unit 

selectin E (endothelial adhesion molecul 

collagen, type IV, alpha 1 

asparagine synthetase 

pentaxin-related gene, rapidly induced b 

aldehyde dehydrogenase 1 family, member 

intercellular adhesion molecule 2 

insulin-like growth factor binding prote 

guanine nucleotide binding protein (G pr 

ephrin-A1 

GR02 oncogene 

nucteolin 

filamin B, beta (actin-binding protein-2 
phospholipase A2, group IVA (cytosoBc, 
nuclear factor of kappa light polypeptid 
transcription factor 4 
hexokinase 1 

CD59 antigen p18-20 (antigen identified 
zinc finger protein homologous to Zfp-36 
connective tissue growth factor 
serine (or cysteine) proteinase inhibito 
fatty acid binding protein 5 (psoriasis- 
transgelin 

protein kinase (cAMP-dependent, catalyti 
immediate early response 3 
singed (Drosophila)-like (sea urchin fas 
catenin (cadherin-associated protein), a 
EGF-containing fibulin-Iike extracellula 
nicotinamide N-methyltransferase 
myosin IE 

guanine nucleotide binding protein 11 
dynein, cytoplasmic, light polypeptide 
spermidine/spermine N1-acetyltransferase 
a disintegrin and metalloprotelnase doma 
myosin, light polypeptide kinase 
gb:Human nicotinamide N-methyltransferas 
ATPase, Na+/K+ transporting, beta 3 poly 
downregulated in ovarian cancer 1 
cadherin 13, H-cadherin (heart) 
MAD (mothers against decapentaplegic, Dr 
cysteine-rich, angiogenic inducer, 61 
hepatitis delta antigen-interacting prot 
lysosomal 

ubiquitin carrier protein E2-C 
eukaryotic translation initiation factor 
microsomal glutathione S-transferase 2 
A kinase (PRKA) anchor protein (gravin) 
lysyl oxidase-like 2 
gelsolin (amyloidosis, Finnish type) 
heme oxygenase (decycling) 1 
matrix metalloproteinase 10 (stromelysin 
keratin 18 

DEAD/H (Asp-Glu-Ala-Asp/His) box potypep 
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103011 


X52541 


AJ243425 


Hs.326035 


103020 


X53416 


X53416 


Hs.195464 


103029 


X54489 


AW800726 Hs.789 


103036 


X54925 


M13509 


Hs.83169 


103056 


X57206 


Y18024 


Hs.78877 


103080 


X59798 


AU077231 


Hs.82932 


103095 


X60957 


NM_005424Hs.78824 


103138 


X65965 


X65965 




103176 


X69111 


AL021154 


Hs.76884 


103195 


X70940 


AA351647 


Hs,2642 


103347 


X87838 


AU077309 


Hs.171271 


103371 


X91247 


X91247 


Hs.13046 


103432 


X97748 


X97748 




103471 


Y00815 


Y00815 


Hs.75216 


103967 


AA303711 


AL1 20051 


Hs.144700 


104447 


L44538 


AW204145 


Hs.156044 


104764 


AA025351 


AI039243 


Hs.278585 


104783 


AA027050 


AA533513 


Hs.93659 


104798 


AA029462 


AW952619 


Hs.17235 


104865 


AA045136 


T79340 


Hs.22575 


104877 


AA047437 


AI13B635 


Hs.22968 


104894 


AA054087 


AF065214 


Hs.18858 


104952 


AA071089 


AW076098 


Hs.345588 


105113 


AA1 56450 


AB037816 


Hs.8982 


105178 


AA1 87490 


AA313825 


Hs.21941 


105196 


AA1 95031 


W84893 


Hs.9305 


105215 


AA205724 


AA205759 


Hs.10119 


105263 


AA227926 


AW388633 


Hs.6682 


105271 


AA227986 


AA807881 


Hs.25329 


105330 


AA234743 


AW338625 


Hs.22120 


105461 


AA253216 


BE539071 


Hs.69388 


105492 


AA256210 


AI805717 


Hs.289112 


105493 


AA256268 


AL047586 


Hs.10283 


105594 


AA279397 


AB024334 


Hs.25001 


105727 


AA292379 


AL135159 


Hs.20340 


105732 


AA292717 


AW504170 


Hs.274344 


105767 


AA346551 


AW370946 


Hs.23457 


105882 


AA400292 


W46802 


Hs.81988 


105936 


AA404338 


AI678765 


Hs.21812 


106031 


AA412284 


X64116 


Hs.171844 


106124 


AA423S87 


H93366 


Hs.7567 


106222 


AA428594 


AA356392 


Hs.21321 


106241 


AA430108 


8E019681 


Hs.6019 


106263 


AA431462 


W21493 


Hs.28329 


106264 


AA431470 


AL046859 


Hs.3407 


106366 


AA443756 


AA186715 


Hs.336429 


106454 


M449479 


NM_014038Hs.5216 


106634 


AA459916 


W25491 


Hs.288909 


106724 


AA465226 


N48670 


Hs.28631 


106793 


AA478778 


H94997 


Hs.16450 


106799 


AA479037 


BE313412 


Hs7961 


106842 


AA482597 


AF1 24251 


Hs.26054 


106868 


AA487561 


BE185536 


Hs.301183 


106890 


AA489245 


AA489245 


Hs.88500 


106961 


AA504110 


AW243614 Hs.18063 


106974 


AA520989 


AI817130 


Hs.9195 


107030 


AA599434 


AL1 17424 


Hs.25035 


107061 


AA608649 


BE147611 


Hs.6354 


107086 


AA609519 


NMJ312331HS.26458 


107216 


D51069 


D51069 


Hs.211579 


107385 


U97519 


NM_005397Hs.16426 


107444 


W28391 


W28391 


Hs.343258 


107985 


AA035638 


T40064 


Hs.71968 


108507 


AA083514 


AI554545 


Hs.68301 


108695 


AA121315 


AB029000 


Hs.70823 


108931 


AA147186 


AA147186 




109001 


AA156125 


AI056548 


Hs.72116 


109195 


AA1 88932 


AF047033 


Hs.132904 


109390 


AA219653 


AW007485 Hs.87125 


109456 


AA232645 


AW956580 Hs.42699 


109737 


F10078 


AA055415 


Hs.13233 


110411 


H48032 


AW001579 Hs.9645 


110660 


H82117 


AA782114 


Hs.28043 


110906 


N39584 


AA035211 


Hs.17404 


111018 


N54067 


AI287912 


Hs.3628 


111091 


N59858 


AA300067 


Hs.33032 



early growth response 1 

filamin A, alpha (actin-binding protein- 

GR01 oncogene (melanoma growth slimulali 

matrix metalloproteinase 1 (Interstitial 

inositol 1 ,4,5-trisphosphate 3-kinase B 

cyclin D1 (PRAD1: parathyroid adenomatos 

tyrosine kinase with immunoglobulin and 

gb:H.sapfens SOD-2 gene for manganese su 

inhibitor of DNA binding 3, dominant neg 

eukaryotic translation elongation factor 

catenin (cadherin-associated protein), b 

thioredoxin reductase 1 

gb:H.sapiens PTX3 gene promotor region. 

protein tyrosine phosphatase, receptor t 

ephrin-B1 

ESTs 

ESTs 

protein disulfide isomerase related prot 

Homo sapiens clone TCCCIA00176 mRNA sequ 

B-ceil CLUymphoma 6, member B (zinc fi 

Homo sapiens clone IMAGE451939, mRNA se 

phospholipase A2, group IVC (cytosolic, 

desmoplakin (DPI, DPII) 

Homo sapiens, clone IMAGE:3506202, mRNA, 

AD036 protein 

angiotensin receptor-like 1 

hypothetical protein FLJ 14957 

solute carrier family 7, (cationic amino 

ESTs 

ESTs 

hypothetical protein FLJ20505 

CGI-43 protein 

RNA binding motif protein 8B 

tyrosine 3-monooxygenase/tryptophan 5-mo 

KIAA1002 protein 

hypothetical protein MGC12942 

ESTs 

disabled (Drosophila) homolog 2 (mitogen 
ESTs 

Homo sapiens cDNA: FLJ22296 fis, clone H 

Homo sapiens cDNA: FLJ21962 fis, clone H 

Homo sapiens clone FLB9213 PR02474 mRNA, 

Homo sapiens cDNA: FLJ21288 fis, clone C 

hypothetical protein FLJ 14005 

protein kinase (cAMP-dependent, catalyti 

RIKEN cDNA 9130422N19 gene 

HSPC028 protein 

hypothetical protein FLJ22471 

Homo sapiens cDNA; FLJ22141 fis, clone H 

ESTs 

Homo sapiens clone 25012 mRNA sequence 
' novel SH2-containing protein 3 
molecule possessing ankyrin repeats indu 
mitogen-activated protein kinase 8 inter 
Homo sapiens cDNA FLJ 10768 fis, clone NT 
Homo sapiens cDNA FLJ13698 fis, clone PL 
chloride intracellular channel 4 
stromal cell derived factor receptor 1 
methionine sulfoxide reductase A 
melanoma cell adhesion molecule 
podocatyxin-like 

proliferation-associated 2G4, 38kD 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 

ESTs 

KIAA1077 protein 

gb:zo38d01 .s1 Stratagene endothelial eel 
hypothetical protein FLJ20992 similar to 
solute carrier family 4, sodium bicarbon 
EH-domain containing 3 
ESTs 

ESTs, Moderately similar to A47582 B-cel 
Homo sapiens mRNA for KIAA1741 protein, 
ESTs 
ESTs 

mitogen-activated protein kinase kinase 
hypothetical protein DKFZp434N185 
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111356 
111378 
111741 
111769 
112318 
112951 
113057 
113195 
113490 
113542 
113803 
113847 
113910 
113947 
114047 
115061 
115819 
115870 
115964 
116228 
116264 
116314 
116589 
117023 
117112 
117156 
117176 
117280 
119559 
119866 
120655 
121314 
121335 
121822 
121835 
122331 
122577 
123160 
123486 
124059 
124339 
124358 
124364 
124726 
124763 
125167 
125304 
125307 
125329 
107985 
125598 
125609 
116024 
418000 
126399 
127435 
127566 
127619 
434190 
128453 
128495 
128515 
128580 
128623 
128642 
128669 
128903 
128914 
129087 
129188 
129226 
129265 
129345 
129468 
129488 
101838 



N90933 

N93764 

R26124 

R27957 

R55470 

T16550 

T26674 

T57112 

T88700 

T90527 

W42789 

W60002 

W78175 

W84768 

W94427 

AA253217 

AA426573 

AA432374 

AA446622 

AA478771 

AA482594 

AA490588 

D59570 

H88157 

H94648 

H97538 

H98670 

N22107 

W38197 

W80814 

AA287347 

AA402799 

AA404418 

AA425107 

AA425435 

AA442872 

AA452860 

AA488687 

AA599674 

F13673 

H99093 

N22495 

N23031 ■ 

R15740 

R39610 

W45560 

Z39833 

Z40583 

AA825437 

R66613 

R66613 

AA868063 

AA128075 

AA1 28075 

AA1 28075 

N66570 

AI051390 

AA627122 

AA627122 

X02761 

AF010193 

AA149044 

U82108 

D78676 

L35240 

AA598737 

R69417 

AA232837 

N72695 

M30257 

M96843 

X68277 

AA292440 

J03040 

AA228107 

AA449789 



BE301871 Hs.4867 
AW160993 Hs.326292 
AB020653 Hs.24024 
AW629414 Hs.24230 
AW083384 Hs.11067 
AA307634 Hs.6650 
AW194301 Hs.339283 
H83265 Hs.8881 
BE178110 Hs.173374 
H43374 Hs.7890 
AW880709 Hs.283683 
NM_005032Hs.4114 
AA113262 Hs.17901 
W84768 

AL035858 Hs.3807 
A1751438 Hs.41271 
AA486620 Hs.41135 
NMJ)05985Hs.48029 
AA987568 Ha.74313 
AI767947 Hs.50841 
D51174 Hs.272239 
AI799104 Hs.178705 
AI557212 Hs.17132 
AW070211 Hs.102415 
AW969999 Hs.293658 
W73853 



H45100 
M18217 
W38197 
AA496205 



Hs.49753 
Hs.172129 

Hs.193700 

AA305599 Hs.238205 

W07343 Hs.182538 
AA404418 
AI743860 

AB033030 Hs.300670 

AL133437 Hs.1 10771 

AA829725 Hs.334437 

AA488687 Hs.284235 

BE019072 Hs.334802 

BE387335 Hs.283713 

H99093 Hs.343411 

AW070211 Hs.102415 

AF265555 Hs.250646 
NIVL003654HS.104576 

BE4 10405 Hs.76288 

AL137540 Hs.102541 

AL359573 Hs.124940 

AW580945 Hs.330466 

AA825437 Hs.58875 

T40064 Hs.71968 

T40064 Hs.71968 

AA868063 Hs.104576 

AA088767 Hs.83883 

AA932794 Hs.83147 

AA088767 Hs.83883 

X69086 Hs.286161 

AI051390 Hs.116731 

AA627122 Hs.163787 

AA627122 Hs.163787 

X02761 Hs.287820 
NM_005904Hs.100602 

BE395085 Hs.10086 



U82108 
BE076608 
Z28913 
W28493 



Hs.101813 
Hs. 105509 
Hs.102948 
Hs.180414 



AW1 50717 Hs.345728 
AW867491 Hs.107125 
AI348027 Hs.108557 
NM_001078Hs.109225 
BE222494 Hs.180919 
AA530892 Hs.171695 
R22497 Hs.1 10571 
AW410538 Hs.111779 
AW966728 Hs.54642 
BE243845 Hs.75511 



mannosyl (alpha-1 ^-glycoprotein beta- 
hypolhetical gene DKFZp434A1 1 14 
KIAA0846 protein 
ESTs 

ESTs, Highly similar to T46395 hypotheti 
vacuolar protein sorting 45B (yeast homo 
Human DNA sequence from clone RP1-187J1 1 
ESTs, Weakly similar to S41044 chromosom 
Homo sapiens cDNA FLJ10500 fis, clone NT 
Homo sapiens mRNA for KIAA1671 protein, 
chromosome 8 open reading frame 4 
plastin 3 (T isofomn) 

Homo sapiens, clone IMAGE:3937015, mRNA, 
gb:zh53d03.s1 Soares_fetaUiver_$pleen_ 
FXYD domain-containing ion transport reg 
Homo sapiens mRNA full length insert cDN 
endomucin-2 

snail 1 (drosophifa homofog), zinc finge 

KIAA1265 protein 

ESTs 

lysosomal 

Homo sapiens cDNA FLJ11333 fis, clone PL 
ESTs, Moderately similar to I54374 gene 
Homo sapiens mRNA; cDNA DKF2p586N0121 (f 
ESTs 
ESTs 

uveal autoantigen with coiled coil domai 
Homo sapiens cDNA: FU21409 fis, clone C 
Empirically selected from AFFX single pr 
Homo sapiens mRNA; cDNA DKF2p586l0324 (f 
hypothetical protein PRO2013 
phospholipid scramblase 4 
gb:zw37e02.s1 Soares_totaiJetus_Nb2HF8_ 
metallothionein 1E (functional) 
KIAA1204 protein 

Homo sapiens cONA: FLJ21904 fis, clone H 
hypothetical protein MGC4248 
ESTs, Weakly similar to I38022 hypotheti 
Homo sapiens cDNA FLJ14680 fis, clone NT 
ESTs, Weakly similar to S64054 hypotheti 
DEAD/H (Asp-G(u-AIa-Asp/His) box polypep 
Homo sapiens mRNA; cDNA DKF2p586N0121 (f 
baculoviral IAP repeat-containing 6 
carbohydrate (keratan sulfate Gal-6) sul 
cal^aln 2, (m/ll) large subunit 
netrin4 

GTP-binding protein 

ESTs 

ESTs 

Homo sapiens mRNA; cDNA DKF2p564F053 (fr 
Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
carbohydrate (keratan sulfate Gal-6) sul 
transmembrane, prostate androgen induced 
guanine nucleotide binding protein-like 
transmembrane, prostate androgen induced 
Homo sapiens cDNA FLJ13613 fis, done PL 
ESTs 
ESTs 
ESTs 

fibronectin 1 

MAD (mothers against decapentaplegic, Dr 
type I transmembrane protein Fn14 
solute earner family 9 (sodium/hydrogen 
CTL2gene 

enigma (LIM domain protein) 
heat shock 70kD protein 8 
STAT induced STAT inhibitor 3 
plasmalemma vesicle associated protein 
hypothetical protein PP1057 
vascular cell adhesion molecule 1 
inhibitor of DNA binding 2, dominant neg 
dual specificity phosphatase 1 
growth arrest and DNA-damage-inducible, 
secreted protein, acidic, cysteine-rich 
methionine adenosyltransferase H, beta 
connective tissue growth factor 
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413731 AA449789 BE243845 Hs.75511 

129557 W01367 AL045404 Hs.46366 

129619 AA610116 AA209534 Hs.284243 

129627 AA258308 T40064 Hs.71968 

5 129762 AA460273 AA453694 Hs. 12372 

129884 AA286710 AF055581 Hs.13131 

130018 T68873 AA353093 

130147 D63476 D63476 Hs.172813 

130178 M62403 U20982 Hs.1516 

10 130282 X55740 BE245380 Hs.153952 

130431 L10284 AW505214 Hs.155560 

130495 AA243278 AW250380 Hs.109059 

130553 AA430032 . AF062649 Hs.252587 

130638 H16402 AW021276 Hs.17121 

15 1 30639 D59711 AI557212 Hs.17132 

130657 T94452 AW337575 Hs.201591 

130686 AA431571 BE548267 Hs.337986 

130776 R79356 AF167706 Hs.19280 

130818 AA280375 AW190920 Hs.19928 

20 130840 Z49269 BE048821 Hs.20144 

130899 Z41740 AI077288 Hs.296323 

131002 AA121543 AL050295 Hs.22039 

131080 J05008 NM_001955Hs.2271 

131084 AA101878 NM_017413Hs.303084 

25 131091 T35341 AJ271216 Hs.22880 

131107 N87590 BE620886 HsJ5354 

131182 AA256153 AI824144 Hs.23912 

131207 W74533 AF104266 Hs.24212 

131319 U25997 NM_003155Hs.25590 

30 131328 V01512 AW939251 Hs.25647 

131509 X56681 X56681 Hs.2780 

131555 AA161292 T47364 Hs.278613 

131564 AA491465 T93500 Hs.28792 

131573 AA046593 AA040311 Hs.28959 

35 131692 D50914 BE559681 Hs.30736 

131756 D45304 AA443966 Hs.31595 

131859 M90657 AW960564 

131909 W69127 NM_016558Hs.274411 

131915 AA316186 AI161383 Hs.34549 

40 132046 AA384503 AI359214 Hs.179260 

132050 AA136353 AI267615 Hs.38022 

132151 AA044755 BE379499 Hs.173705 

132164 U84573 AI752235 Hs.41270 

132187 AA058911 AA235709 Hs.4193 

45 132303 AA620962 BE1 77330 Hs.325093 

132314 AA285290 AF1 12222 Hs.323806 

132358 X60486 NM_003542Hs.46423 

132398 R31641 AA876616 Hs.16979 

132421 AA489190 AW163483 Hs.48320 

50 132490 F13782 NM_001290Hs.4980 

132520 AA257993 AA257992 Hs.50651 

132546 M24283 M24283 Hs.168383 

132610 AA443114 AA1 60511 Hs.5326 

132716 T35289 BE379595 Hs.283738 

55 132840 N23817 BE218319 Hs.5807 

132883 M047151 AA373314 Hs.5897 

132968 N77151 AF234532 Hs.61638 

132989 AA480074 AA480074 Hs.331328 

132999 Y00787 Y00787 Hs.624 

60 133071 T99789 BE384932 Hs.64313 

133076 W84341 AW946276 Hs.6441 

133099 L09209 W16518 Hs.279518 

133147 D12763 AA026533 Hs.66 

133149 T16484 AA370045 Hs.6607 

65 133161 AA253193 AW021103 Hs.6631 

133200 AA432248 AB037715 Hs.183639 

133220 X82200 NM_006074Hs.318501 

133260 AA083572 AA403045 Hs.6906 

133295 L00352 AH47861 Hs.213289 

70 133349 N75791 AW631255 Hs.8110 

133391 X57579 AW103364 Hs.727 

133398 X02612 NMJ00499HS.72912 

133436 H44631 BE294068 Hs.737 

133454 AA090257 BE547647 Hs.177781 

75 133478 X83703 X83703 Hs.31432 

133491 L40395 BE619053 Hs.170001 



connective tissue growth factor 
KIAA0948 protein 
tetraspan NET-6 protein 

Homo sapiens mRNA; cDNA DKFZp564F053 (fr 
tripartite motif protein TRIM2 
lysosomal 
metallothionein 1 L 

PAK-interacting exchange factor beta 
insulin-tike growth factor-binding prote 
5' nucleotidase (CD73) 
calnexin 

mitochondrial ribosomal protein L12 
pituitary tumor-transforming 1 
ESTs 

ESTs, Moderately simitar to 154374 gene 
ESTs 

Homo sapiens cDNA FU10934 fis, clone OV 

cysteine-rich motor neuron 1 

hypothetical protein SP329 

small inducible cytokine subfamily A (Cy 

serum/giucocorticoid regulated kinase 

KIAA0758 protein 

endothelin 1 

apelin; peptide ligand for APJ receptor 

dlpeptidylpeptidase III 

GCN1 (general control of amino-acid synt 

ESTs 

latrophilin 

stanniocalcln 1 

v-fos FBJ murine osteosarcoma viral onco 

jun D proto-oncogene 

interferon, alpha-inducible protein 27 

Homo sapiens cDNA FLJ11041 fis, clone PL 

ESTs 

KIAA0124 protein 
ESTs 

transmembrane 4 superfamily member 1 

SCAN domain-containing 1 

ESTs, Highly similar to S94541 1 done 4 

chromosome 14 open reading frame 4 

ESTs 

Homo sapiens cDNA: FLJ22050 fis, clone H 

procollagen-lysine, 2-oxoglutarate 5-dio 

DKFZP58601624 protein 

Homo sapiens cDNA: FLJ21210 fis, clone C 

pinin, desmosome associated protein 

H4 histone family, member G 

ESTs, Weakly similar to A43932 mucin 2 p 

double ring-finger protein, Dorfin 

LIM domain binding 2 

Janus kinase 1 (a protein tyrosine kinas 

intercellular adhesion molecule 1 (CD54) 

amino acid system N transporter 2; porcu 

casein kinase 1, alpha 1 

GTPase Rab14 

Homo sapiens mRNA; cDNA DKFZp586P1622 (f 
myosin X 

hypothetical protein FU13213 
interleukin 8 

ESTs, Weakly similar to AF257182 1 G-pro 
Homo sapiens mRNA; cDNA DKFZp586J021 {fr 
amyloid beta (A4) precursor-like protein 
interleukin 1 receptor-like 1 
AXIN1 up-regulated 
hypothetical protein FLJ20373 
hypothetical protein FL110210 
Homo sapiens mRNA full length insert cDN 
Homo sapiens cDNA: FLJ23197 fis, clone R 
low density lipoprotein receptor (famili 
L-3-hydroxyacyl-Coenzyme A dehydrogenase 
inhibin, beta A (adivin A, activin AB a 
cytochrome P450, subfamily I (aromatic c 
immediate early protein 
hypothetical protein MGC5618 
cardiac ankyrin repeat protein 
eukaryotic translation initiation factor 
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133510 


AA227913 


AW880841 Hs.96908 


133517 


X52947 


NM_000165Hs.74471 


133526 


M11313 


AU077051 


HSJ4561 


133538 


L14837 


NMJ03257HS.74614 


133562 


M60721 


M60721 


Hs.74870 


133584 


D90209 


D90209 


Hs.181243 


133590 


T67986 


T70956 


Hs.75106 


133617 


AA148318 


BE244334 


Hs.75249 


133651 


U97105 


AI301740 


Hs.173381 


133671 


T25747 


AW503116 


Hs.301819 


133678 


K02574 


AW247252 




133681 


D78577 


AI352558 




133722 


X53331 


AW969976 


Hs.279009 


133730 


S73591 


BE242779 


Hs.179526 


133750 


X95735 


BE410769 


Hs.75873 


133802 


L16862 


AW239400 


Hs.76297 


133825 


U44975 


BE616902 


Hs.285313 


133838 


M97796 


BE222494 


Hs.180919 


133859 


U86782 


U86782 


Hs.178761 


133889 


AA099391 


U48959 


Hs.211582 


133960 


M19267 


M19267 


Hs.77899 


133975 


D29992 


C18356 


Hs.295944 


133977 


L19314 


AH 25639 


Hs.250666 


134039 


S78569 


NM 002290HS.78672 


134075 


U28811 


NIVL012201HS.78979 


134081 


L77886 


AL034349 


Hs.79005 


134164 


C14407 


AW245540 Hs.79516 


134203 


M60278 


AA161219 


Hs.799 


134238 


R81509 


AA102179 


Hs.160726 


134299 


AA487558 


AW580939 Hs.97199 


134332 


D86962 


D86962 


Hs.81875 


134339 


AA478971 


R70429 


Hs.81988 


134343 


D50683 


D50683 


Hs.82028 


134381 


U56637 


AI557280 


Hs.184270 


134403 


M61199 


AA334551 




134416 


M28882 


X68264 


Hs.211579 


134493 


X15183 


M30627 


Hs.289088 


134558 


S53911 


NM_001773Hs.85289 


134817 


U20734 


AU076592 


Hs.198951 


134983 


D28235 


D28235 


Hs.195384 


134989 


AA236324 


AW968058 Hs.92381 


135052 


AA148923 


AL136653 


Hs.93675 


135062 


AA174183 


AK000967 


Hs.93872 


135069 


AA456311 


AA876372 


Hs.93961 


135071 


L08069 


W27190 


Hs.94 


135073 


AA452000 


W55956 


Hs.94030 


135170 


AA282140 


T53169 


Hs.9587 


135196 


J02854 


C03577 


Hs.9615 


135348 


AA442054 


U80983 


Hs.268177 



p53-induced protein' 

gap junction protein, alpha 1, 43kD (con 

alpha-2-macroglobultn 

tight junction protein 1 (zona occludens 

H2.0 (Drosophila)-like homeo box 1 

activating transcription factor 4 (tax-r 

clusterin (complement lysis inhibitor, S 

ADP-ribosylation factor-like 6 interacti 

dihydropyrimidtnase-fike 2 

zinc finger protein 146 

nucleoside phosphorylase 

tyrosine 3-monooxygenase/tryptophan 5-mo 

matrix Gla protein 

upregulated by 1,25-dihydroxyvitamin D-3 
zyxln 

G protein-coupled receptor kinase 6 
core promoter element binding protein 
inhibitor of DNA binding 2, dominant neg 
26S proteasome-associated pad1 homolog 
myosin, light polypeptide kinase 
tropomyosin 1 (alpha) 
tissue factor pathway inhibitor 2 
hairy (Drasophila)-homolog 
laminin, alpha 4 
Golgi apparatus protein 1 
protein tyrosine phosphatase, receptor t 
brain abundant, membrane attached signal 
diphtheria toxin receptor (heparin-bindi 
Homo sapiens cDNA FLJ1 1680 fis, clone HE 
complement component C1q receptor 
growth factor receptor-bound protein 10 
disabled (Drosophila) homolog 2 (mitogen 
transforming growth factor, beta recepto 
capping protein (actin filament) muscle 
sperm specific antigen 2 
melanoma cell adhesion molecule 
heat shock 90kD protein 1, alpha 
CD34 antigen 
jun B proto-oncogene 
prostaglandin-endoperoxide synthase 2 (p 
nudix (nucleoside diphosphate linked moi 
decidual protein induced by progesterone 
K1AA1682 protein 

Homo sapiens mRNA; cDNA DKF2p667D095 (fr 
DnaJ (Hsp40) homolog, subfamily A, membe 
Homo sapiens mRNA; cDNA DKFZp586E1624 (f 
Homo sapiens cDNA: FLJ22290 fis, clone H 
myosin regulatory light chain 2, smooth 
phospholipase C, gamma 1 (formerly subty 
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TABLE 4A 

Table 4A shows the accession numbers for those pkeys lacking unigenelD's for Table 4. The pkeys in Table 7 lacking unigeneiD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the "Accession" 
column, 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey CAT Number Accession 

100752 33207 21 T81309 BE019033 R94181 BE019198 NM_000612 J03242 AW411299 BE300064 BE297544 R94182 AW630108 T53723 

058853 H78073 H80594 BE299560 T48899 H70196 M17426 H1W7 S77035 H58384 H61664 H78540 T84527 C17198 
H60255 H71980 R92644 W79050 X00910 M29645 R91055 M17863 M17862 T71815 BE299561 BE464561 X06260 
R94741 T54216 C18594 BE262015 X06161 AW409889 AA378400 BE263228 BE313278 R881 16 BE313457 H43500 
T48617 BE313761 H77309 AI207601 X06159 H40413 X03425 T87663 R10627 X03562 M141 18 W03982 R97520 H81229 
T83157 H83168 H48762 AA669898 BE263054 H47289 AA022807 R1 1555 H74260 R76968 R28338 H72534 H72464 
H62031 N72478 N45355 AW411300 R891 13 R69135 H58454 T83281 R93476 H69645 H68015 T82229 H71089 T851 21 
H59939 W65299 N78176 H53909 N72373 R21788 H04660 H59639 H61874 BE262219 T53614 N73335 N50464 W00943 
N77189 RB9257 AA570502 R89432 R06366 AA553480 AA776271 AA551359 AA551050 H51670 AA601052 BE299081 
H68198 H52276 BE207832 N91 192 H70332 X07868 X07868 H69464 H53782 H73710 R80435 AA553384 AW884176 
N53475 T71662 AW954036 AW954033 AA552931 H93206 AA430218 AA553476 AI918470 T54124 BE207982 BE300177 
N73994 AW882625 N39549 N53838 AA722389 H71878 H58909 H37849 H78435 T47933 R77174 R83814 M411890 
H94199 AA663208 BE205778 AA490137 H70492 R98232 H37800 AA679294 H40341 H74238 H47290 H73231 T48618 
AA025428 AI039521 H92969 N59389 H80538 H72933 T90630 AA411891 N55000 H74225 AA340290 AW957061 T54316 
AA340437 H57125 H58908 H79027 H63450 N74623 R93425 H68714 H68758 N68396 H48763 N69256 H57320 H53831 
H53589 N68833 N52453 H56048 H69870 H78074 R69253 R83375 T53615 H94330 H58455 H90864 T47934 H74261 
R89258 R97997 R91056 R28339 R86760 H78235 R97521 H67692 H40358 AA022688 H52513 H59601 T88690 H65256 
H63397 W65397 AA553588 R19280 N52645 W73930 R06367 R21743 H72372 N73921 AW883539 AW882639 T4061 6 
H47084 R95723AA634316AA862781 H77310 R91389 H93111 R92767 T54512 R89341 H70333 H57817H82941 H62032 
N52638 H58385 T91796 H51086 AA340292 T49918 H81230 R36121 N50411 T87664 N62436 N39340 AA665637 
AA340446 H93377 H92973 BE296290 BE269788 H61665 AA340444 N54605 AA454101 R10628 R94200 Al 200549 
AA342640 BE298855 BE250229 T49916 H82008 N28278 AW880662 H71268 N76791 H47685 H65255 W05193 
AW889144 N76677 H71702 H68036 H71915 R91612 R87807 H68059 AI133328 AI247866 AA621443 AW8B1050 
M700847 AA340413 AW878608 AW881 181 AW878249 H71916 N54596 BE161581 AW878082 W04212 AW881040 
AW885492 AW880519 AA334887 AW878715 W06882 AW630222 AW885381 H70869 AW381778 H47601 AW889982 
H63868 AW884986 AW878713 AW878685 R36391 AW87B694 AA368070 C03393 AW878695 AW878705 AW878665 
AW878742 AW878620 AW878823 AW878688 R29048 AW878690 AW878686 AW878810 AW878827 AW878733 
AW878659 AW878749 AW878681 AW883353 AW883277 AW883300 AW883565 AW883298 AW883143 AW883045 
AW883482 AW883352 AW883417 AW883357 AW883231 AW883474 AW883355 AW882620 AW882533 AW883754 
AW883139 AW882827 AW88364 1 AW883567 AW883481 AW882983 AW882982 AW382465 AW88341 9 AW882466 
AW883639 AW883230 AW882981 AW882534 AW882874 AW882619 AW883480 AW882826 AW882831 AW882835 
AW882830 AW883563 AW882456 AW627642 

1 17156 145392 1 W73853 AA9281 12 W77887 AW889237 AA148524 AI749182 AI754442 AI338392 AI253102 AI079403 AI370541 AI697341 

H97538 AW188021 AI927669 W72716 AI051402 AI188071 A1335900 N21488 AW770478 W92522 A1691028 AI913512 
AI144448 W73819 M604358 N28900 W95221 AI868132 H98465 AA148793 

131859 3672J AW960564 AA092457 T55890 D56120 T92525 AI815987 BE182608 BE182595 AW080238 M90657 AA347236 AW961686 

AW176446 AA304671 AW583735 T61714 AA316968 AI446615 AA343532 AA083489 AA488005 W52095 W39480 N57402 
D82638 W25540 W52847 D82729 D58990 BE619182 AA315188 AA308636 AA1 12474 W76162 AA088544 H52265 
AA301631 H80982AA1 13786 BE620997 AW651691 AA343799 BE613669 BE547180 BE546656 F11933 AA376600 
AW239185 AA376086 BE544387 BE619041 AA452515 AA001806 M190873 AA180483 AA159546 F00242 AI940609 
A1940602 AI189753 T97663 T66110 AW062896 AW062910 AW062902 AI051622 AI828930 AA102452 AI685095 AI819390 
AA557597 M383220 AI804422 AI633575 AW338147 AW603423 AW606800 AW750567 AW510672 AI250777 AA08351 0 
AW629109 AW513200 AA921353 AI677934 AI148698 AI955858 AA173825 AA453027 AI027865 AW375542 AA454099 
AA733014 AI591384 R79300 R80023 AA843108 AA626058 AA844898 AW375550 AA889018 AI474275 AW205937 
AI052270 AW388117 AW3881 11 AA699452 AI242230 N47476 H38178 AA366621 AA1 13196 AA130023 H39740 T61629 
AI885973 AW083671 AA179730 AA305757 AI285455 N83956 AA216013 AA336155 AW999959 T97525 AA345349 
T91762 AA771 981 AI285092 AI591386 BE392486 BE385852 AA682601 AI682884 AA345840 T85477 AA292949 
AA932079 AA098791 D826Q7 T48574 AW752038 C06300 

125565 1704098 J R20840R20839 

133607 1 227_6 BE273749 BE397561 BE387189 AL037858 AL037878 AI963094 BE259216 AA01 1363 AL036189 BE562325 AA251 169 

BE617431 N98537 AA158093 AL047800 M34539 NMJ300801 AA312140 D16971 AA158904 AA3071 14 AA312803 
T09203 AW629686 AL048504 BE388578 AA220957 AA158364 BE267385 AA294971 C18055 BE241757 AA1 15056 
AI936769 BE378435 BE206971 AW674924 BE622060 AA604674 AA1 15273 AW402159 AA338608 BE568819 M80199 
X55741 AA3751 1 1 AA376016 BE612671 AA805742 AW405588 N25850 N44580 H06031 AW403549 BE536552 AA056726 
BE543239 AA082517 A1201645 AI201642 AI192622 N40104 AA370921 BE547569 AI969602 AA302038 AI197890 
AW268354 AI014938 W45448 AI541395 AA037272 BE538826 AL039613 BE536130 AA299355 AW805147 AW974624 
H53220 AI471471 AA399303 AA007386 W35106 BE613277 R12739 R12738 AA304342 AA687802 BE409581 AI498844 
AV662092 AW904105 AA011375 BE315214 H99302 BE537893 N32299 AW855829 AI291320 BE078322 AI301395 
AA303362 N32719 M358328 AA357877 AI952540 H56279 H02758 H02048 AW805233 R82224 AA410772 AA291352 
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133681 13893J 



134403 17037.1 



126872 142696J 



121335 
130018 
121822 



279548J 
18986 1 
244391J 



BE171 109 N69935 BE169248 AA361 173 H44978 BE617887 D52560 AA084043 W03595 R67219 N36477 N42924 R67104 
H44901 H79695 W21105 AA393988 W30899 AA316096 BE622896 W46872 AA442678 BE544893 BE5401 12 BE621873 
AA338067 N55052 BE398154 BE621210 AA740760 C03739 C03206 BE396692 AA482370 AA031614 AA301575 
AA304710 AA132153 AA029796 AA994960 H19567 AA442969 H49781 H46871 AA035395 AA056185 AA149378 
AA643080 AL135479 AA292329 AA654337 M041228 AA454888 AA025039 W58331 AA625981 T94941 AA302448 
H19900 AA218956 AA513790 AA563962 AA398076 W44441 AA293276 W47373 AA625879 W30688 AA043029 T64284 
R79151 AA304340 AA485186 AA604939 R82470 AA421425 AW771456 A1339329 M304424 AA605236 AA936934 
AA587673 AI209162 A1697301 A1479995 AI679814 AI361950 AW189125 AI955888 AI986019 BE301019 AI084792 
AI310211 AW189307 AI022070 AW977204 AI146825 AW190163 AW303281 AI828345 BE046043 AW029257 AA482268 
AI246507 AI420729 AW084932 AW439514 AI890487 AW439692 AI523896 AM 86612 AI659953 AI889773 AA687527 
AW072694 AW262163 AW467371 AI613269 AI679238 D54404 AA158103 AW105527 AW149739 AW150361 AW268387 
AW117708 A1951682 AI687440 AW674285 AA678365 AI587082 AA732095 AA019899 W45661 AA627300 BE613304 
AA765891 AA61 2935 AI814658 AW316916 R66594 AA514640 AA025040 AA031472 AW732076 AA029797 AI244560 
AI128734 AW381720 AI092360 AI263283 AW613175 AI890675 AI720156 AW631348 AI635106 AI278045 AA303979 
AA703505 W45449 AW078661 AI292052 AW381707 AI147854 AW381743 AA158905 M303258 AA888144 AW195967 
AA428706 AA989559 AA617731 H19882 BE543418 AA830386 AA421302 W58652 T94995 AI869743 AI679145 
AW085971 N98425 AA765136 AI347027 AI356955 AA928038 AI679717 AA458459 AA679281 AI367973 AI270041 
AA765135 AA732793 A1798447 AA668646 AA251008 AI984538 AI401737 AA056186 BE043308 AW662375 AI3021 10 
N50724 W96332 BE537047 N26983 AI567172 AA765296 AW673237 N29784 AA534275 AAQ84044 AW067973 
AW300766 T63398 W46823 R39790 AI364185 AW298582 AA454814 AW069878 N67751 H05982 N23140 AI362647 
AI302086 AI767772 N25755 H531 14 AA706133 T9351 1 AA429291 M935294 AA987647 W02803 R66595 AI680795 
W23673 AW440794 AA722872 H49538 AW131042 AA531603 AA908665 AA040791 AA23531 2 W52205 N93444 R82180 
H02759 H79696 AW088894 H56079 AA961143 AW067776 AW973745 AA016311 AW071227 AA017511 AI753994 
W47374 T64155 AA296092 AI698626 AA558158 AA296088 AW794259 H01963 AA149267 AA485076 AA975856 H44938 
AA035396 AI955555 H46289 AA486161 AI631222 AA359047 AW794253 AI806962 AW243930 AA526145 AW878734 
AA018464 AA132031 R67220 R79152 AA296093 H54300 AI005160 BE242548 AW992803 AW878644 AW878666 T27742 
R82471 AW517604 AW472738 AI282904 R39791 AA486098 AW467891 AW960520 AA551736 AA056621 AW945197 
R66373 AA554236 BE242202 AI904376 AI832590 H19484 R00890 AI627677 AA302287 AI869451 A1734855 AI708073 
AI832902 AA585184 AW204299 AA055565 D12417 D1 1975 T63543 AW664099 R54423 BE61 2712 T96340 T63985 
AA598917 T40735 T64053 AA149284 AW272548 AA363445 AA042893 AW300697 BE261973 T53501 T53500 AW878729 
AW878657 AW794391 AA069193 R01553 H44875 AA385406 AA533968 M93060 AL135600 W96331 AA017651 
AA018849 AA017692 H85337 BE278690 AA731598 AA018512 AI076813 AI022644 R02585 X52220 AW296894 AA825671 
AI699321 AI393601 AW592611 AI146747 AA606921 AA158365 AW590007 AA354519 D20081 R02704 AW798339 
M92422 AA094903 AA007676 

AI352558 Z82248 X78138 NMJ03405 AU077248 AA223125 S80794 D78577 AI124697 AW403970 BE614089 BE296713 
BE621334 L20422 X80536 D54224 D54950 X57345 N29226 AA127798 AA340253 F08031 AA192540 H67636 AA321827 
AW950283 AA084159 BE538808 AW401377 AA256774 C03366 W46595 W47608 M305009 H69431 H69456 AL120082 
H1 1 706 AA303717 AA361357 H22042 H78020 AW999584 AA1 34368 AA32291 1 AA322961 H60980 N85248 N31 547 
H79624 T1 1718 W85826 AW894663 AW894624 BE167441 BE170015 AA304626 AW602163 AW998929 AA156681 
AA151067 BE002724 AA608688 H82692 BE155392 AW383636 BE155394 M487004 AW383504 AI342365 R82553 
W16498 BE155344 AI143938 R69901 AA322873 AW340648 R25364 AA367935 AI559406 AA033522 AA374252 
AW835019 AI922133 AI697089 N99662 AW189078 AI199076 AW151598 W59944 AA662875 W94022 AA299055 
AI039008 AIB29449 AA583503 AI635674 AW131665 AI473820 AW2731 18 AW900930 AA908944 AI688035 AW170272 
AI082545 AW468176 AI608761 AI082748 AI91 1682 A1248943 AI831016 AA192465 AI218477 AA938406 AA385288 
AI809817 AA905196 AI191245 AI470204 AI188296 AI421367 AI125315 AI087141 M629032 AA740589 AI554181 
AA150830 AI248541 A1077943 AA775958 AA864930 AI261476 AI123121 AI310394 AA862331 AA872478 BE537084 
AI205606 AA720684 AI872093 AW150042 AL120538 AA219627 AA988608 C21397 AI359337 H25337 AI089749 
AA605146 AI359620 AA150478 AI359738 AW383642 AW995424 AI766457 R56892 AI089839 W61343 N69107 W46459 
AA565955 N20527 AI279782 W46596 AA776573 H23204 AI866231 AI083995 N21530 AA126874 D82630 W65437 
AI086917 AW382095 AI086877 H69844 AW340217 W85827 L08439 AA262704 AA505380 W47413 W94135 AA223241 
AW089153 AA084101 BE538000 AA096126 T28031 AA491574 R84813 AA774536 AW383522 AA155615 AW383529 
AA491520 AW028427 AA171496 A1469689 AW664539 AI81 1102 AI811116 BE464590 BE350791 H78021 T15405 H21979 
AA219489 H13301 AA505883 AI864305 AI423963 AW084401 F04963 R69858 H67097 AI917740 AI655581 H69864 
AA033631 AW383484 AI886261 H25293 AA513281 AW271 187 H1 1617 N79982 AI174338 AI904207 AI904208 BE614558 
W94127 W65436 AI272249 AA700018 AI579932 AI085941 AW152629 

AA334551 BE008229 AA307537 AW961156AW995894 AW995826 NM 006751 M51199 AA045603 AL036372 AV645606 
AI688095 AW351901 AA101337 AA101345 N73342 BE018030 BE569044 AW841975 AA373388 BE090412 H95440 
N53845 R67867 AA093441 AA363427 H93708 AW023134 AW994986 AW994989 BE090429 R23614 AI567932 H03726 
H01101 H01867 AA548743 AI671806 AW872949 AW872941 AA742447 AI199788 M045604 AI637465 AI741796 
AW242217 AW131463 AI765302 AI683923 AA889762 AI804889 AI986437 C06049 BE502340 AI695651 AI491970 
AA496804 AA281008 AA665699 AI473814 BE301445 AA707837 AA551925 AI017348 AI208185 AA775203 AA156296 
AA557463 H95441 AA768547 AW769358 AA991197 AA181954 AI091389 AH47289 AW771837 AI638582 AA844411 
AI374750 T29320 AW951272 AW085923 H02834 AA843259 AA814696 AW183290 AA158453 N68125 N69039 AA100423 
AA101346 AI918720 H01102 R67868 H01868 N66438 R46580 AI858433 AA599560 AA187577 AA157481 AA361520 
AL047827 AA158452 R21688 AW964874 AA325161 R40871 AW752395 AW375924 R13355 AA281174 AA428908 
AW450979 AA136653 AA136656 AW419381 M984358 AA492073 BE168945 AA809054 AW238038 BE01 1212 BE01 1359 
BE011367 BE011368 BE011362 BE011215 BE011365 BE011363 
AA404418 AI217248 

AA353093 AW957317 AW872498 AI560785 A!289110 AW135512 X97261 T68873 

AI743860 N49543 AW027759 BE349467 AI656284 BE463975 R35022 AA370031 AW955302 AL042109 N53092 AI611424 
AL079362 AI969290 AI928016 BE394912 BE504220 BE467505 AI61 161 1 AI61 1407 AI61 1452 W56437 AI284566 
AI583349 AW183058 AI30B085 AI074952 AA437315 AA628161 AW301728 AI150224 AA400137 AA437279 AI223355 
AA639462 AI261 373 AI432414 AI984994 AI539335 AA401550 AA358757 AI609976 AA442357 AA359393 AA437046 
AA370301 AA429328 AW272055 AI580502 AI832944 AI038530 AA425107 AI014986 AI148349 AW237721 AW779756 - 
AW137877 AI125293 AA400404 R28554 



123523 genbank_AA608588 



AA608588 
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123533 genbank_AA608751 AA608751 

125091 genbankJ91518 T91518 

123964 genbank C13961 C13961 

102491 entrez_U51010 U51010 

1 18475 genbanLN66845 N66845 

118581 genbankJI68905 N68905 

113947 genbankJV84768W84768 

101447 entrez_M21305 M21305 

101657 13349 1 NMJJ05381 M60858 AW373732AW373724 AW373689 AW373629 AW373609 AW373776AA1 87806 AW386946 

AW374207 T05235 AA216203 AW385556 AA306940 AA306526 AA315461 AL036757 AW373711 AW403124 AW403640 
AW377084 T27360 H62638 F06957 AW377051 AA554779 AA378568 AA096007 AW352407 AW302637 F07929 H17433 
AW382712 H05665 F07292 N39875 AA089729 H62556 N42842 R12952 AW373735 AW364155 AA056183 W39185 
AW382708 N32488 AF114096 AW375993 AI133569 W52561 AA603040 AA133710 AI928796 AW176370 AA827519 
AW338437 AA521142 T29341 AI800461 AW317002 AA703914 AA860830 AI859203 AI445772 AA714334 AI817066 
AI832027 AW510442 AI635802 AW088306 AW068672 AW408555 AW467542 AA552657 AA152367 W32081 AA582124 
AA074040 AA931657 AI051154 AW410203 AI921644 H17434 AI832330 AW404836 AI925038 AA088423 AA954166 
AA580453 AW021292 AI267215 AW080082 AW383778 AI933053 AI919097 W31557 N90245 AA931591 AA563995 
F36352 AA0561 84 AA476294 AA641 327 AA533550 AI749630 W58323 AA5691 19 AA508573 AI809050 AI378996 
AA411362 AW407505 AA938104 AA074041 AA632876 AW193748 AA507873 AI270128 AI472365 AA411363 AI523216 
AI719965 A1816302 AA182681 AI707990 AA133588 AI758537 W60253 AI460308 AA135423 AI083904 F04188 N89693 
AW408776 AI678595 AI270568 AA722059 W58234 F33550 M090547 AA285108 AA425981 N85079 D20218 AI273980 
AA159028 F03226 AW247914 N26918 AW272741 N90109 H05666 N23327 AW247953 R44748 AA962015 F03558 
AI752394 AW409913 AW248396 AI816463 AI752393 AA325370 AA263089 AI570130 AI971951 AI160658 AI357360 
AW168686 AL121075 AW050536 N21672 W67748 AA514242 AI127386 H14607 AI185752 W79364 AA088520 M152476 
AW351 940 AW373683 AI940524 AW374953 T56500 N24329 AI940720 AW374933 AW374947 AW391913 AL138337 
AW376241 AW062943 F26666 AW410202 AW062958 F34529 AW381807 AW393315 W17147 AW176359 AA664576 
AW380424 AA306040 AI745674 AW300951 AI188579 AI438973 A/305271 AA433818 AA612807 A/831809 AI940409 
AA158663 AI572988 

108931 genbankJ\A147186 AA147186 

103138 entrez X65965 X65965 

103432 entrezj<97748 X97748 

119174 genbank_R71234 R71234 

133678 1 1235 1 AW247252 AA346143 NMJJQ0270 AA381085 N91995 X00737 AA381079 AA296473 AA2961 10 AA315735 AA31 1617 

AA326750 AA376804 AW403290 T95231 M13953 T47963 H82039 M279899 AA627997 N76320 N99527 H37842 
W20095 AA457308 AW469547 AA724143 H83220 AA319496 W88334 W30892 R89169 R99427 N41854 H47286 
AA348094 AA045089 R63016 AI922219 AI024906 AI096488 AI885005 AA1 94872 N90489 AI452544 H7241 1 AA282427 
AA430735 R68963 R22453 H70385 AW129369 AW467320 AW519082 AA345018 AA582183 AI961789 R65918 N3061 1 
AI979189 AI280889 AW273191 R66531 AI285845 AI675927 AI421990 AW190879 H37794 AA699667 H68427 M954388 
AI188757 AI140048 AA430382 AI204151 AW247864 AA559099 AI431420 AA548276 AI149466 AA772669 AA694388 
AA724168 AA301651 M281952 AA779925 AA234760 W86290 AA913603 AW511745 AI500697 AA814922 AA835040 
T47964 H53998 AA975804 R98710 AI077604 N70252 R98084 AW250171 H69268 AI597614 AA970746 AA972548 
A13771 16 R62962 H16737 R8907O AA731329 R66532 N54354 AI818832 H81944 N71567 T95122 W86463 AA437095 
AI431999 AI915724 N63851 AI674743 AA457307 AA211475 N64444 AI799146 H72853 R99335 H60413 AA770367 
AA156105 AI269937 H64029 H89728 R65819 AW470496 AI873318 AI735713 H82987 C02447 AI478666 T27651 
AI699770 AW025156 H69719 AI984717 N69225 AI459856 AA953577 AI424691 H13843 R22404 AI873796 AI336002 
N70898 AI420854 AA541792 AA346142 AI000814 AI828348 AA045090 T51257 N90434 H13890 N73184 AI708083 
AA781606 AA329050 AA339985 R68964 H64795 W04186 H16845 

119416 genbank_T97186 T97186 

119559 NOT_FOUND_entrez_W38197 W38197 

123473 genbank_AA599143 AA599143 
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TABLE 5: 



Pkey: 
Accession: 
ExAccn; 
UnigenelD: 
Unigene Title: 



Unique Eos probeset identifier number 
Accession number used for previous patent filings 
Exemplar Accession number, Genbank accession number 
Unigene number 
Unigene gene title 



Pkey Accession ExAccn UniGene UnigeneTitle 

115819 AA426573 AA486620 Hs.41135 AA486620 

132837 D58024 AA370362 Hs.57958 AA370362 

101545 M31210 8E246154 Hs.154210 BE246154 

102898 X06256 NMJ02205HS.149609 NM_002205 

101192 L20859 BE247295 Hs.78452 BE247295 

102915 X07820 X07820 Hs.2258 X07820 

105330 AA234743 AW338625 Hs.22120 AW338625 

107385 U97519 NM_005397Hs, 16426 NM_005397 

102024 U03877 AA301867 Hs76224 AA301867 

134416 M28882 X68264 Hs.211579 X68264 

103036 X54925 M13509 Hs.83169 M13509 

104865 AA045136 T79340 Hs.22575 T79340 

106124 AA423987 H93366 Hs7567 H93366 

105330 AA234743 AW338625 Hs.22120 AW338625 

109001 AA156125 AI056548 Hs72116 AI056548 

104764 AA025351 AI039243 Hs.278585 AI039243 

133200 AA432248 AB037715 Hs.183639 AB037715 

105263 AA227926 AW388633 Hs.6682 AW388633 

105178 AA187490 AA313825 Hs.21941 AA313825 

109456 AA232645 AW956580 Hs.42699 AW956580 
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TABLE 5A 

Table 5A shows the accession numbers for those pkeys lacking unigenelD's for Table 5. The pkeys in Table 7 lacking unigenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which (he oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the 'Accession- 
column. 



Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 



Pkey 

115819 



CAT Number 

10241.1 



102024 14505.1 



101545 
109456 
103036 



24607J 
180633J 
17145 1 



133200 28960.1 



132837 256666.1 



102898 24023 J 



Accession 

AA486620 AF205940 AA297524 AB034695 AA081335 NM.016242 AA188323 AA297537 H88204 AW953081 W31695 
AW582203 AA248250 AW68121 1 AA426230 M464807 AA426155 N44141 AA347390 AA770661 AI333225 N36136 
AW665724 AA431894 AI374976 AI400254 AI338446 AA186695 H88205 W04527 AA487066 AI051414 AA918383 
AA426573 AA425620 AW438654 AA090513 BE167284 BE167291 AI301726 

AA301867AW957981 R27614 AA1 55808 AI920990 AI740711 AA301026 AA301015 AI220981 AI857670 AI537140 
AW015210 AA030000 W46890 H44021 AI355967 AI651735 AA058479 AA146932 T58265 R85890 AA047810 AA017387 
AW026093 AA971133 AI827263 AI056416 AI355994 AI127691 H46603 U03877 NM.004105 AA157357 H42844 
AA146824 AA187709 AA187269 AA304348 AA147292 AA361687 AA156041 AA33Q636 R32929 AA321130 AW950260 
AA082157AA029129 AA303708 AA028155 D31561 T84689 AA302493 BE153057 BE153181 W39408AA1 87200 
BE153250 AW383337 AW382622 AW382647 AW750072 BE153060 AW382630 AW371865 AW392464 AW382664 
AW382658 AW382650 H61647 AW365075 AW365049 AA373397 BE072779 BE072781 Z30254 W24381 BE153254 
AA040442 BE072729 BE072731 N94740 AA146945 AW802737 AI826799 AI085395 R34034 H65140 AA082800 H88275 
AA147824 R63882 W80899 AA29641 3 AI765300 AI862426 AW022055 AW3000O3 AI743784 AI862635 AI985428 
AA147764 AW573245 AW190290 AI040898 D57613 N63457 AA148082 AI028458 AA1481 1 0 AW814489 N75105 
AW629443 AA704122 AW582220 AA181240 M057495 AI418224 AI261751 AW388595 AI472205 AW470672 AA102546 
AA789046 AA182416 AA062668 AW300732 AI288220 AA181982 AA146825 AA028130 AI985522 AA303344 AA081313 
N69082 AA182035 AI867128 AA100902 AA605087 N67178 AW020324 AW890446 AI472191 AI335691 AI597837 
AI081 143 AI335681 AA040443 AM 28067 AI678244 AA018303 AA157260 W80792 AI934590 AI096430 T54343 AI446350 
AA165196 AA780683 AA603631 AA047787 AA968580 AA912645 AW890504 AW026913 D56983 H52088 AA156121 
R30848 AW023036 AI590960 N67345 AI753225 AI753283 AI183768 AA147818 H89101 AI362141 H89205 AI14771 1 
AA321 1 29 AA668622 AA343479 AW069438 AI422376 AW629270 AA013413 AI221948 AA970605 N52335 H38366 
T91180 AA657841 AA017386 AA152227 AA187593 AI913340 AI719313 AI969943 AI701271 AI004328 AI863348 N93659 
H65093 H25736 D57007 D56957 C00987 D61839 D56661 AI472137 AI971002 D56971 BE048830 D57972 AI589286 
AI361055 AI361071 AI292223 AA155898 D57139 D57981 D57345 AI420034 D57332 D57959 AA875933 R33493 N67558 
D58353 AA188394 AA147966 AI160640 AI363165 H40638 AA578137 AW950265 AA300943 AI128999 H46584 AA917355 
N57820 AA320504 H51959 H25737 

BE246154 M31210 NMJ01400 AA193392 NM.016537 AF233365 AF022137 H27787 AA370448 F05373 T27666 W21494 
AA036907 AI249966 N93476 F01623 AA304390 AA308808 

AW956580 AA886361 AI147670 AI0901 15 AI168683 AA232645 H99504 AA374707 AA380875 AW139567 AI735132 
BE439385 AW629780 N28322 AA232789 AA232790 N73285 

M13509 X54925 NM.002421 M16567 X05231 M15996 W39354 AA186634 AA852324 AA187507 AA081 149 AA186524 
AA187264 AA187361 AA386155 AA186973 AA374217 U78045 AA081230 M188049 AA186393 W56827 AA852602 
AA157468 AA308204 AA186754 AA186808 AA082516 AA304334 AW376428 BE439384 AW376420 AA1 56273 T 18504 
AA186521 W49496 AW084608 AA083575 AA372360 AW963590 AA132297 W47445 AA186376 AA157628 AW003999 
AI037890 A1858060 AI589010 AI743739 AI452673 AW304188 AW1 17854 BE439933 AA1 57416 AW778966 AI038497 
AA081006 AA100829 AA181048 C02231 T27821 W23960 AW954802 A1471432 AW801296 AW801289 AW801603 
AW801523 AW801292 AW801542 AW801601 AA181 134 AI445147 AA191501 AA582862 N94407 AI147810 AA181880 
W49497 W52714 AA188249 AI932881 AI082493 AA503656 AA182682 AW801393 AA182830 AA181882 AA182826 
AI613182 N94510 W47343 AI085755 AI076956 AI918426 AA081208 AI282835 AA147528 AI081490 AI654536 AA181875 
AA081282 AA186389 C06085 AA083542 AI800644 AA157642 M101069 AA157752 AA158121 AA143331 AA081283 
AA852603 AA188296 AI932880 AW449528 M187348 C02091 AA514656 AA082736 AA308786 AA143201 M16567 
AB037715 AI351347 AI375796 AI884765 AL121124 W01068 AI807275 T95240 R42807 AW515645 AI057314 AI033520 
AA057671 N70215 AA054215 AW204183 AA552149 T95130 AW796310 AI866520 AW275564 AW796308 AI637901 
AW197404 T78406 AA456232 AW206463 AA779800 AI052696 AA026744 AA454623 AW470729 R45490 AW770258 
AI038393 AI290170 AA722734 AL1 21 125 R41608 AI862414 AA83861 1 R45582 AI278083 BE466849 BE219944 
AA418030 BE041555 AA578572 T16528 AW006344 Z39782 AI244848 AW137344 AA707400 AI032028 BE540464 
AI094265 AI184281 AA931890 AW382744 AW382729 AW020448 AW827237 AA431226 AI672059 AW772345 N70172 
AW022003 AI862704 H19344 R6151 1 AI080204 H16566 AA432248 AI767980 T16688 AI984342 AI217478 A1767095 
Z38551 AI359566 AI361437 A1041000 R07033 H16608 H19054 R12874 R61567 N98368 BE221199 Z42320 AA094554 
R07078 AW860885 AA418090 R41262 

AA370362 AA3641 10 AW959554 AW371737 AW382068 AW604716 AW604713 AA487827 AW371674 AA429137 
BE503321 T93570 W72803 AI093076 AA487977 AI241562 BE439445 AW204065 R51635 AI802994 T10362 W68553 
A1866215 AW152154 AA70071 6 Al 127443 R15824 AI537587 AA9531 10 D58024 AI52081 1 AA693670 A1453280 W76329 
AW023955 AW022563 

NM L002205 X06256 M1391 8 BE070866 AW239485 AW996127 BE273894 BE272590 BE410252 R25975 T1 1786 T1 1787 
AA301142AA301165AW960506 BE272819 AA386086 T39391 AA285303 AA370580 D58585 T58668 AA156213 W24142 
AA343323 AW796067 AA151 197 AA376121 R94782 AA302363 H90357 R82621 AA301677 H55997 AW796059 W92358 
AL046458 AA471 198 AA301952 R46287 R82694 H03186 AA187706 R32562 R27094 R25947 R25320 AW949809 H13505 
H79049 R32403 H11213 R39710 H49765 H21142 H21006 AA417664 W52075 N56771 AA284240 N98556 N30907 
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AA707335 AW603781 AI340367 AI8 14584 AA5241 82 AA370076 AA418785 AA704082 A1806851 H25513 T56388 
AA419627 H03986 H20963 T56245 A1459715 AW973768 AI334096 AI693020 T63414 R82646 AW167251 H55998 
AI274916 M778367 AI755253 AI033667 AW083222 AA181979 R26865 AA661627 AA706329 AI798648 AA612799 
A1160180 AI274973 A1039264 AA301880 AI042429 AA307632 AI085688 AI278366 AI498890 AA303865 AI954844 
AA502380 AA156334 AA723480 AI803584 AI581026 M304584 N51038 R94702 R69814 AW150962 AI570049 AA588807 
AA151 198 T53400 AI567709 AI185326 AA309205 AW338969 R53903 AA991891 AA301643 AI493337 AI026049 H25514 
AI741075 R28632 AW166445 AI333068 H49978 H91267 AA55B193 AW079663 AA627380 AA807401 AI199956 AA666118 
AI718216 AW193228 AI077745 AI500496 AI266059 AW080383 R06468 R26757 R32404 AA716599 W92322 AI077734 
AI270181 R46198 AI217540 AA304045 AA305421 AW074445 AI468256 AW0B9568 AW571605 BE162930 H41009 
AA578313 AW874497 AA181284 AA861947 T29451 D20841 T58618 AA418731 AI282500 AW081407 AA604560 
AA729855 AI262538 AI580225 

X07820 NMJ02425 BE271570 AI263526 AW296143 AI829878 AI9731 62 AI085155 AA857496 AA709305 C02220 
X68264 NM 006500 AF08986B BE257461 BE275425 AW997154 AI902799 AI902803 M78206 AA085691 AW392972 
AA325490 BE006161 AA349269 AA323568 AL042548 AA191148 AA187703 AA322791 AJ297452 T1 1625 AW366487 
AA303513 AA186961 AA173480 N28330 N28379 W40320 AA187118 H03695 AA402709 BE407476 H06354 BE276589 
M351284 AA379921 AL138060 BE410587 AA1 13094 AA3404 81 BE277483 R21191 R79518 N86170 AA320505 
AA296065 AW951900 AA658897 M650052 AA654304 AA191691 N26649 AW080963 AI265800 N72019 AI453458 
AA092563 M402310 A1439450 AI061054 AA302358 T71566 AA302047 AA303432 N21289 H27357 AA303504 AI174583 
AW151762 AA181958 AW880618 AA630773 AI8B9539 AW901058 AI373405 AA341941 AA086217 AI675590 AI653936 
AA633570 AA987619 AI270656 N93847 N40689 AW517517 N20030 W95985 AA303955 H89170 AA309917 N21642 
AA373132 W38517 AI687806 W76182 AA101065 AA036916 N45635 AI744510 AI669803 AI039157 AH 26355 AA634607 
AW131120 AW196838 AA190601 AA911130 BE221320 N92355 AA036752 H03696 AA588873 AI458868 AI041818 
AA090477 AI093248 M304755 AL137942 AL044688 AI083709 A11 50965 N88891 AA635675 AA594898 W94657 
AA182823 AW166205 F27886 R79246 F37329 AA565697 A1075739 AI088654 AI094287 AI204256 AA095203 T93020 
AA688298 AA057324 N23442 AA07541 1 AA305046 AI031688 AI191503 AA1 1 1887 AA1 12264 N27929 AA187509 
AI375522 AI474006 H06297 AI826177 N48880 H28333 AA075490 R22809 W79542 AI055934 AA042901 AA173481 
AA301986 W74531 AI051747 AA187715 AI888888 AA993017 AI057530 T92954 N80227 AW273595 AI351260 AW170643 
AW292979 AA302605 AA302330 BE349495 AA328602 AA302361 AI470984 AA155943 M155914 
AA313825 AW960347 AF223468 NIVL016613 AA186345 AA186508 AA081 195 AA147972 AA346943 AW961667 
AA187222 AA187207 AW371052 AW449751 AW748803 AW391606 AW371047 AW371057 AW371085 AW362895 
AW371092 AW377556 BE01 0930 At01 6882 AA247878 C04398 C05158 F11398 AA188315 H23385 R55086 H15346 
AA029106 AA228114 H17005 F08498 Z43376 AA095582 AA055186 AA463361 R15218 AA29 91 32 AW 103578 W21538 
AA428131 AA187115 AA157197 AA157167 AW371371 AA363562 AW965995 N55663 Z17878 AA228023 AI140342 
AA100927 AA496988 AA055917 AI089303 AW014967 AW090248 AW338371 AW131066 D62963 D79713 AI583950 
AI336781 AI500705 AI471485 AW090239 D79784 D61847 D62789 D61842 AI086327 AI273381 D61815 D63043 AI913548 
AI280560 AI510828 AA029996 C16343 C16513 AI075741 AW516308 AI804764 AA948068 AI356588 AW103452 
AW573063 Z39445 C16489 AI949870 F04712 AA147823 AW026284 AI151538 AA081303 AA613890 AI251865 AW036499 
AA9921 1 1 AI862091 AI373465 BE502094 AI922270 AA884288 AA157079 N56963 AW1 89145 AA428080 R55056 
AA684068 AW771716 AA186662 C16364 H1 5723 AI921 181 AA156888 H17006 AA 187490 A 14 00994 AA346942 H28533 
AW129047 R41656 H14636 AA995041 D58370 Z21131 D58186 AI383271 AA643977 D58044 AI934302 AW779425 
F09065 H14930 AA890693 H23274 

AW388633 AW378440 AW388283 AW388339 AW388333 AW388414 AW388413 AW388607 AW388453 AW388687 
AW388480 AW388591 AW38871 1 AW38851 1 AW388438 AW388570 AW388449 AI694383 AW237145 AI652991 
AI964041 AW366319 AW366321 AW961938 AW46921 1 AI634155 AI492186 AI624430 AI677965 N26502 AI963871 
AW378431 AW378421 AI015391 AW352126 N59336 AI352317 AW197113 N6799S AW778935 AI476054 AI206626 
R371 16 R4021 1 AA227926 AA639698 R38073 AI001745 T32854 AI619649 AI423703 F 10774 AW388615 T16595 H05894 
AW338625 R43226 R51640 AI307645 AI308100 AI085787 AI420357 AI692610 AA877160 AI953366 A4234743 
AI039243 R68234 AA025351 AA971063 AI537757 AA025362 R81636 T86650 

T79340 AI742317 AW182676 AW451460 AI420964 R43284 AA088179 AW590886 AW269529 M045187 AI521736 
AI827455 AA045136 AW271709 AI004344 AA639631 AA744417 AA744218 AA045357 AA045351 
H93366 AI653547AA336265AW966175 BE566451 R71178 AI630656 AA234331 N5 5039 AA3 05632 AW960431 R34044 
R32254 AW020970 AW451281 AW275041 AI636933 AI655640 AA423986 AA642466 AI684063 AI633876 A1624897 
AA814795 AW590328 AI889166 AW243541 AI439691 AW473445 AI475516 AA741228 AI127534 AA165143 AI074714 
AI654076 AA400674 AI560249 N50709 AW438621 AI806810 AI434579 AI3031 84 AA423987 AI141272 AI565586 
AI338440 AA219628 AI246643 AI985809 AA724260 AA633988 AI364172 AI798439 AI650801 R33503 A1435891 
AA903649 T96161 AA665538 AA219620 AI309962 AA400707 BE247066 R32178 AI275962 AA661602 AW003197 
BE466649 AA831 198 AI620052 AI825387 AI634037 AI670978 AI670979 AI655092 R32304 AA828858 AI382428 
AW023660 AA262892 T26891 AW089917 T26926 R32227 

NM_005397 U97519 AW899329 A1902387 AA077792 AA078525 AW376607 AA077946 AA070415 BE208721 AW167958 
BE293050 BE20B240 A1648698 AA101314 BE393348 BE305122 AA077591 BE274036 M313687 BE392220 BE378954 
AA171461 AA464821 AW938242 AW938224 AW938243 AW938232 AA147953 N64294 AA205218 AW305065 AW517478 
AA307983 AA377023 BE563629 R99976 N80294 T87719 T87928 AA496849 AA486344 AA204938 AW370448 AA318242 
AW964384 H92423 W95317 BE378774 BE391156 AA3491 38 AA1 73095 AW513198 AA037672 AA148029 M169726 
W04791 AA075508 BE382937 BE395034 AF139793 AA961734 N48612 H64714 AW151251 AI555113 AI566881 
AW087370 AA631168 AA622014 AW513098 AI857810 AW1 52287 AI052596 AI983246 AA024856 AI912456 AI677938 
AW026403 AA972537 A1088497 AW999869 W94582 AI140166 AI160659 AI566868 AA101263 AW190390 AW166466 
AI401207 AI416156 A1625265 AI146298 AW008592 BE223020 N58926 AI308797 AA037673 AI935992 AI304706 
AA024939 AI216589 AI610423 AI354621 AI500677 AI679389 A/799310 N64508 AI128756 AI679897 AW589535 
AA989333 AI500527 AA565479 AA913529 AI923295 F21691 M989376 AI699064 AA902447 AI69091O AA772659 
AA204983 AI337895 R99975 H65205 AA340766 AI339441 AI913855 AA450293 AW192010 AA070416 N72401 AI371481 
AI247108 AI371261 AI364987 AI280171 AI269104 AI868756 AA909836 AA983640 AI973271 AA913092 AI868205 
AI1441 12 AI190975 N58085 AI566638 N93405 AW150504 AW296846 AI687036 AA902984 AI824460 AI625047 AA653148 
AI61 1228 AW131922 AA862687 AA902519 C01732 AW798045 AL044660 

BE247295 AWQ68092 AL041313 AA159244 NMJ05415 L20859 AL135570 W47073 AW516906 BE388271 BE408629 
W46972 BE293646 BE256647 A(075010 AL041095 AA285300 AL039560 AA368740 W26602 AA399344 M039235 
W27631 AW834898 AW834914 R93390 AA378039 AV649660 T53674 N98824 M399974 AW843378 AA368267 R08256 
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AV653575 R27900 N48215 AW366371 N455Q0 AV652967 AIB89251 AI080457 N39021 AI738542 AW242849 AI857471 
AI859775 AI582830 R75850 N66564 AW341636 AI499006 AI887217 AW026694 AW182840 AA039313 AA831346 
AI393465 AW069210 AI743830 AA744243 AA401310 AW439758 AW088152 R93391 AA291379 AA225220 AW009358 
AI192879 AA291 202 AI565089 AA225089 AA807688 AI052058 AI341641 AI066625 AA333864 AA159147 AI923912 
5 R75851 AI761 143 AW768588 AA394195 AI288450 AW512564 AI452775 AI05652O AA468602 AA872566 AI434739 

AA291838 AI948623 AW768614 AI374753 AW068174 AA884908 AI199346 AM99347 W94946 AI159995 AA877642 
AI280646 A1307610 AA403310 R08205 AW182123 AI000999 R27808 AW026571 D20816 AI560350 T27667 AW960271 
AI174628 AI432042 A1424528 AA909562 T17342 AI783866 
109001 146370 J AI056546 AW409843 AW263540 AA723669 AA909334 AA156120 AA157141 AA156125 AW409866 W19499 AA157229 
10 AW887435 
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TABLE 6: 

Pkey: Unique Eos probeset identifier number 

ExAccn: Exemplar Accession number, Genbank accession number 

UnigenelD: Unigene number 

Unigene Title: Unigene gene title 

AUC1 : 70 th percentile of average intensity (Alj for probeset at each 0(2,6,15,24,48, and 96 hour timepolnts minus 70* percentile Al at 0 hrs, 

summed over 5 experiments. 

AUC2: AUC1/90» percentile of Al for aorta, aortic valve, vein, and artery. 



Pkey Ex-Accn UnigenelD UnigeneTitle 


AUC1 


AUC2 


314941 AA515902 Hs.13065O ESTs 


1038 


9 


327414 predicted exon 


303.2 


30.3 


321911 AF026944 Hs.293797 ESTs 


429.2 


42.9 


331578 AI246482 Hs.249989 ESTs 


677.4 


10.3 


332466 AB018259 Hs.118140 KIAA0716gene product 


395.2 


39.5 


313513 AW298600 Hs.141840 ESTs, Weakly similar to S59501 interfero 


324 


32.4 


320635 N50617 Hs.80506 small nuclear ribonucleoprotein polypept 


394.8 


39.5 


326230 predicted exon 


357.2 


35.7 


313556 AA628517 Hs.1 18502 


433.6 


12 


313665 AW751201 Hs.120932 ESTs 


-83 


0.5 


324852 AI380792 Hs.135104 ESTs 


348.2 


34.8 


314372 AL040178 Hs.142003 ESTs, Weakly similarto The KIAA0149 gen 


■492 


0.5 


311877 AA084248 Hs.85339 G protein-coupled receptor 39 


-1309 


0.2 


322262 AA632012 Hs.188746 ESTs 


-247.8 


1 


312173 AI821409 Hs,304471 ESTs, Highly simitar to AF1 16865 1 hedge 


-1025.8 


1 


319795 AB037821 Hs.146858 protocadherin 10 


203.6 


5.2 


313350 AW591949 Hs.57958 ETL protein 


183.8 


18.4 


326759 predicted exon 


1654.4 


1.2 


300318 AW444502 Hs.256982 ESTs, Highly similar to AF1 16865 1 hedge 


-346 


1 


313978 AI870175 Hs.13957 ESTs 


576.6 


2.3 


306840 AI077477 Hs.307912 EST 


56.4 


0.4 


310272 AF216389 Hs.148932 semaphorin Rs, short form 


-127.6 


0 


315044 BE547674 Hs.204169 ESTs 


-102.6 


0 


321325 AB033100 Hs.300646 KIAA protein (similar to mouse paladin) 


1080.6 


4.8 


303251 AF240635 Hs.1 15897 protocadherin 12 


1270.8 


5.3 


302378 AL109712 Hs.296506 Homo sapiens mRNA full length insert cDN 


915.8 


15.8 


315060 AA551104 Hs.189048 ESTs, Moderately similar to ALUC^HUMAN I 


1236.8 


4.9 


332048 AW337575 Hs.201591 ESTs 


522.6 


4.7 


337214 predicted exon 


269 


26.9 


311598 AW023595 Hs.232048 ESTs 


796.4 


20.2 


304782 AA582081 gb:nn32h08.s1 NCI_CGAP__Gas1 Homo sapiens 


3164 10.5 


312802 AA644669 Hs.193042 ESTs 


349.6 


7.6 


302680 AW192334 Hs.38218 ESTs 


638.6 


63.9 


317452 AA972965 Hs.135568 ESTs 


360.8 


36.1 


318558 AW402677 Hs.146381 RNA binding motif protein, X chromosome 


700.2 


6.6 


312149 T90309 Hs.269651 ESTs 


274.2 


7.5 


319267 F11802 Hs.6818 ESTs 


238.2 


23.8 


321510 H75391 Hs.255748 ESTs 


231.8 


23.2 


326198 predicted exon 


581.6 


8.2 


315730 H25899 Hs.201591 ESTs 


281.6 


9.7 


310442 AW072215 Hs.208470 ESTs 


-213 


0.3 


331237 W87874 Hs.25277 hypothetical protein FLJ21065 


285 


0.5 


300469 BE301708 Hs.233955 hypothetical protein FLJ20401 


26.6 


0.3 


338316 predicted exon 


1494.2 


34.7 


330968 R44557 Hs.23748 ESTs 


975.8 


1.8 


331019 NM_006033Hs.65370 lipase, endothelial 


201.2 


0.9 


331261 BE539976 Hs.103305 Homo sapiens mRNA; cDNA DKFZp434B0425 (f 


478.61.3 


301822 X17033 Hs.271986 integrin, alpha 2 (CD49B, alpha 2 subuni 


356.2 


1.7 


325544 predicted exon 


1014.6 


9.4 


328700 predicted exon 


627.4 


62.7 


322882 AW248508 Hs.279727 Homo sapiens cDNA FU 14035 fis, clone HE 84.8 


5.7 


336034 predicted exon 


782.6 


78.3 


316580 AA938198 Hs.146123 hypothetical protein FLJ 12972 


746.4 


13.8 


309931 AW341683 gb:hd13d01.x1 Soares_NFLJ"_GBC_S1 Homo s 


134.813.5 


330692 R39288 Hs.6702 ESTs 


137 


13.7 


319962 H06350 Hs.135056 Human DNA sequence from clone RP5-850E9 


14.6 0.5 


338033 predicted exon 


540.6 


14 


314943 Y00272 Hs.184572 cell division cycle 2, G1 to S and G2 to 


-494.8 


1 


332640 BE568452 Hs.5101 protein regulator of cytokinesis 1 


-600 


1 


338158 predicted exon 


311.2 


31.1 


327036 predicted exon 


351.8 


35.2 
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302655 AJ227892 Hs.146274 ESTs 180.2 18 

327568 predicted exon 229 22.9 

324801 AW770553 Hs.14553 sterol O-acyltransferase (acyl-Coenzyme 161.2 16.1 

317850 AI681545 Hs.152982 hypothetical protein FLJ13117 -690 1 

5 322818 AW043782 Hs.293616 ESTs 126.4 4.5 

324626 AI685464 Hs.292638 ESTs 170.2 17 

317224 X73608 Hs.93029 sparc/osteonectin, cwcv and kazaHike d -80 0 

310955 AI476732 Hs.263912 ESTs 466.8 46.7 

315240 R38772 Hs.172619 KIAA1 106 protein 277 27.7 

10 338388 predicted exon 267.6 26.8 

338442 predicted exon 256 25.6 

318617 AW247252 Hs.75514 nucleoside phosphorylase 1247.8 24.2 

338645 predicted exon 206 20.6 

313135 N58907 Hs.162430 ESTs 204.8 20.5 

15 324716 BE169746 Hs.12504 hypothetical protein DKFZp761D081 203.6 20.4 

330305 predicted exon 199.8 20 

308248 AI560919 gb:tq41g10.x1 NCLCGAPJJM Homo sapiens 199.4 19.9 

308886 AI833240 gb:at76d10.x1 Barstead colon HPLRB7 Homo 198.2 19.8 

315622 AI796144 Hs.258188 Homo sapiens cDNA FU1 1674 fis, clone HE 191.2 19.1 

20 323675 R43240 Hs.272 168 tumor differentially expressed 1 189.2 18.9 

312164 T91980 Hs.221074 ESTs 187.6 18.8 

300378 Z45270 H s. 235873 hypothetical protein FU22672 271.6 18.7 

317478 AI343569 Hs. 107000 Homo sapiens mRNA for WDC1 46, complete c 187 18.7 

317559 AW452344 Hs.129977 ESTs 184.2 18.4 

25 317207 AI873346 Hs.214505 ESTs 182.8 18.3 

334834 predicted exon 178.8 17.9 

320925 D62892 gb:HUM337C07B Clontech human aorta polyA 177.217.7 

303289 AL121460 Hs.272673 hypothetical protein FU20508 316.4 17.6 

32854S predicted exon 174.6 17.5 

30 317108 AA884000 Hs.8173 hypothetical protein FLJ1 0803 172.4 17.2 

318013 AI188183 Hs.144078 ESTs 326 17.2 

314299 AW382682 Hs.154840 ESTs 170.8 17.1 

317702 AW173339 Hs.135665 ESTs 169.8 17 

316094 AW975920 Hs.283361 ESTs 169.4 16.9 

35 323706 AA377578 Hs.65234 hypothetical protein FLJ20596 169.2 16.9 

325843 predicted exon 321.4 16.9 

316012 AA764950 Hs.1 19898 ESTs 1047.2 16.9 

309687 AW236154 Hs.77385 myosin,lightpolypeptide6,alkall,smoothmu 168.2 16.8 

323329 AL134744 Hs.10852 ESTs 168 16.8 

40 312853 W05086 Hs.114256 ESTs 167.4 16.7 

313070 AI422023 Hs.161338 ESTs 298.6 16.6 

314096 AW977642 Hs.291742 ESTs 165.6 16.6 

338728 predicted exon 165.4 16.5 

316609 AW292520 Hs.122082 ESTs 165 16.5 

45 305989 AA888220 gb:oj15h01.s1 NCi_CGAP_Kid5 Homo sapiens 164.616.5 

312642 AW052128 gb:wx26c02.x1 Nc7_ CGAP_Kid1 1 Homo sapien 164 16.4 

339236 predicted exon 163.6 16.4 

317058 AI217713 Hs.147586 ESTs 161.8 16.2 . 

311137 AW207582 Hs.196042 ESTs 582.2 16.2 

50 310178 AI936450 Hs.147482 ESTs 161.2 16.1 

320745 H51696 Hs.89278 hypothetical protein FLJ1 1 186 161 16.1 

317336 AW014637 Hs.130212 ESTs 160 16 

309871 AW300366 gb:xs63b05.x1 NCI_CGAPJ<id11 Homo sapien 159.816 

302038 AC004076 Hs.129709 Homo sapiens chromosome 19, cosmid R3021 159 15.9 

5 5 332237 N52883 Hs. 102676 EST 159 15.9 

312362 AW015994 gb:UI-H-BI0p-abh-g-09-0-Ul.s1 NCLCGAP_S 158.6 15.9 

331558 N62401 Hs.48531 EST 158.6 15.9 

316215 A1684535 Hs.200811 ESTs 158.4 15.8 

336059 predicted exon 157.4 15.7 

60 302790 AJ245245 gb:Homo sapiens mRNA for immunoglobulin 155.8 15.6 

328418 predicted exon 153.8 15.4 

304229 AK000149 Hs.29493 hypothetical protein FLJ20142 153.6 15.4 

331606 AW273285 Hs.50802 ESTs 153 15.3 

338962 predicted exon 664.4 15.3 

65 317959 AI204202 Hs.130264 ESTs 152.6 15.3 

336228 predicted exon 152.4 15.2 

313534 AW072916 Hs.78743 zinc finger protein 131 {clone pHZ-10) 152.2 15.2 

317404 AI806867 Hs.126594 ESTs 152.2 15.2 

311943 A1469911 Hs.26498 hypothetical protein FLJ21 657 152 15.2 

70 314680 AI247425 Hs.152182 ESTs 151.4 15.1 

331484 N29696 Hs.44076 EST 151.2 15.1 

338116 predicted exon 151.2 15.1 

329863 predicted exon 150.6 15.1 

315555 AW452886 Hs.239107 ESTs 149.6 15 

75 317039 M868583 Hs.126153 ESTs 149.6 15 

331138 R63816 Hs.28445 ESTs 149.6 15 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



f\ a ^ ft a 

316561 


x i jA A mm A I* A\ f\ A f\m* m* r> J~L ■ 

AI917222 Hs, 121 655 ESTs 


149.4 


14.9 


0\ AA #» ftm* 

328695 


predicted exon 


149.2 


14.9 


j-l <N /-w 

302282 


BE396283 Hs.173987 eukaryotic translation initiation factor 


148.4 


14.8 


318781 


i—* a a ft ft ft it ft ft A ft ^ >*mm 

F11802 Hs.6818 ESTs 


A\ M 0h ttm 

148.2 


A\ A ft 

14,8 


323709 


AW297246 Hs.288546 Homo sapiens cDNA FU14190 fis, clone NT 


148 


A M 0% 

14.8 


310790 


AW192063 Hs.248865 ESTs 


147.8 


14.8 


316833 


AW292614 Hs.124367 ESTs 


147.8 


A A ft 

14.8 


323176 


NM_007350Hs.821 01 pleckstrin homoiogy-like domain, family 


229 


14.8 


324188 


AW274439 Hs.252709 ESTs 


147.6 


14.8 


317441 


AA922798 Hs.196583 ESTs 


147.4 


14.7 


317584 


AI825890 Hs.220513 ESTs 


146.8 


14.7 


321798 


AI308206 Hs.181959 ESTs 


146.8 


14.7 


304363 


AA206045 gb:zq77fD5.s1 Stratagene hNT neuron (937 


146.6 


14.7 


313952 


F20956 gb:HSPD05390 HM3 Homo sapiens cDNA clone 


146.614.7 


301909 


AI702609 Hs.15713 ESTs 


263.8 


14.7 


309196 


AI904895 Hs.9614 nucleophosmin (nucleolar phosphoprotein 


146.2 


14.6 


321860 


N47474 Hs.212631 ESTs 


146.2 


14.6 


330187 


predicted exon 


146 


14.6 


323042 


AA463571 Hs.172550 polypyrimidine tract binding protein (he 


145.6 


14.6 


313636 


AA262397 Hs.201366 ESTs 


145.2 


14.5 


302437 


AB024729 Hs.227473 UDP-N-acety!glucosamine:a-1,3-D-mannosid 145 


14.5 


318197 


A1473096 Hs.133403 ESTs 


144.8 


14.5 


302749 


M16951 gb:Human Ig mu-chain mRNA VDJ4-region, 5 144.6 


14.5 


322357 


AI734258 Hs.245367 ESTs, Weakly similar to ALULHUMAN ALU S 


144.614.5 


300391 


AI927371 Hs.288839 hypothetical protein FL J 121 78 


144.4 


14.4 


326077 


predicted exon 


144.4 


14.4 


302004 


Y18264 Hs.123094 sal (Drosophila)-like 1 ' 


144 


14.4 


320668 


AA805666 Hs.146217 Homo sapiens cDNA: FLJ23077 fis, clone L 


144 


14.4 


331212 


T88693 Hs.226410 ESTs 


144 


14.4 


311268 


AI969727 Hs.231859 ESTs 


143.2 


14.3 


305159 


AA659166 Hs.275668 EST,WeaklysimilartoEF1D_HUMANELONGATIONF 


143 14.3 


304510 


AA457391 Hs.1 19122 ribosomalproteinL13a 


142.8 


14.3 


320852 


AA772920 Hs.303527 ESTs 


142.8 


14.3 


330854 


AW291944 Hs.1 22139 ESTs 


142.8 


14.3 


318275 


AW449952 Hs.190125 basic-helix-loop-helix-PAS protein 


142.6 


14.3 


314992 


AI824879 Hs.211286 ESTs, Weakly similar to 1207289A reverse 


142.2 


14.2 


322631 


AA001697 Hs.293565 ESTs, Weakly similar to putative p1 50 [H 


142.2 


14.2 


332283 


R40855 Hs.100839 EST 


142 


14.2 


302894 


AA719572 Hs.274441 Homo sapiens mRNA; cDNA DKFZp434N011 (fr 


141.214.1 


301808 


R35391 Hs.252831 reticulon 3 


141 


14.1 


318608 


AI204491 Hs.151502 ESTs 


141 


14.1 


316499 


AW292947 Hs.1 22872 ESTs 


140.8 


14.1 


317011 


AI248760 Hs.150276 ESTs 


140.8 


14.1 


321840 


N45600 Hs.46534 Homo sapiens mRNA; cDNA DKFZp434P0714 (f 


140.8 14.1 


327365 


predicted exon 


140.8 


14.1 


331264 


AA278898 Hs.225979 hypothetical protein similar to small G 


140.8 


14.1 


324545 


AW501944 Hs.1 27243 Homo sapiens mRNA for KIAA1 724 protein, 


140.4 


14 


312986 


AA211586 gb:zn56d05.s1 Stratagene muscle 937209 H 140.2 


14 


316053 


AA825814 Hs.149065 ESTs 


140.2 


14 


330723 


BE247449 Hs.31082 hypothetical protein FL J 10525 


140.2 


14 


304876 


AA595765 gb:nj28g06.s1 NCI_CGAP_AA1 Homo sapiens 


139.814 


311379 


AW1 34766 Hs.202450 ESTs 


139.8 


14 


318265 


AW019873 Hs.146840 ESTs 


139.8 


14 


324137 


AA393127 Hs.222762 ESTs 


139.8 


14 


328262 


predicted exon 


139.6 


14 


322349 


AK001279 Hs.18Q171 Homo sapiens cDNA FLJ1 0417 fis, clone NT 


139.4 


13.9 


323504 


AA280223 Hs.1 30865 ESTs 


139.4 


13.9 


304261 


AA059387 gb:zf66d01 .s1 Soares retina N2b4HR Homo 


139.2 


13.9 


310489 


AW451493 Hs.235516 hypothetical protein PR02955 


139.2 


13.9 


335946 


predicted exon 


139.2 


13.9 


318155 


AI041546 Hs.132133 ESTs 


138.8 


13.9 


313796 


AI797169 Hs.208486 ESTs 


138.6 


13.9 


333977 


predicted exon 


138.6 


13.9 


324845 


AW969635 Hs.283718 ESTs 


138.2 


13.8 


331139 


R65706 gb:yi16g12.s1 Soares placenta Nb2HP Homo 138.2 


13.8 


331131 


R54797 gb:yg87b07.s1 Soares infant brain 1NIB H 


669.6 


13.8 


321250 


H58539 Hs.151692 ESTs 


138 


A ft ft 

13.8 


312498 


AA668782 Hs.191284 ESTs. Weakly similar to ALU LHUMAN ALU S 


137.813.8 


331252 


W52470 Hs.34578 aipha2,3-siaryltransferase 


137.8 


13.8 


337407 


predicted exon 


137.8 


13.8 


303973 


AW512014 gb:xx68a03 jc1 NCL.CGAPJ.ym1 2 Homo sapien 


137.4 13.7 


314582 


AA412258 Hs.138817 ESTs 


137.4 


13.7 


327373 


predicted exon 


137.2 


13.7 


323367 


AA234591 Hs.304123 ESTs 


136.6 


13.7 


316207 


AA832065 Hs.120260 ESTs • 


136.4 


13.6 


315231 


AA705809 Hs.1 19922 ESTs 


136.2 


13.6 
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318592 T39310 Hs.1139 cold shock domain protein A 136.2 13.6 

320906 AW969706 Hs.293332 ESTs 136.2 13.6 

328937 predicted exon 136.2 13.6 

329073 predicted exon 136.2 13.6 

5 318231 AV659082 Hs.134228 ESTs 136 13.6 

311992 AL360200 Hs.114145 ESTs 135.8 13.6 

316497 AA766457 Hs.136849 ESTs 135.8 13.6 

317677 AA968594 Hs.127868 ESTs 135.8 13.6 

321680 W02848 Hs.93704 ESTs 135.8 13.6 

10 326080 predicted exon 135.8 13.6 

330938 AF036943 Hs. 17261 9 KIAA1 106 protein 135.8 13.6 

306573 AL134878 Hs.1 19500 ribosomal protein, large P2 135.6 13.6 

307383 AI223207 Hs.147888 EST ■ 135.6 13.6 

311114 AW449382 Hs.195297 ESTs 135.6 13,6 

15 320579 R15138 Hs.165570 Homo sapiens clone 25052 mRNA sequence 135 13.5 

301328 AA884104 Hs.125546 ESTs 134.8 13.5 

312063 N58198 Hs.182898 ESTs 134.8 13.5 

323036 H09604 Hs.13268 ESTs 134.6 13.5 

332776 AF241850 Hs.1 51428 ret finger protein 2 134.4 13.4 

20 332494 AA282330 Hs.145668 ESTs 134.2 13.4 

334376 predicted exon 134.2 13.4 

313264 N93416 Hs.118228 ESTs 133.6 13.4 

313669 AA351109 Hs.5437 Taxi (human T-cell leukemia virus type I 133.2 13.3 

312083 T87398 Hs.205816 ESTs 132.6 13.3 

25 319354 AA993807 Hs.167367 ESTs 132.6 13.3 

307414 AI242106 gb:qh92aQ2.x1 SoaresJIFL.J_GBC_S1 Homo s 132.2 132 

312771 AA018515 Hs.264482 Apg12 (autophagy 12,S.cerevisiae)-like 1318 13.2 

313004 AI274963 Hs.145900 ESTs 131.2 13.1 

300995 AW510641 Hs.258018 ESTs 220.6 13 

30 319323 F12650 Hs.13287 ESTs 125.4 12.5 

329451 predicted exon 123.4 12.3 

337603 predicted exon 572 12.2 

312480 R68651 Hs.144997 ESTs 121.4 12.1 

324934 AW452051 Hs.147546 ESTs 119,4 11.9 

3 5 320723 BE178025 Hs.7942 hypothetical protein FLJ20080 117 11.7 

318188 AI792566 gb:qi74f02.y5 NCI_CGAP_Ov26 Homo sapiens 116.611.7 

320873 AF238869 Hs.283955 Homo sapiens clone GLSH-2 similar to gli 112.8 11.3 

331005 BE003191 Hs.119555 ESTs 112.6 11.3 

304969 AA614406 gb:np46f05.s1 NCLCGAP_Br11 Homo sapiens 112.4 11.2 

40 319799 AI139253 Hs.227767 zinc finger protein 41 111.2 11.1 

302610 AA347945 Hs.256024 ESTs 111 11.1 

309485 AW130320 Hs.108124 ribosoma!proteinS4,X-linked 111 11.1 

311880 AVV419225 Hs.256247 ESTs 110.2 11 

313981 AW452334 Hs.128148 ESTs 110.2 11 

45 322442 W49701 Hs.29667 ESTs 109.4 10.9 

315099 AA806536 Hs.291841 ESTs 109 10.9 

304793 AA583264 Hs.182979 ribosomalproteinL12 108.8 10.9 

330815 AA019211 Hs.236463 KIAA1 238 protein 108.8 10.9 

304044 T81656 Hs.252259 ribosomal protein S3 714.8 10.8 

5 0 325222 predicted exon 135 10.8 

325889 predicted exon 814.6 10.8 

321447 AW891130 Hs.38173 ESTs 107.8 10.8 

302990 AA496212 Hs.180182 ESTs 106.2 10.6 

308106 AI476803 gb:tj77e12.x1 Soares_NSF_F8_9W_OT.PA^P_S 270.6 10.6 

55 310536 AI301041 Hs.150174 ESTs 106 10.6 

315257 AW157431 Hs.248941 ESTs 233 10.6 

318787 Z42313 Hs.22657 ESTs 105.8 10.6 

312305 AI927226 Hs.175610 ESTs 105.2 10.5 

326788 predicted exon 104.4 10.4 

60 312234 AA830640 Hs.206934 ESTs 104 10.4 

314482 AW085525 Hs.134182 ESTs 234 10.4 

323597 AI185693 Hs.135119 ESTs 102.4 10.2 

302623 AW836724 Hs.194110 hypothetical protein PRO2730 162.4 10.2 

323594 AI791531 Hs.129993 ESTs 101 10.1 

65 324315 N55761 Hs.194718 zinc finger protein 265 100.2 10 

314217 AA256465 Hs.188725 ESTs 99.2 9.9 

320932 AA554913 Hs.162297 ESTs 98.2 9.8 

327876 predicted exon 98.2 9.8 

319736 R17424 Hs.6650 vacuolar protein sorting 45B (yeast homo 98 9.8 

70 327747 predicted exon ' 97.6 9.8 

327844 predicted exon 97.4 9.7 

318200 AI061192 Hs.166517 ESTs 97.2 9.7 

329414 predicted exon 97.2 9.7 

318296 AI089667 Hs.270713 ESTs 121.4 .9.7 

75 307010 AJ140014 gb:qa68fD9.x1 Soares_fetaLneart_NbHH19W295 9.7 

319792 A1138635 Hs.22968 ESTs 385,4 9.6 
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305671 AA811688 Hs.82113 dUTPpyrophosphatase 96 9.6 

329440 predicted exon 93.8 9.4 

310381 AI263059 Hs.145594 ESTs 93.4 9.3 

318824 F06771 Hs.27226 ESTs 93.4 9.3 

5 328957 predicted exon 92.2 9.2 

318804 Z42549 Hs.160893 ESTs 92 9.2 

330836 AA055611 Hs.226568 ESTs, Moderately similar to ALU4JHUMAN A 92 9.2 

324592 AW752437 Hs.325708 ESTs 91.8 9.2 

311820 AW274545 Hs.254333 ESTs 91.4 9.1 

10 321614 H86161 gb:ys94b01.r1 Soares retina N2b5HR Homo 91 9.1 

330306 predicted exon 91 9.1 

303096 AL080276 Hs.268562 regulator of G-protein signalling 17 90 9 

313275 AI027604 Hs.159650 ESTs 110.4 8.8 

302593 H54855 Hs.36958 ESTs 88 8.8 

15 321421 BE465115 Hs.171688 ESTs 86.2 8.6 

330832 AI133530 Hs.62930 ESTs 456.4 8.6 

311847 AW301807 Hs.297260 ESTs 86 8.6 

322036 BE002723 Hs.301905 Homo sapiens cDNA FL J 14080 fis, clone HE 145.8 8.6 

328688 predicted exon 85.6 , 8.6 

20 325251 predicted exon 85.4 8.5 

329088 predicted exon 85.4 8.5 

322524 W79027 Hs.271762 ESTs 84 8.4 

337953 predicted exon 451 8.3 

323529 AA284397 Hs.201485 Homo sapiens clone FLC0664 PR02866 mRNA, 82.6 8.3 

25 307041 AI144243 gb:qb85b12.x1 SoaresJetaLhearLNbHH19W 306.88.2 

318285 AI332454 Hs.158412 ESTs 81.4 8.1 

312021 AA759263 Hs.14041 ESTs 81 8.1 

329350 predicted exon 81 8.1 

326169 predicted exon 80.4 8 

30 338038 predicted exon 1024.2 7.9 

312549 AI214510 Hs.146304 ESTs 77.4 7.7 

312542 D60076 gb:HUM084E1QAC!ontech human fetal brain 76.8 7.7 

320992 AB026891 Hs.225972 solute carrier family 7, (cationic amino 76 7.6 

318596 AI470235 Hs.172698 EST 150.6 7.5 

35 315650 AA649042 Hs.269615 ESTs 73.4 7.3 

324328 AA447276 Hs.292020 ESTs 210.4 7.1 

332622 R10674 Hs.128856 CSR1 protein 70.2 7 

328229 predicted exon 69.4 6.9 

319110 T75260 Hs.98321 hypothetical protein FLJ 141 03 68.6 6.9 

40 316133 AI187742 Hs.125562 ESTs 308.6 6.9 

303992 AW515800 gb:hd88g01.x1 NCI_CGAP_GC6 Homo sapiens 67.8 6.8 

322675 AA017655 Hs.146580 enolase 2, (gamma, neuronal) 377.2 6.7 

325753 predicted exon 105.2 6.6 

312539 AI004377 Hs.200360 Homo sapiens cDNA FLJ 13027 fis, clone NT 92.2 6.4 

45 302592 AA294921 Hs.250811 v-ral simian leukemia viral oncogene horn 361.6 6.3 

314578 AA410183 Hs.137475 ESTs 201.6 6.1 

335986 predicted exon 108.6 6 

321478 AW402593 Hs.123253 hypothetical protein FLJ 22009 528 6 

305192 AA666019 gb:ag44a04.s1 Jia bone marrow stroma Horn 58.6 5.9 

5 0 304275 AA070605 gb:zm53h09.s1 Stratagene fibroblast (937 78.6 5.6 

302779 AJ235667 gb:Homo sapiens mRNA for immunoglobulin 278.8 5.5 

301976 T97905 Hs.77256 enhancer of zeste (Drosophila) homolog 2 479.2 5.4 

316021 AW293399 Hs.144904 nuclear receptor co-repressor1 792.4 5.3 

320802 BE336699 Hs.185055 BENE protein 2423.8 5.3 

55 317282 AI733112 Hs.176101 ESTs 523.2 5.1 

316827 AI380429 Hs.172445 ESTs 578 5.1 

303190 BE280787 Hs.16079 hypothetical protein FLJ10233 223 5.1 

315587 AI268399 Hs.140489 ESTs 136.2 5 

333122 predicted exon 399 5 

60 310214 A1220072 Hs.165893 ESTs 234.4 4.9 

320089 D43945 Hs.113274 transcription factor EC 68 4.9 

309328 AW024348 Hs.233191 EST, Weakly similar to A2721 7 glucose tr 258.8 4.8 

318971 Z44067 Hs.10957 ESTs 376.6 4.8 

327220 predicted exon 47.4 4.7 

65 315757 AW014605 Hs.179872 ESTs 177.4 4.7 

320730 R68869 Hs.1 51072 ESTs 205.2 4.6 

313339 AI682536 Hs.163495 Homo sapiens cDNA FLJ 13608 fis, clone PL 260 4.5 

318634 T49598 Hs.156832 ESTs 475.2 4.5 

320955 AW820035 Hs,278679 a dislntegrin and metalloproteinase doma 388.6 4.4 

70 306605 AI000497 Hs.1 19500 ribosomalprotein,largeP2 .81.6 4.4 

309349 AW051913 gb:wx24a09.x1 NCLCGAP_Kid1 1 Homo sapien 102.44.3 

306004 AA889992 Hs.2186 eukaryotictranslationelongationfadorlga 451.2 4.2 

330020 predicted exon 61.2 4.1 

302308 AW327279 Hs.91379 ribosomal protein L26 342 3.9 

7 5 314648 AW979268 gb:EST391378 MAGE resequences, MAGP Homo 56.4 3.8 

315131 AI753709 Hs.152484 ESTs 130.4 3.7 
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313690 AI493591 Hs.78146 platelet/endothelial cell adhesion molec 3179.6 3.6 

333585 predicted exon 175.4 3,5 

312911 H93366 Hs.7567 Homo sapiens cDNA; FU21962 fis, clone H 219 3.5 

322966 AA633669 Hs.235920 Homo sapiens cell recognition molecule C 350.2 3.4 

5 312492 R71072 Hs.191269 ESTs 322.8 3 

318988 Z44203 Hs.26418 ESTs 25 2.5 

332363 AI123705 Hs.106932 ESTs 773.4 2.5 

324181 AI025476 Hs.131628 ESTs 634.8 2.4 

311717 AW205369 Hs.312830 ESTs 54.2 2.4 

10 321342 AA1 27984 Hs.222024 transcription factor BMAL2 23.4 2.3 

308852 AI829848 Hs.182937 peptidylpralylisomeraseA(cydophilinA) 92 2.3 

331466 AA373210 Hs.43047 Homo sapiens cDNA FU 13585 fis, clone PL 494 2.3 

320279 AB033062 Hs.134970 DKFZP434N178 protein 76.2 2.2 

322221 N24236 Hs, 1 79662 nucleosome assembly protein 1 -like 1 253.2 2.1 

15 302925 AL137449 Hs.126666 homeoboxB4 136.6 2.1 

331384 AB041035 Hs.93847 NADPH oxidase 4 720 1.8 

' 300938 AA514416 Hs.152320 ESTs, Weakly similar to 1605244A erythro 27 1.8 

312695 AW196663 Hs.200242 ESTs 303.8 1.6 

320223 W35132 Hs.267442 ESTs 189 1.5 

20 332743 AW247977 Hs.87595 translocase of inner mitochondrial membr 14.4 1.4 

331039 AW378685 Hs.18625 Mitochondrial Acyl-CoA Thioesterase 529.8 1.4 

333123 predicted exon 396.2 1.4 

328455 predicted exon 91.8 1.3 

334458 predicted exon 406.4 1.3 

25 313478 AA643008 Hs.192775 ESTs 413.4 1.1 

309899 AW338564 Hs.217493 annexinA2 -30.8 1 

311735 AW294416 Hs.144687 Homo sapiens cDNAFL J 12981 fis, clone NT -62.8 1 

312953 NM_001992Hs.1 28087 coagulation factor II (thrombin) recepto -73.6 1 

313055 AW367295 Hs.241175 ESTs -43.8 1 

30 313291 AI267970 Hs.150614 ESTs, Weakly similar to ALU4_HUMAN ALU S -63 1 

315059 AW275110 Hs.271106 ESTs -67 1 

322284 AI792140 Hs.49265 ESTs -395.2 1 

322450 AL121278 Hs.25144 ESTs -1.6 1 

324803 AW975183 Hs.292663 ESTs 4.4 1 

35 331495 AW970939 Hs.291039 ESTs -282.8 1 

333610 predicted exon -152.6 1 

335093 predicted exon -23.2 1 

339403 predicted exon -331.2 1 

302820 X04588 Hs.85844 neurotrophic tyrosine kinase, receptor, 591.2 1 

40 302270 R56151 Hs.93589 Homo sapiens mRNA; cDNA DKFZp564B1162 (f 276.61 

323755 AW300094 Hs.136252 ESTs 135 0.9 

326946 predicted exon 727.4 0.9 

315343 BE144306 Hs.179891 ESTs, Weakly similar to P4HA_HUMAN PROLY 122.80.9 

311168 AK001270 Hs.196086 hypothetical protein FLJ 10408 304 0.9 

45 329732 predicted exon 109.2 0.9 

321415 BE621807 Hs.3337 transmembrane 4 superfamily member 1 414.8 0.7 

333121 predicted exon 87.8 0.7 

333120 predicted exon 379.8 0.7 

330392 AW797956 Hs.75748 proteasome (prosome, macropain) subunit, 589.2 0.7 

5 0 314711 AA769365 Hs. 126058 ESTs -87 0.6 

330865 BE409857 Hs.69499 hypothetical protein 347.4 0.6 

333169 predicted exon -1182 0.6 

335095 predicted exon 106.4 0.6 

335815 predicted exon -156 0.6 

5 5 330232 predicted exon 102.6 0.6 

330823 AA031 565 Hs.221 255 ESTs, Moderately similar to ALU5_HUMAN A -62 0.5 

331704 F04225 Hs.66032 ESTs -14.6 0.5 

302642 NM_016428Hs.1 30719 NESH protein 267.6 0.5 

304484 AA432067 Hs.258373 ESTs 85 0.5 

60 310230 AK000377 Hs.144840 hornotog of mouse C2PA -70 0.4 

301531 AI077462 Hs.134084 ESTs -195.4 0.4 

306337 AA954221 HsJ3742 ribosomalprotein.large.PO -33.4 0.4 

331327 N46436 Hs.109221 ESTs -392 0.4 

332961 predicted exon -5.6 0.4 

65 322796 W31178 Hs.154140 Homo sapiens ovary-specific acidic prote -880.6 0.3 

328857 predicted exon 55.2 0.3 

316342 AA743935 Hs.202329 ESTs 43.4 0.3 

331263 AW780192 Hs.267596 ESTs -180.4 0.3 

335987 predicted exon -134 0.3 

70 311923 T60843 Hs.189679 ESTs 12.2 0.3 

310522 AW134529 Hs.244647 ESTs -187.8 0.3 

315363 AA759190 Hs.121454 ESTs, Weakly similar to olfactory recept 80 0.3 

302032 NM_001992Hs.128087 coagulation factor II (thrombin) recepto -877 0.3 

313140 BE265133 Hs.217493 annextnA2 95.4 0.3 

75 310860 AW015920 Hs.161359 ESTs -239 0.3 

317899 AI952430 Hs.150614 ESTs, Weakly similar to ALU4 JHUMAN ALU S -715.2 0.3 
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328520 predicted exon -109.2 0.2 

302406 NM_012099Hs.211956 CD3-epsilon-associated protein; antisens 10 0.2 

311804 AI866921 Hs.203349 Homo sapiens cDNA FU12149 (is, clone MA -252.6 0.2 

315065 AK001122 Hs.105859 hypothetical protein FLJ10260 -46.2 0.2 

5 314129 AA228366 Hs.1 15122 ESTs -308.8 0.2 

335697 predicted exon -47.2 0.2 

335989 predicted exon 89 0.2 
320606 AW867943 Hs.127216 hypothetical protein FU13465 -205.6 0.2 
329745 predicted exon 103 0.2 

10 313628 AW419069 Hs.209670 ESTs -177.8 0.2 

334616 predicted exon -936.6 0.2 

308820 AI821267 Hs.207243 EST -7.2 0.2 

320416 A1026984 Hs.293662 ESTs -18.4 0.2 

335211 predicted exon -142 0.2 

15 323629 AA375957 Hs.6682 ESTs -100 0.1 

331420 AW452904 gb:UI-H-BI3-a)y-h-11-0-Ul.s1 NCLCGAP.Su 83 0.1 

315984 AI015862 Hs.131793 ESTs -250.6 0.1 

332833 predicted exon -374.2 0.1 

332607 NM_002314Hs.36566 LI M domain kinase 1 -27.6 0.1 

20 313467 AA004879 Hs.187820 ESTs -288.2 0.1 

323333 AV651680 Hs.208558 ESTs -735.6 0.1 

330775 AW247020 Hs.250747 SUMO-1 activating enzyme subunit 1 53.6 0.1 

333168 predicted exon -1041,8 0.1 

332079 AI308876 Hs.103849 ESTs 19.4 0.1 

25 322724 AF161442 Hs.191591 Homo sapiens HSPC324 mRNA, partial cds -123.6 0.1 

303652 AI799111 Hs.64341 ESTs -46.4 0.1 

303131 AW081061 Hs.10318O DC2 protein -156.4 0.1 

320716 AI479439 Hs.171532 ESTs -146.6 0.1 

300454 AA659037 Hs.163780 ESTs -304 0.1 

30 312757 AI285970 Hs.183817 ESTs -445 0.1 

312391 R43707 Hs.133159 ESTs, Weakly similar to PIHUSD salivary -111.8 0.1 

308877 AI832519 gb:at69h03.x1 Barstead colon HPLRB7 Homo-149.6 0 

311275 AI659166 Hs.207144 ESTs -62.6 0 

302363 AW163799 Hs.198365 2,3-bisphosphoglycerate mutase -15 0 

35 321717 AW956580 Hs.42699 ESTs -1059.6 0 

302638 AA463798 Hs.102696 MCT-1 protein -332.2 0 

306352 AA961367 gb:or52a05.s1 NCLCGAPJ3C3 Homo sapiens 21.8 0 

313798 AI292148 Hs.71622 SWI/SNF related, matrix associated, acti -97.2 0 

320807 AA135370 Hs.188536 Homo sapiens cDNA: FLJ21635 fis, clone C -2222 0 

40 320931 AW262836 Hs.252844 ESTs -881.6 0 

332450 AW288085 Hs.11156 hypothetical protein 28.4 0 

332535 AF167706 Hs.19280 cysteine-rich motor neuron 1 -722 0 

335990 predicted exon -421 0 
330746 AB033888 Hs.8619 SRY (sex determining region Y)-box 18 35.4 0 

45 316820 AI627912 Hs.130783 Forssm an synthetase -373.6 0 

337429 predicted exon -257 0 

331192 BE622021 Hs.152571 ESTs, Highly similar to IGF-ll mRNA-bind -33 0 

330609 AI346201 Hs.76118 ubiquitin carboxyl-terminal esterase L1 -280 0 

323593 AI739435 Hs.39168 ESTs -3627.6 0 

50 302704 AA531133 Hs.4253 hypothetical protein MGC2574 -278.6 0 

330534 NM_004579Hs.82979 mitogen-activating protein kinase kinase -244 0 

332374 X91195 Hs.100623 phospholipase C, beta 3, neighbor pseudo -1204.2 0 

333221 predicted exon -189.6 0 

335988 predicted exon -122.6 0 

55 330574 AI984144 Hs.66713 hepatitis delta antigen-interacting prot -2257.4 0 

312052 BE621697 Hs.14317 nucleolar protein family A. member 3 (H/ -359.2 0 

319568 AF131781 Hs.84753 hypothetical protein FL J 12442 ' -874.6 0 

337113 predicted exon -24.6 0 

335149 predicted exon -191.8 0 
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TABLE 6A 

Table 6A shows the accession numbers for those pkeys lacking unigenelD's for Table 6. The pkeys In Table 7 lacking unlgenelD's are represented within 
Tables 1-6A. For each probeset we have listed the gene cluster number from which the oligonucleotides were designed. Gene clusters were compiled 
using sequences derived from Genbank ESTs and mRNAs. These sequences were clustered based on sequence similarity using Clustering and 
Alignment Tools (DoubleTwist, Oakland California). The Genbank accession numbers for sequences comprising each cluster are listed in the "Accession" 
column. 

Pkey: Unique Eos probeset identifier number 

CAT number Gene cluster number 
Accession: Genbank accession numbers 

Pkey CAT Number Accession 

320925 1525201.1 D62892 D79755 D62760 

321614 87866J H86161 AA054303 AA018955 

313952 136885 J F20956 AA 129374 AA 133740 A W81 9878 

314648 293660J AW979268 AA878419 AA431342 AA431628 

302749 458 107 M16951 M16952 M16948 M16949 M16950 

312362 764066 1 AW01 5994 R39898 AW000978 AI598202 AI521 706 

312542 1522649 1 D60076 D60259 D61037 

312642 1005225 1 AW052128 H51439 H51481 

312986 171879J AA21 1586 F35799AA21 1641 F29720AW937387 AW937408 

329350 cjc hs 

329414 c_y_hs 

329440 c y hs 

329451 c_yjis 

338033 CH22_6528FG_UNK_EM:AC00 

338038 CH22_6535FG__LINK_EM:ACQ0 

338116 CH22 6650FG__UNKLEM:AC00 

338158 CH22J700FG_UNK_EM:AC00 

329732 c14j)2 

329745 c14j>2 

308106 AI476803 

329863 c14j>2 

336316 CH22_6944FG_UNK_EM:AC00 

308248 AI560919 

338388 CH22_7034FG_UNK_EM:ACOO 

338442 CH22_7109FG_UNK_EM:ACOO 

338645 CH22_7410FG_JJNK_EM:AC00 

338728 CH22_7527FG_JJNK_EM:AC00 

308877 AI832519 

338962 CH22_7838FG_UNKJDJ32I10 

308886 AI833240 

333120 CH22_349FG_81J_LINK_EMA 

333121 CH22 350FG_81_4 LINK_EM:A 

333122 CH22J}51FGJ1_6_UNK_EM:A 

333123 CH22J52FG_81_7J_INK_EM:A 

333168 CH22J00FG_94_1JJNK_EM:A 

333169 CH22_401FG_94J_UNK_EM:A 
333221 CH22_458FGJ05J_LINK_EM: 
326077 C17 hs 

326080 c17Jis 

326169 c17_hs 

326198 c17_hs 

326230 c17_hs 

333585 CH22_846FG_203 4J_INK_EM: 

333610 CH22_871FG__217_5_LINK_EM: 

335093 CH22 2423FG 492 3_LINK_EM 

335095 CH22J425FG 492_5_UNK_EM 

335149 CH22J484FG_499_5JJNK^EM 

326759 c20_hs 

333977 CH22J254FG_309_6JJNK_EM 

326788 c20_hs 

335211 CH2£_2550FG_511 2_UNK_EM 

305192 AA666019 

303973 AW512014 

303992 AW515800 

326946 c21Jis 

328229 c_6Jis 

328262 c_6__hs 
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328418 Cj hs 
328455 c_7_hs 

335697 CH22J058FG 596J2J.INK-E 
328520 c 7_hs 
328548 c 7 hs 

335815 CH22J187FGJ18_3_LINK - EM 

328688 c_7_hs 

328695 cJMis 

307010 AI140014 

337113 CH22J058FG_493_1_ 

307041 Al 144243 

328700 c 7 hs 

335946 CH22J324FG_646_20_LINK_D 

335986 CH22_3366FG 654J0JJNKJ) 

335987 CH22_3367FG_654J1JJNKJ) 

335988 CH22_3368FG_654J2JJNK_D 

335989 CH22_3369FG_655J_LINKJ)J 

335990 CH22_3370FG 655_4_UNKJ)J 
337214 CH22_5288FG_613J_ 
330020 c16_p2 

305989 AA888220 

328857 c_7Jis 

328937 c_8_hs 

328957 c 8 _hs 

330187 c_4_p2 

337407 CH22_5607FG_755_1_ 

337429 CH22J633FG_762JL 

330232 c_5j)2 

307414 AI242106 

330305 c_7_p2 

330306 C_7_p2 

337603 CH22_5896FG_LIN}CC20H12. 
337953 CH22_6395FG_UNK^EM:AC00 
339236 CH22_8181FG_UNK^BA354I1 
339403 CH22_8384FG__LINK^BA232E1 
309349 AW051913 
325222 dOJis 
325251 dOJis 

318188 956161J AI792566 AI053836 AI054127 AI792489 AI288324 
309871 AW300366 
325544 c12_hs 
309931 AW341683 

332833 CH22_50FG_17_7_LINKC20H1 

302779 33837 1 AJ235667 AJ235666 AJ235664 AJ235665 AJ235668 AJ235669 AJ235670 

302790 34168J AJ245245 AJ245247 AJ245257 AJ245248 AJ245254 AJ245256 AJ245253 AJ245203 AJ245250 AJ245252 AJ245243 AJ245204 

AJ245201 AJ245206 AJ245246 AJ245255 AJ245205 AJ245202 AJ245251 AJ245249 AJ245207 AJ245244 
332961 CH22J85FG_48J8JJNK_EM: 
325753 c14_hs 
327036 c21_hs 
325843 c16Jis 
325889 c16_hs 
304261 AA059387 
304275 AA070605 

334376 CH22_1670FG_379_8JJNK_EM 
327220 cj_hs 
304363 AA206045 

334458 CH22_1757FG_391_2JJNK_EM 
327365 cjjis 
327373 c_2_hs 

334616 CH22J923FGJ11_15_LINK_E 
327414 c_2Jis 
327568 c_3_hs 

336034 CH22_3419FGJ78_5_LINK_DJ 
336059 CH22J445FG 684_2_LINK_DJ 
334834 CH22_2148FGJ39_3_LINK_EM 
304782 AA582081 
304876 AA595765 
327747 c_5_hs 

336228 CH22_3626FGJ30_4_LINK_DA 

329073 c.xjis ' 

329088 c_x_hs 

304969 AA614406 

327844 cj_hs 

327876 cj.hs 

306352 AA961367 

331131 genbankj*54797 R54797 
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331139 genbank_R65706 R65706 

331420 675963 J AW452904 AW449414 BE467906 AI298565 BE549932 BE326357 F04362 
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TABLE 6B 

Table 6B shows the genomic positioning for those pkeys lacking unigene ID's and accession numbers in Table 6. The pkeys in Table 7 lacking 
unigenelD's are represented within Tables 1-6B. For each predicted exon, we have listed the genomic sequence source used for prediction. Nucleotide 
locations of each predicted exon are also listed. 



Pkey: Unique number corresponding to an Eos probeset 

Ref: Sequence source. The 7 digit numbers in this column are Genbank Identifier (Gl) numbers. "Dunham I. et al." refers to the publication 

entitled "The DNA 

sequence of human chromosome 22." Dunham I. et al., Nature (1999) 402:489-495. 
Strand: Indicates DNA strand from which exons were predicted. 

NLposition: Indicates nucleotide positions of predicted exons. 



Pkey Ref 

332961 Dunham, I. 

333221 Dunham, I. 

333585 Dunham, I. 

333610 Dunham, I. 

334376 Dunham,!. 

334458 Dunham,!. 

334616 Dunham,!. 

335149 Dunham, I. 

335211 Dunham,!. 

335697 Dunham, I. 

335986 Dunham, I. 

335987 Dunham,!. 

335988 Dunham,!. 

335989 Dunham, I. 

335990 Dunham, I. 
336034 Dunham, I. 
337953 Dunham, I. 
338033 Dunham, I. 
338038 Dunham, I. 
338316 Dunham, I. 
338442 Dunham, I. 
338962 Dunham, I. 
332833 Dunham, I. 

333120 Dunham, I. 

333121 Dunham, I. 

333122 Dunham, I. 

333123 Dunham, I. 

333168 Dunham, I. 

333169 Dunham. I. 
333977 Dunham, I. 
334834 Dunham, I. 
335093 Dunham,!. 
335095 Dunham, I. 
335815 Dunham, I. 
335946 Dunham, I. 
336059 Dunham, I. 
336228 Dunham,!. 
337113 Dunham, I. 
337214 Dunham,!. 
337407 Dunham. I. 
337429 Dunham, I. 
337603 Dunham,!. 
338116 Dunham,!. 
338158 Dunham, I. 
338388 Dunham, I. 
338645 Dunham, I. 
338728 Dunham,!. 
339236 Dunham, I. 
339403 Dunham, I. 
325222 6525287 
325251 6682448 
325544 6682452 
325753 6682474 
329745 6065779 
329732 6065783 
329863 6691797 
325889 5867087 





Strand 


etaL 


Plus 


etal. 


Plus 


et.al. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etai. 


Pius 


etal. 


Plus 


etal. 


Plus 


etal. 


Plus 


etal. 


Minus 


etal. 


Minus 


etal. 


Minus 


etal. 


Minus 


etal. 


Minus 


etal. 


Minus 


etal. 


Minus 


etal. 


Minus 


et.al. 


Minus 


et.al. 


Minus 


etal. 


Minus 


et.al. 


Minus 


etal. 


Minus 


et.al. 


Minus 


et.al. 


Minus 


etal. 


Minus 


eta!. 


Minus 


etai. 


Minus 


et.al. 


Minus 


etal. 


Minus 


etal. 


Minus 


etal. 


Minus 


et.al. 


Minus 


etal. 


Minus 


etal. 


Minus 


etal. 


Minus 


eta!. 


Minus 




Minus 




Minus 




Plus 




Plus 




Plus 




Plus 




Plus 




Plus 



Nt ^position 

2521424-2521555 

3978070-3978187 

6234778-6234894 

6547007-6547116 

13902218-13902331 

14353496-14353572 

15176123-15176470 

21497441-21497587 

21774611-21774680 

25481456-25481649 

27967791-27967852 

27971413-27971481 

27977912-27978013 

27983788-27983860 

27988532-27988608 

29014404-29014590 

6827029-6827125 

8092128-8092271 

8138219-6138392 

17089711-17089988 

19980640-19980698 

29581892-29582020 

1119848-1119705 

3307508-3307427 

3308446-3308358 

3309596-3309531 

3310817-3310749 

3729896-3729788 

3730864-3730767 

8722928-8722725 

17182681-17182535 

21297367-21297214 

21292546-21292381 

26320518-26320421 

27487203-27487035 

29184079-29183969 

30904602-30904497 

21233344-21233237 

26095902-26095502 

31886652-31886567 

32086238-32086079 

1299296-1299194 

10614071-10613814 

11794465-11794343 

18662403-18662305 

24063839-24063775 

25949039-25948927 

32773355-32773202 

34050728-34050625 

22332-22473 

411693-411751 

171228-171286 

398512-398621 

174774-175142 

161252-161322 

196801-196971 

223829-223891 
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325843 


6552453 


Minus 


7126-7232 


330020 


6671887 


Plus 


172397-172491 


326198 


5867215 


Minus 


80295-80674 


326230 


5887230 


Minus 


301868-301972 


326169 


5867255 


Minus 


128321-128388 


326077 


6682495 


Minus 


312108-312168 


326080 


6682495 


Plus 


478644478847 


326759 6249610 


Plus 


97216-97311 


326788 6682503 


Plus 


277132-277335 


326946 


6004446 


Minus 


116677-116967 


327036 


6531965 


Plus 


319951-320040 


327220 


5867525 


Minus 


65701-65781 


327365 


6552412 


Minus 


118133-118198 


327414 


5867750 


Plus 


102461-102566 


327373 


5867792 


Minus 


8186-8742 


327568 


5867811 


Minus 


4615246287 


330187 


6706138 


Plus 


212923-213020 


327747 


5867947 


Plus 


115322-115498 


327844 


6249582 


Minus 


18895-18958 


330232 


6013526 


Plus 


113655-113830 


328229 


5868105 


Minus 


120936-121053 


327876 


5866140 


Plus 


103882-104034 


328262 


6381906 


Plus 


11867-12027 


328688 


5868262 


Plus 


626030-626094 


328700 


5868264 


Pius 


764089-764203 


328695 


5868264 


Plus 


318632-318695 


328418 


5868409 


Minus 


258811-258894 


328455 


5868431 


Plus 


385576-385633 


328520 


5868477 


Plus 


1942075-1942246 


328548 


5868487 


Plus 


72301-72397 


328857 


6381927 


Minus 


80557-81051 


330305 


4877982 


Minus 


52269-52365 


330306 


4877982 


Plus 


96161-96233 


328937 


5868500 


Minus 


1448241-1448333 


328957 


6456773 


Plus 


219195-219297 


329073 


5868596 


Plus 


37838-37956 


329088 


5868608 


Plus 


116738-116950 


329350 


6456785 


Plus 


98911-98969 


329414 


5868874 


Plus 


942555-942643 


329440 


5868885 


Plus 


21943-22063 


329451 


5868887 


Plus' 


25974-26048 
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TABLE 7: 

Table 7 depicts Seq ID No., UnlgeneiD, UnigeneTitle, Pkey, and ExAccn for all of the sequences in Table 8. Seq ID No links the nucleic acid and protein 
sequence Information inTable 8 to Table 7. 



Pkey: 


Unique Eos probeset identifier number 




ExAccn: 


Exemplar Accession number, Genbank accession number 




UnigenelD: 


Unigene number 






Unigene Title: Unlgene gene title 






Seq.ID.No.: 


Sequence Identification Number found in Table 8 




PKey 


txAccn 


unigene iu 


unigene i wie 


SEQ ID NO 


101545 


QdARWZA 
bt240lD4 


ns. IIWZ IU 


crvHnfholIal Hifforonti^stinn crvhinnnllnl 

ciiuouiciiai uinereniiaiiun, auiiniguiiui 


Seq ID 1 & 2 


115819 


A/vtODD^U 


Lie 411^*5 
rib. 4 1 1 0 J 


onHnmi ipln_0 


Seq ID 3 & 4 


424503 


My nrioinK 


ns.i*iyouy 


wiegnn, aipna o uiDronecun receptor, 


Seq ID 5 & 6 


102917 


AI016712 


Lie 0Q77Q7 


miegnn, oeia i uioroneciin recepior, 


Seq ID7&8 


102915 


XQ7o2U 


Ue OOGfl 

nS.22oo 


mamx rneiaiioproieinase iu isuumeiybiii 


Seq ID 9 & 10 


105330 


AW338625 


Ue 001 on 


CO IS 


Seq ID 11 & 12 


107385 


NM_Q05397 


nS.lb42b 


poaocaiyxin-iiKe 


Seq ID 13 & 14 


102024 


A A OfH OCT 

AA301867 


Up 7C00>I 

HS./0224 


t^r*coniaining iiouun-iiKc exiraceiiuia 


Seq ID 15 & 16 


102024 


AA301867 


HS.76224 


tv3r-<x)niaintng iiouun-itKe exiraceiiuia 


SeqlD17&1B 


134416 


X68264 


Lin 4 E7fi 

HS.211o7y 


melanoma ceil aonesion moiecuie 


Seq ID 19 & 20 


103036 


M 13509 


nS.o3lDb 


mauix meiaiioproteinase i ^niersuuai 


Seq ID 21 &22 


104865 


T79340 


it _ onc7c 

HS.22o75 


t>-ceii trLL/iympnoma o, memoer o \zinc u 


SeqiD23&24 


106124 


ny33bb 


Ue 7RR7 


UniMA eanianc oDNIA* PI !91Qfi9 fr<3 rlnno H 

nomo sapiens cuina. rLjt lyo^ iio, uuhc n 


Seq ID 25 & 26 


109001 


A \f\CCCAQ 


lis. / 21 lb 


nypoineiicai proiein rLJzusy^ simiior iu 


Seq ID27&28 


104764 


AI039243 


!_J~ 07DC0C 

HS.z/oooo 


CQTe 

to IS 


Seq ID 29 & 3D 


133200 


A 0/1177 1C 


ns. loobjy 


nypoineucai proiein rLj iuz iu 


Seq ID 31 & 32 


105263 


AW3oo633 


nS.bbo2 


soiuie camenamiiy /, ^canonic amino 


Seq ID 33 & 34 


102892 


OC.44UU42 


nS.oJo2b 


mauix meianoproieinase j ^suomeiysin 


Seq ID 35 & 36 


109456 


AW956580 


ns.42byy 


to IS 


Seq ID 37 & 38 


110906 


AA03521 1 


HS.1/4U4 


to IS 


Seq ID 39 & 40 


119073 


BE245360 


U- 17ft vl 77 

HS.279477 


toTS 


Seq ID 41 & 42 


132050 


AI267615 


rlS.JBU22 


to IS 


Seq ID 43 & 44 


132490 


NMJX)1290 


i_l _ innrt 

HS.49B0 


LIM domain binding 2 


Seq ID 45 & 46 


102283 


AW161552 


HS.OJool 


guanine nucieouce umuing proiein 1 1 


Seq ID 47 & 48 


101714 


Mo 8874 


Ll« H H C07 


AU<sAAkAl!nAAA A O Ami irN IV / A / 

pnospnonpase A2, group iva i^cytosoiic, 


Seq ID 49 & 50 


133975 


r^A nice 


ns.2yDy44 


iissue lacior pamway inniuiior z 


Seq ID 51 & 52 


106793 


UCMOG7 


MS. lb40U 


to 1 S 


Seq ID 53 & 54 


118511 


N75o<iU 


Ue Alt K7 


to 1 S 


Seq ID 54 & 55 


101447 


» ini fine 

M21305 




go.numan aipna sateiiiie ana saieiuie o 


Seq ID 56 & 57 


314941 


AAo1oyU2 


Ue llfiCRA 

MS.JoUbOU 


CCTe 

to IS 


Seq ID 58 & 59 


332466 


Abu io2oy 


ns. I io ihu 


iMAAUr io gene piuuuci 


Seq ID 60 & 61 


313513 


AW/yoouu 


ns. 1*1 io**u 


co is, vvedwy oiiniidi iu ojjjui iihcmciu 


Seq ID 62 & 63 


313556 


A ACOQR'17 


Ue HQEflO 


CCTe 
to IS 


Seq ID 64 & 65 


313665 


AW/D12U1 


Ue ^-lOII 

nS.DJ 2oo 


CCTe 
to IS 


Seq ID 66 & 67 


314372 


ALUhU I/O 


Ue i/ionm 


CCTe 

to IS 


Seq ID 68 & 69 


429276 


a cnccnRt; 
ArUooUoD 


Ue ioQcn 


o protem-coupieo recepior o i 


Seq ID 70 & 71 


101345 


MM nn^7Qs 


ns. isic. i / j 


rAfrlfnntn rprpnfnr-likp 


Seq ID 72 & 73 


418994 


AA296520 


Hs.89546 


selectin E (endothelial adhesion molecul 


Seq ID 74 & 75 


103850 


AA187101 


Hs.213194 


hypothetical protein MGC10895 


Seq ID 76 & 77 


133260 


AA403045 


Hs.6906 


Homo sapiens cDNA: FLJ23197 fis, clone R 


Seq ID 78 & 79 


101097 


BE245301 


Hs.89414 


chemokine (C-X-C motif), receptor 4 (fus 


Seq ID 80 & 81 


104786 


AA027167 


Hs.10031 


KIAA0955 protein 


Seq ID 82 & 83 


132173 


X89426 


Hs.41716 


endothelial cell-specific molecule 1 


Seq ID 84 & 85 


100420 


D86983 


Hs.118893 


Melanoma associated gene 


Seq ID 86&87 


111018 


AI287912 


Hs.3628 


mitogen-aciivated protein Kinase kinase 


Seq ID 88 & 89 


108507 


AI554545 


Hs.68301 


ESTs 


Seq ID 90 & 91 


104894 


AF065214 


Hs.18858 


phospholipaseA2, group IVC (cytosolic, 


Seq ID 92 & 93 


118511 


N75620 


Hs.43157 


ESTs 


Seq ID 94 & 95 


125609 


AA868063 


Hs.104576 


carbohydrate (keratan sulfate Gal-6) sul 


Seq ID 96 & 97 


101543 


M31166 


Hs.2050 


pentaxin-related gene, rapidly induced b 


Seq ID 98 & 99 


102241 


NM_007351 


Hs.268107 


multimerin 


Seq ID 100 & 101 


101560 


AW958272 


Hs.347326 


intercellular adhesion molecule 2 


Seq ID 102 & 103 


103280 


U84722 


Hs.76206 


cadherin 5, type 2, VE-cadherin (vascula 


Seq ID 104 & 105 


105826 


AA478756 


Hs.194477 


E3 ubiquittn iigase SMURF2 


Seq ID 106 & 107 


102804 


NMJ02318 


Hs.83354 


lysyl oxidase-like 2 


Seq ID 108 & 109 


131647 


AA359615 


Hs.30089 


ESTs 


Seq ID 110 & 111 


103095 


NWL005424 


Hs.78824 


tyrosine kinase with immunoglobulin and 


Seq ID 112 & 113 


103037 


BE0 18302 


Hs.2894 


placental growth factor, vascular endoth 


Seq ID 114 & 115 


100405 


AW291587 


Hs.82733 


nidogen 2 


Seq ID 116 & 117 


102012 


BE259035 


Hs.1 18400 


singed (Drosophila)-like (sea urchin fas 


Seq ID 118 & 119 
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101261 


D30857 


Hs.82353 


protein C receptor, endothelial (EPCR) 


Seq ID 120 & 121 


105729 


H46612 


Hs.293815 


Homo sapiens HSPC285 mRNA, partial cds 


Seq ID 122 & 123 


107216 


D51069 


Hs.211579 


melanoma cell adhesion molecule 


Seq ID 124 & 125 


131080 


NM.001955 


Hs.2271 


endothelin 1 


Seq ID 126 & 127 


131486 


F06972 


Hs.27372 


BMX non-receptor tyrosine kinase 


Seq ID 128 & 129 


134299 


AW580939 


Hs.97199 


complement component C1q receptor 


Seq ID 130 & 131 


134983 


D28235 


Hs.196384 


prostaglandin-endoperoxide synthase 2 (p 


Seq ID 132 & 133 


115827 


AA428000 


Hs.283072 


actin related protein 2/3 complex, subun 


Seq ID 134 & 135 


133614 


NM.003003 


Hs.75232 


SEC14 (S. cerevislae)-like 1 


Seq ID 136 & 137 


116483 


AI346201 


Hs.76118 


ubiquitin carboxyl-terminal esterase L1 


Seq ID 138 & 139 


132546 


M24283 


Hs.168383 


intercellular adhesion molecule 1 (CD54) 


Seq ID 140 & 141 


133678 


AW247252 


NA 


nucleoside phosphorylase 


Seq ID 142 & 143 


130184 


H58306 


Hs.15165 


retinoicacid induced 14 


Seq ID 144 & 145 


134786 


T29618 


Ms.89640 


TEK tyrosine kinase, endothelial (venous 


Seq ID 146 & 147 


129371 


X06828 


Hs.110802 


von Willebrand factor 


Seq ID 148 & 149 


418506 


AA084248 


Hs.85339 


G protein-coupled receptor 39 


Seq ID 150 & 151 


322262 


AA632012 


Hs.1 88746 


ESTs 


Seq ID 152 & 153 


312173 


AI821409 


Hs.304471 


EST 


Seq ID 154 & 155 


319795 


AB037821 


Hs.146858 


protocadherin 10 


Seq ID 156 & 157 


313978 


A1870175 


Hs.13957 


ESTs 


Seq ID 158 & 159 


306840 


AI077477 


Hs.307912 


ESTs 


Seq ID 160 & 161 


310272 


AF216389 


Hs.148932 


sema domain, transmembrane domain (TM), 


Seq ID 162 & 163 


310272 


AF216389 


Hs.148932 


sema domain, transmembrane domain (TM), 


Seq ID 164 & 165 


315044 


BE547674 


Hs.204169 


ESTs, Weakly similar to S65657 alpha-1C- 


Seq ID 166 & 167 


321325 


AB033100 


Hs.300646 


KIAA1274 protein (similar to mouse palad 


Seq ID 168 & 169 


321325 


AB033100 


Hs.300646 


KIAA1274 protein (similar to mouse palad 


Seq ID 170 & 171 


303251 


AF240635 


Hs.1 15897 


protocadherin 12 


Seq ID 172 & 173 


302378 


AL109712 


Hs.296506 


Homo sapiens mRNA full length insert cDN 


Seq ID 174 & 175 


319267 


F11802 


Hs.6818 


ESTs 


Seq ID 176 & 177 


310442 


AW072215 


Hs.208470 


ESTs 


Seq ID 178 & 179 


300469 


BE301708 


Hs.233955 


hypothetical protein FLJ20401 


Seq ID 180 & 181 


331237 


W87874 


Hs.25277 


Homo sapiens cDNA FUI10717 fis; clone NT 


Seq ID 182 & 183 


330968 


R44557 


Hs.23748 


ESTs 


Seq ID 184 & 185 


301822 


X17033 


Hs.271986 


integrin, alpha 2 (CD49B, alpha 2 subuni 
integrin, alpha V (vitronectin receptor 


Seq ID 186 & 187 


422573 


AW297985 


Hs.295726 


Seq ID 188 & 189 


133061 


AI186431 


Hs.296638 


prostate differentiation factor 


Seq ID 190 & 191 


135235 


AW298244 


Hs.266195 


ESTs 


Seq ID 192 & 193 


101192 


BE247295 


Hs.78452 


solute carrier family 20 (phosphate tran 


Seq ID 194 & 195 


113195 


H83265 


Hs.8881 


ESTs, Weakly similar to S41044 chromosom 


Seq ID 196 & 197 


101741 


NM 003199 


Hs.326198 


transcription factor 4 


Seq ID 198 & 199 


321911 


AF026944 


Hs.293797 


ESTs 


Seq ID 200 & 201 


320635 


N50617 


Hs.80506 


small nuclear ribonucleoprotein polypept 


Seq ID 202 & 203 


326230 






NM_017643:Homo sapiens hypothetical prot 


Seq ID 204 & 205 


132968 


AF234532 


Hs.61638 


myosin X 


Seq ID 206 & 207 


135073 


W55956 


Hs.94030 


Homo sapiens mRNA; cDNA DKFZp586E1624 (f 


Seq ID 208 & 209 


108937 


AL050107 


Hs.24341 


transcriptional co-activator with PDZ-bi 


Seq ID 210 & 211 


116430 


AK001531 


Hs.66048 


hypothetical protein FLJ10669 


Seq ID 212 & 213 


104877 


Al 138635 


Hs.22968 


Homo sapiens clone IMAGE:451939, mRNA se 


Seq ID 214 & 215 


122697 


AA420683 


Hs.98321 


hypothetical protein FLJ14103 


Seq ID 216 & 217 


112522 


R68857 


Hs.265499 


ESTs 


Seq ID 218 & 219 


304782 


AA582081 




gb:nn32h08.s1 NCI_CGAP_Gas1 Homo sapiens 


Seq ID 220 & 221 


312802 


AA644669 


Hs.1 93042 


ESTs 


Seq ID 222 & 223 


302680 


AW1 92334 


Hs.38218 


ESTs 


Seq ID 224 & 225 


326198 






Phase 2 & 3 Exons 


Seq ID 226 & 227 


331019 


NM_006033 


Hs.65370 


lipase; endothelial 


Seq ID 228 & 229 
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TABLE 8 



Seq ID NO: 1 DNA sequence 

Nucleic Acid Accession #: NM_001400 

Coding sequence: 244-2208 (underlined sequences correspond to start and stop codons) ) 



10 



11 



15 



20 



25 



30 



35 



40 



45 



50 



55 



GTCGGGGGCA 
CTTCGCCCTG 
CACAAAAAGC 
CGCCCTCTAG 
ACCATGGGGC 
GTCAACTATG 
GACAAGGAGA 
ATCCTGGAGA 
ATGTACTATT 
GCTAACCTGC 
CGGGAAGGGA 
ATTGAGCGCT 
CTCTTCCTGC 
ATGGGCTGGA 
AAGCACTATA 
CTGTACTGCA 
AACATTTCCA 
ATCGTCCTGA 
GTGGGCTGCA 
GCTGTGCTCA 
CGGGCCTTCA 
TTCAAGCGAC 
CACCCCCAGA 
TCTTCTTCCT 
CCACCCCAGT 
CAAGCCAGAG 
TAGAGTTAGT 
TATATATTCT 
AGCTCCTAAA 
TCTTTGTCTG 
GTGTGCACTT 
TTCATACCCC 
CTGGGGTTGT 
TGGGAAGATG 
CATGTAAGCG 
CATCTTTTCA 
AAGCCCACTT 
AGCAAAACAA 
AAATGAGTCT 
TCTTGTGTGA 
CTTGATTTTT 
GTTAACTTTT 
CGCCAGAACT 
ACAAAGAATA 
AGATGTCTTG 
TTTGCACATA 



GCAGCAAGAT 
CTTGAGCGAG 
CTGGATCACT 
CGTTCGTCTG 
CCACCAGCGT 
ATATCATCGT 
ACAGCATTAA 
ACATCTTTGT 
TTATTGGCAA 
TCTTGTCTGG 
GTATGTTTGT 
ATATCACAAT 
TAATCAGCGC 
ACTGCATCAG 
TCCTCTTCTG 
GAATCTACTC 
AGGCCAGCCG 
GCGTCTTCAT 
AGGTGAAGAC 
ACTCCGGCAC 
TCCGGATCAT 
CCATCATCGC 
AAGACGAAGG 
AGAACTGGAA 
GTTTGGAAAA 
GGAGGAAGGG 
TCCTGTGAAC 
ACCCCCCTGG 
GGGTTCATTT 
GAGCTTTGAG 
CTGCTTCTTT 
TCCTCAACGT 
GGAATGATCG 
AAGATGGTTT 
GGATCCGTTT 
ATGAAATGTG 
TATC TAAATG 
AGTGAAAACC 
AACAAATATG 
TTCATTTCAA 
GAATGTATTT 
CTAGAATCCA 
TTTAAGTCCA 
AAAATATATT 
TTTTTTTAAA 
GCTTTATCAA 



21 
I 

GCGAAGCGAG 
GCTGCGGTTT 
CATCGAACCA 
GAGTAGCGCC 
CCCGCTGGTC 
CCGGCATTAC 
ACTGACCTCG 
CTTGCTGACC 
TCTGGCCCTC 
GGCCACCACC 
GGCCCTGTCA 
GCTGAAAATG 
CTGCTGGGTC 
TGCGCTGTCC 
CACCACGGTC 
CTTGGTCAGG 
CAGCTCTGAG 
CGCCTGCTGG 
CTGTGACATC 
CAACCCCATC 
GTCCTGCTGC 
CGGCATGGAA 
GGACAACCCA 
GCTGTCCACC 
AAATCTCTGG 
GGAGAATACG 
AATGCACTGG 
AGCTTTGATT 
GGCCCCTCCT 
GAGATGTTTT 
AGGGATGCCC 
TCTTTTACTT 
ATCATCTATA 
GGAGGTGTAA 
TTTGGAATTT 
TTACCATTTC 
AT ATT AG CCA 
GAATGGATTA 
ACATCCGTCT 
GCAACAACAT 
GTTTCAGGAA 
CCCTCTTGTG 
GCTATTCATT 
ACTGTCTCTT 
AAGAATAGTA 
CTTTTAAACA 



31 
I 

CCGTACAGAT 
CCGAGGCCCT 
CCCCTGAAGC 
ACCCCGGCTT 
AAGGCCCACC 
AACTACACGG 
GTGGTGTTCA 
ATTTGGAAAA 
TCAGACCTGT 
TACAAGCTCA 
GCCTCCGTGT 
AAACTCCACA 
ATCTCCCTCA 
AGCTGCTCCA 
TTCACTCTGC 
ACTCGGAGCC 
AAGTCGCTGG 
GCACCGCTCT 
CTCTTCAGAG 
ATTTACACTC 
AAGTGCCCGA 
TTCAGCCGCA 
GAGACCATTA 
CACCGGAAGC 
GCTTCGACTG 
AACAGCCTGG 
GAAGGGTGGA 
TTGCACTGAG 
CAAAGACTAA 
CCTTCACTTT 
TGTACATCCC 
TATACTTTAA 
GCAAATAGGC 
AACAATGTCC 
GGTTGAAGTC 
ATATCCATTG 
GGATCCTTGG 
ACTTTTGCAA 
TTCCCACTTT 
GTTGTATTTT 
GAAGTCATTT 
CCCTTAAGCA 
AGATAGTAAT 
TAGTATGGTT 
TTTAATAGGT 
TTAATAAACT 



41 



CCCGGGCTCT 
CTCCAGCCAA 
CAGTGAAGGC 
CCTGGGGACA 
GCAGCTCGGT 
GAAAGCTGAA 
TTCTCATCTG 
CCAAGAAATT 
TGGCAGGAGT 
CTCCCGCCCA 
TCAGTCTCCT 
ACGGGAGCAA 
TCCTGGGTGG 
CCGTGCTGCC 
TTCTGCTCTC 
GCCGCCTGAC 
CGCTGCTCAA 
TCATCCTGCT 
CGGAGTACTT 
TGACCAACAA 
GCGGAGACTC 
GCAAATCGGA 
TGTCTTCTGG 
GCTCTTTACT 
CTGCCAGGGA 
TGGTGTCGGG 
GATCAGGTCC 
CCAAAGGTCT 
TGTCCCCATG 
AGTTTCAAAC 
ACACCCCACC 
CTACCTGAGA 
TATGTTGAGT 
TTCGCTGAGG 
ACTTTGATTT 
AAGCCGAAAT 
TGTCC TAG GA 
ACCAAGGGAG 
TGTTGATGTT 
GTTGTGTTAA 
TATGGATTTT 
TTACTTTAAC 
TGAAGATATG 
TTCAGTGCAA 
TTCTGACTTT 
GATTTTTTTA 



51 
I 

CCGAACGCAA 60 

GGAAAAGCTA 120 

TCTCTCGCCT 180 

CAGGGTTGGC 240 

CTCTGACTAC 300 

TATCAGCGCG 360 

CTGCTTTATC 420 

CCACCGACCC 480 

AGCCTACACA 540 

GTGGTTTCTG 600 

CGCCATCGCC 660 

TAACTTCCGC 720 

CCTGCCTATC 78 0 

GCTCTACCAC 84 0 

CATCGTCATT 900 

GTTCCGCAAG 960 

GACCGTAATT 1020 

CCTGCTGGAT 1080 

CCTGGTGTTA 1140 

GGAGATGCGT 1200 

TGCTGGCAAA 1260 

CAATTCCTCC 1320 

AAACGTCAAC 1380 

TGGTCGCTGG 144 0 

GGAGCTGCTG 1500 

TGTTGGTGGG 1560 

CGGCCTGGAA 1620 

AGCATTGTCA 1680 

TGAAAGCGTC 1740 

CCAAGTGAGT 1800 

CTCCCTTCCC 1860 

GTTATCAGAG 1920 

ACGTAGGCTG 1980 

CCAAAGTTTC 2040 

CTTTAAAAAA 2100 

CTGCATAAGG 2160 

GAAACAGACA 222 0 

ATTTCTTAGC 22 80 

TATTTCAGAA 234 0 

AAGTACTTTT 24 00 

TCTAACCCGT 24 60 

TGGTAGGGAA 2520 

TATAAATATT 2580 

TTAAACCGAG 2 64 0 

TGTGGATCAT 2700 
AAG 



60 



65 



70 



Seq ID NO: 2 Protein sequence : 
protein Accession #: NP 001391 



11 



MGPTSVPLVK 
LENIFVLLTI 
EGSMFVALSA 
GWNCISALSS 
ISKASRSSEK 
VLNSGTNPII 
PQKDEGDNPE 



AHRSSVSDYV 
WKTKKFHRPM 
SVFSLLAIAI 
CSTVLPLYHK 
SLALLKTVII 
YTLTNKEMRR 
TIMSSGNVNS 



21 
i 

NYDIIVRHYN 
YYFIGNLALS 
ERYITMLKMK 
HYILFCTTVF 
VLSVFIACWA 
AFIRIMSCCK 
SS 



31 
I 

YTGKLNISAD 
DLIiAGVAYTA 
LHNGSNNFRL 
TLLLLSIVIL 
PLFILLLLDV 
CPSGDSAGKF 



41 
I 

KENSIKLTSV 
NLLLSGATTY 
FLLISACWVI 
YCRIYSLVRT 
GCKVKTCDIL 
KRPIIAGMEF 



51 
I 

VFILICCFII 6 0 

KLTPAQWFLR 120 

SLILGGLPIM 180 

RSRRLTFRKN 240 

FRAEYFLVLA 300 

SRSKSDNSSH 360 



75 



179 
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Seq ID NO: 3 Nucleotide sequence: 
Nucleic Acid Accession #: NM_016242 

Coding sequence: 79-864 (underlined sequences correspond to start and stop 



1 11 21 31 41 51 

I I I I I I 

AAGGCCCTGC CAGCTTGGGA GGGAATTGTC CCTGCCTGCT TCTGGAGAAA GAAGATATTG 60 

ACACCATCTA CGGGCACCAT GGAACTGCTT CAAGTGACCA TTCTTTTTCT TCTGCCCAGT 12 0 

ATTTGCAGCA GTAACAGCAC AGGTGTTTTA GAGGCAGCTA ATAATTCACT TGTTGTTACT 180 

ACAACAAAAC CATCTATAAC AACACCAAAC ACAGAATCAT TACAGAAAAA TGTTGTCACA 24 0 

CCAACAACTG GAACAACTCC TAAAGGAACA ATCACCAATG AATTACTTAA AATGTCTCTG 300 

ATGTCAACAG CTACTTTTTT AACAAGTAAA GATGAAGGAT TGAAAGCCAC AACCACTGAT 360 

GTCAGGAAGA ATGACTCCAT CATTTCAAAC GTAACAGTAA CAAGTGTTAC ACTTCCCAAT 420 

GCTGTTTCAA CATTACAAAG TTCCAAACCC AAGACTGAAA CTCAGAGTTC AATTAAAACA 4 80 

ACAGAAATAC CAGGTAGTGT TCTACAACCA GATGCATCAC CTTCTAAAAC TGGTACATTA 54 0 

ACCTCAATAC CAGTTACAAT TCCAGAAAAC ACCTCACAGT CTCAAGTAAT AGACACTGAG 600 

GGTGGAAAAA ATGCAAGCAC TTCAGCAACC AGCCGGTCTT ATTCCAGTAT TATTTTGCCG 660 

GTGGTTATTG CTTTGATTGT AATAACACTT TCAGTATTTG TTCTGGTGGG TTTGTACCGA 720 

ATGTGCTGGA AGGCAGATCC GGGCACACCA GAAAATGGAA ATGATCAACC TCAGTCTGAT 780 

AAAGAGAGCG TGAAGCTTCT TACCGTTAAG ACAATTTCTC ATGAGTCTGG TGAGCACTCT 840 

GCACAAGGAA AAACCAAGAA C TGA CAGCTT GAGGAATTCT CTCCACACCT AGGCAATAAT 900 

TACGCTTAAT CTTCAGCTTC TATGCACCAA GCGTGGAAAA GGAGAAAGTC CTGCAGAATC - 960 
AATCCCGACT TCCATACCTG CTGCTGG 



Seq ID NO: 4 Protein sequence: 
Protein Accession #: NP_057326 

1 11 21 31 41 51 

I I" I I I I 

MELLOVTILF llpsicssns tgvleaanns LWTTTKPSI TTPNTESLQK NWTPTTGTT 60 

PKGTITNELL KMSLMSTATF LTSKDEGLKA TTTDVRKNDS IISNVTVTSV TLPNAVSTLQ 120 

SSKPKTETQS SIKTTEIPGS VLQPDASPSK TGTLTS I PVT IPENTSQSQV IDTEGGKNAS 180 

TSATSRSYSS IILPWIALI VITLSVFVLV GLYRMCWKAD PGTPENGNDQ PQSDKESVKL 240 

LTVKTISHES GEHSAQGKTK N 



Seq ID NO: 5 Nucleotide sequence: 

Nucleic Acid Accession #: NM_002205 .. 

Coding sequence: 24.. 3173 {underlined sequences correspond to start 



1 11 21 31 41 51 

1 I l l I I 

CAGGACAGGG AAGAGCGGGC GCTATGGGGA GCCGGACGCC AGAGTCCCCT CTCCACGCCG 60 

TGCAGCTGCG CTGGGGCCCC CGGCGCCGAC CCCCGCTCGT GCCGCTGCTG TTGCTGCTCG 120 

TGCCGCCGCC ACCCAGGGTC GGGGGCTTCA ACTTAGACGC GGAGGCCCCA GCAGTACTCT 180 

CGGGGCCCCC GGGCTCCTTC TTCGGATTCT CAGTGGAGTT TTACCGGCCG GGAACAGACG 240 

GGGTCAGTGT GCTGGTGGGA GC ACCCAAGG CTAATACCAG CCAGCCAGGA GTGCTGCAGG 300 

GTGGTGCTGT CTACCTCTGT CCTTGGGGTG CCAGCCCCAC ACAGTGCACC CCCATTGAAT 360 

TTGACAGCAA AGGCTCTCGG CTCCTGGAGT CCTCACTGTC CAGCTCAGAG GGAGAGGAGC 420 

CTGTGGAGTA CAAGTCCTTG CAGTGGTTCG GGGCAACAGT TCGAGCCCAT GGCTCCTCCA 480 

TCTTGGCATG CGCTCCACTG TACAGCTGGC GCACAGAGAA GGAGCCACTG AGCGACCCCG 540 

TGGGCACCTG CTACCTCTCC ACAGATAACT TCACCCGAAT TCTGGAGTAT GCACCCTGCC 600 

GCTCAGATTT CAGCTGGGCA GCAGGACAGG GTTACTGCCA AGGAGGCTTC AGTGCCGAGT 660 

TCACCAAGAC TGGCCGTGTG GTTTTAGGTG GACCAGGAAG CTATTTCTGG CAAGGCCAGA 720 

TCCTGTCTGC CACTCAGGAG CAGATTGCAG AATCTTATTA CCCCGAGTAC CTGATCAACC 780 

TGGTTCAGGG GCAGCTGCAG ACTCGCCAGG CCAGTTCCAT CTATGATGAC AGCTACCTAG 840 

GATACTCTGT GGCTGTTGGT GAATTCAGTG GTGATGACAC AGAAGACTTT GTTGCTGGTG 900 

TGCCCAAAGG GAACCTCACT TACGG CTATG TCACCATCCT TAATGGCTCA GACATTCGAT 960 

CCCTCTACAA CTTCTCAGGG GAACAGATGG CCTCCTACTT TGGCTATGCA GTGGCCGCCA 1020 

CAGACGTCAA TGGGGACGGG CTGGATGACT TGCTGGTGGG GGCACCCCTG CTCATGGATC 1080 

GGACCCCTGA CGGGCGGCCT CAGGAGGTGG GCAGGGTCTA CGTCTACCTG CAGCACCCAG 1140 

CCGGCATAGA GCCCACGCCC ACCCTTACCC TCACTGGCCA TGATGAGTTT GGCCGATTTG 1200 

GCAGCTCCTT GACCCCCCTG GGGGACCTGG ACCAGGATGG CTACAATGAT GTGGCCATCG 1260 

GGGCTCCCTT TGGTGGGGAG ACCCAGCAGG GAGTAGTGTT TGTATTTCCT GGGGGCCCAG 1320 



ISO 



WO 02/079492 



PCT/US02/04915 



GAGGGCTGGG CTCTAAGCCT TCCCAGGTTC TGCAGCCCCT GTGGGCAGCC AGCCACACCC 1380 

CAGACTTCTT TGGCTCTGCC CTTCGAGGAG GCCGAGACCT GGATGGCAAT GGATATCCTG 144 0 

ATCTGATTGT GGGGTCCTTT GGTGTGGACA AGGCTGTGGT ATACAGGGGC CGCCCCATCG 1500 

TGTCCGCTAG TGCCTCCCTC ACCATCTTCC CCGCCATGTT CAACCCAGAG GAGCGGAGCT 1560 

5 GCAGCTTAGA GGGGAACCCT GTGGCCTGCA TCAACCTTAG CTTCTGCCTC AATGCTTCTG 1620 

GAAAACACGT TGCTGACTCC ATTGGTTTCA CAGTGGAACT TCAGCTGGAC TGGCAGAAGC 1680 

AGAAGGGAGG GGTACGGCGG GCACTGTTCC TGGCCTCCAG GCAGGCAACC CTGACCCAGA 1740 

CCCTGCTCAT CCAGAATGGG GCTCGAGAGG ATTGCAGAGA GATGAAGATC TACCTCAGGA 1800 

ACGAGTCAGA ATTTCGAGAC AAACTCTCGC CGATTCACAT CGCTCTCAAC TTCTCCTTGG 1860 

10 ACCCCCAAGC CCCAGTGGAC AGCCACGGCC TCAGGCCAGC CCTACATTAT CAGAGCAAGA 1920 

GCCGGATAGA GGACAAGGCT CAGATCTTGC TGGACTGTGG AGAAGACAAC ATCTGTGTGC 1980 

CTGACCTGCA GCTGGAAGTG TTTGGGGAGC AGAACCATGT GTACCTGGGT GACAAGAATG 2040 

CCCTGAACCT CACTTTCCAT GCCCAGAATG TGGGTGAGGG TGGCGCCTAT GAGGCTGAGC 2100 

TTCGGGTCAC CGCCCCTCCA GAGGCTGAGT ACTCAGGACT CGTCAGACAC CCAGGGAACT 2160 

15 TCTCCAGCCT GAGCTGTGAC TACTTTGCCG TGAACCAGAG CCGCCTGCTG GTGTGTGACC 2220 

TGGGCAACCC CATGAAGGCA GGAGCCAGTC TGTGGGGTGG CCTTCGGTTT ACAGTCCCTC 2280 

ATCTCCGGGA CACTAAGAAA ACCATCCAGT TTGACTTCCA GATCCTCAGC AAGAATCTCA 2340 

ACAACTCGCA AAGCGACGTG GTTTCCTTTC GGCTCTCCGT GGAGGCTCAG GCCCAGGTCA 2400 

CCCTGAACGG TGTCTCCAAG CCTGAGGCAG TGCTATTCCC AGTAAGCGAC TGGCATCCCC 2460 

20 GAGACCAGCC TCAGAAGGAG GAGGACCTGG GACCTGCTGT CCACCATGTC TATGAGCTCA 2520 

TCAACCAAGG CCCCAGCTCC ATTAGCCAGG GTGTGCTGGA ACTCAGCTGT CCCCAGGCTC 2580 

TGGAAGGTCA GCAGCTCCTA TATGTGACCA GAGTTACGGG ACTCAACTGC ACCACCAATC 2640 

ACCCCATTAA CCCAAAGGGC CTGGAGTTGG ATCCCGAGGG TTCCCTGCAC CACCAGCAAA 2700 

AACGGGAAGC TCCAAGCCGC AGCTCTGCTT CCTCGGGACC TCAGATC CTG AAATGCCCGG 2760 

25 AGGCTGAGTG TTTCAGGCTG CGCTGTGAGC TCGGGCCCCT GCACCAACAA GAGAGCCAAA 2820 

GTCTGCAGTT GCATTTCCGA GTCTGGGCCA AGACTTTCTT GCAGCGGGAG CACCAGCCAT 2880 

TTAGCCTGCA GTGTGAGGCT GTGTACAAAG CCCTGAAGAT GCCCTACCGA ATCCTGCCTC 2940 

GGCAGCTGCC CCAAAAAGAG CGTCAGGTGG CCACAGCTGT GCAATGGACC AAGGCAGAAG 3000 

GCAGCTATGG CGTCCCACTG TGGATCATCA TCCTAGCCAT CCTGTTTGGC CTCCTGCTCC 3060 

30 TAGGTCTACT CATCTACATC CTCTACAAGC TTGGATTCTT CAAACGCTCC CTCCCATATG 3120 

GCACCGCCAT GGAAAAAGCT CAGCTCAAGC CTCCAGCCAC CTCTGATGCC TGA GTCCTCC 3180 

CAATTTCAGA CTCCCATTCC TGAAGAACCA GTCCCCCCAC CCTCATTCTA CTGAAAAGGA 3240 

GGGGTCTGGG TACTTCTTGA AGGTGCTGAC GGCCAGGGAG AAGCTCCTCT CCCCAGCCCA 3300 

GAGACATACT TGAAGGGCCA GAGCCAGGGG GGTGAGGAGC TGGGGATCCC TCCCCCCCAT 3360 

35 GCACTGTGAA GGACCCTTGT TTACACATAC CCTCTTCATG GATGGGGGAA CTCAGATCCA 3420 

GGGACAGAGG CCCAGCCTCC CTGAAGCCTT TGCATTTTGG AGAGTTTCCT GAAACAACTG 3480 

GAAAGATAAC TAGGAAATCC ATTCACAGTT CTTTGGGCCA GACATGCCAC AAGGACTTCC 3540 

TGTCCAGCTC CAACCTGCAA AGATCTGTCC TCAGCCTTGC CAGAGATCCA AAAGAAGCCC 3600 

CCAGTAAGAA CCTGGAACTT GGGGAGTTAA GACCTGGCAG CTCTGGACAG CCCCACCCTG 3660 

40 GTGGGCCAAC AAAGAACACT AACTATGCAT GGTGCCCCAG GACCAGCTCA GGACAGATGC 3720 

CACAAGGATA GATGCTGGCC CAGGGCCAGA GCCCAGCTCC AAGGGGAATC AGAACTCAAA 3780 

TGGGGCCAGA TCCAGCCTGG GGTCTGGAGT TGATCTGGAA CCCAGACTCA GACATTGGCA 384 0 

CCAATCCAGG CAGATCCAGG ACTATATTTG GGCCTGCTCC AGACCTGATC CTGGAGGCCC 3900 

AGTTCACCCT GATTTAGGAG AAGCCAGGAA TTTCCCAGGA CCTGAAGGGG CCATGATGGC 3960 

45 AACAGATCTG GAACCTCAGC CTGGCCAGAC ACAGGCCCTC CCTGTTCCCC AGAGAAAGGG 4 020 

GAGCCCACTG TCCTGGGCCT GCAGAATTTG GGTTCTGCCT GCCAGCTGCA CTGATGCTGC 4 08 0 

CCCTCATCTC TCTGCCCAAC CCTTCCCTCA CCTTGGCACC AGACACCCAG GACTTATTTA 414 0 

AACTCTGTTG CAAGTGCAAT AAATCTGACC CAGTGCCCCC ACTGACCAGA ACTAGAAAAA 420 0 
AAAA 

50 

Seq ID NO: 6 Protein sequence : 
Protein Accession #: NP_002196.1 

55 1 11 21 31 41 51 

I I I 1 I I 

MGSRTPESPL HAVQLRWGPR RRPPLVPLLL LLVPPPPRVG GFNLDAEAPA VLSGPPGSFP 60 

GFSVEFYRPG TDGVSVLVGA PKANTSQPGV LQGGAVYLCP WGASPTQCTP IEFDSKGSRL 120 

LESSLSSSEG EEPVEYKSLQ WFGATVRAHG SSILACAPLY SWRTEKEPLS DPVGTCYLST 180 

60 DNFTRILEYA PCRSDFSWAA GQGYCQGGFS AEFTKTGRW LGGPGSYFWQ GQILSATQEQ 240 

IAESYYPEYL INLVQGQLQT RQASSIYDDS YLGYSVAVGE FSGDDTEDFV AGVPKGNliTY 300 

GYVTILNGSD IRSLYNFSGE QMASYFGYAV AATDVNGDGL DDLLVGAPLL MDRTPDGRPQ 360 

EVGRVYVYLQ HPAGIEPTPT LTLTGHDEFG RFGSSLTPLG DLDQDGYNDV AIGAPFGGET 420 

QQGWFVFPG GPGGLGSKPS QVLQPIjWAAS HTPDFFGSAL RGGRDLDGNG YPDLIVGSFG 480 

65 VDKAWYRGR PIVSASASDT IFPAMFNPEE RSCSLEGNPV ACINLSFCLN ASGKHVADSI 54 0 

GFTVELQLDW QKQKGGVRRA LFLASRQATL TQTLLIQNGA REDCREMKIY LRNESEFRDK 600 

LSPIHIALNF SLDPQAPVDS HGLRPALHYQ SKSRIEDKAQ ILLDCGEDNI CVPDLQLEVF 660 

GEQNHVYLGD KNALNLTFHA QNVGEGGAYE AELRVTAPPE AEYSGLVRHP GNFSSLSCDY 720 

FAVNQSRLLV CDLGNPMKAG ASLWGGLRFT VPHLRDTKKT IQFDFQILSK NLNNSQSDW 780 

70 SFRLSVEAQA QVTLNGVSKP EAVLFPVSDW HPRDQPQKEE DLGPAVHHVY ELINQGPSSI 84 0 

SQGVLELSCP QALEGQQLLY VTRVTGLNCT TNHPINPKGL ELDPEGSLHH QQKREAPSRS 900 

SASSGPQILK CPEAECFRLR CELGPLHQQE SQSLQLHFRV WAKTFLQREH QPFSLQCEAV 960 

YKALKMPYRI LPRQLPQKER QVATAVQWTK AEGSYGVPLW IIILAILFGL LLLGLLIYIL 1020 
YKIiGFFKRSL PYGTAMEKAQ LKPPATSDA 

75 
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Seq ID NO: 7 Nucleotide sequence: 
Nucleic Acid Accession #: NM_002211 

Coding sequence: 104.. 2500 (underlined sequences correspond to start and stop codons) 

5 

1 11 21 31 41 51 

I I I I I I 

GTCCGCCAAA ACCTGCGCGG ATAGGGAAGA ACAGCACCCC GGCGCCGATT GCCGTACCAA SO 
ACAAGCCTAA CGTCCGCTGG GCCCCGGACG CCGCGCGGAA AAG ATGA ATT TACAACCAAT 120 

10 TTTCTGGATT GGACTGATCA GTTCAGTTTG CTGTGTGTTT GCTCAAACAG ATGAAAATAG 180 
ATGTTTAAAA GCAAATGCCA AATCATGTGG AGAATGTATA CAAGCAGGGC CAAATTGTGG 240 
GTGGTGCACA AATTCAACAT TTTTACAGGA AGGAATGCCT ACTTCTGCAC GATGTGATGA 300 
TTTAGAAGCC TTAAAAAAGA AGGGTTGCCC TCCAGATGAC ATAGAAAATC CCAGAGGCTC 360 
CAAAGATATA AAGAAAAATA AAAATGTAAC CAACCGTAGC AAAGGAACAG CAGAGAAGCT 420 

15 CAAGCCAGAG GATATTACTC AGATCCAACC ACAGCAGTTG GTTTTGCGAT TAAGATCAGG 480 
GGAGCCACAG ACATTTACAT TAAAATTCAA GAGAGCTGAA GACTATCCCA TTGACCTCTA 540 
CTACCTTATG GACCTGTCTT ATTCAATGAA AGACGATTTG GAGAATGTAA AAAGTCTTGG 600 
AACAGATCTG ATGAATGAAA TGAGGAGGAT TACTTCGGAC TTCAGAATTG GATTTGGCTC 660 
ATTTGTGGAA AAGACTGTGA TGCCTTACAT TAGCACAACA CCAGCTAAGC TCAGGAACCC 720 

20 TTGCACAAGT GAACAGAACT GCACCACCCC ATTTAGCTAC AAAAATGTGC TCAGTCTTAC 780 
TAATAAAGGA GAAGTATTTA ATGAACTTGT TGGAAAACAG CGCATATCTG GAAATTTGGA 840 
TTCTCCAGAA GGTGGTTTCG ATGCCATCAT GCAAGTTGCA GTTTGTGGAT CACTGATTGG 900 
CTGGAGGAAT GTTACACGGC TGCTGGTGTT TTCCACAGAT GCCGGGTTTC ACTTTGCTGG 960 

AGATGGGAAA CTTGGTGGCA TTGTTTTACC AAATGATGGA CAATGTCACC TGGAAAATAA 1020 

25 TATGTACACA ATGAGCCATT ATTATGATTA TCCTTCTATT GCTCACCTTG TCCAGAAACT 1080 

GAGTGAAAAT AATATTCAGA CAATTTTTGC AGTTACTGAA GAATTTCAGC CTGTTTACAA 1140 

GGAGCTGAAA AACTTGATCC CTAAGTCAGC AGTAGGAACA TTATCTGCAA ATTCTAGCAA 1200 

TGTAATTCAG TTGATCATTG ATGCATACAA TTCCCTTTCC TCAGAAGTCA TTTTGGAAAA 1260 

CGGCAAATTG TCAGAAGGAG TAACAATAAG TTACAAATCT TACTGCAAGA ACGGGGTGAA 1320 

30 TGGAACAGGG GAAAATGGAA GAAAATGTTC CAATATTTCC ATTGGAGATG AGGTTCAATT 1380 

TGAAATTAGC ATAACTTCAA ATAAGTGTCC AAAAAAGGAT TCTGACAGCT TTAAAATTAG 1440 

GCCTCTGGGC TTTACGGAGG AAGTAGAGGT TATTCTTCAG TACATCTGTG AATGTGAATG 1500 

CCAAAGCGAA GGCATCCCTG AAAGTCCCAA GTGTCATGAA GGAAATGGGA CATTTGAGTG 1560 

TGGCGCGTGC AGGTGCAATG AAGGGCGTGT TGGTAGACAT TGTGAATGCA GCACAGATGA 1620 

35 AGTTAACAGT GAAGACATGG ATGCTTACTG CAGGAAAGAA AACAGTTCAG AAATCTGCAG 1680 

TAACAATGGA GAGTGCGTCT GCGGACAGTG TGTTTGTAGG AAGAGGGATA ATAGAAATGA 1740 

AATTTATTCT GGCAAATTCT GCGAGTGTGA TAATTTCAAC TGTGATAGAT CCAATGGCTT 1800 

AATTTGTGGA GGAAATGGTG TTTGCAAGTG TCGTGTGTGT GAGTGCAACC CCAACTACAC 1860 

TGGCAGTGCA TGTGACTGTT CTTTGGATAC TAGTACTTGT GAAGCCAGCA ACGGACAGAT 1920 

40 CTGCAATGGC CGGGGCATCT GCGAGTGTGG TGTCTGTAAG TGTACAGATC CGAAGTTTCA 1980 

AGGGCAAACG TGTGAGATGT GTCAGACCTG CCTTGGTGTC TGTGCTGAGC ATAAAGAATG 2040 

TGTTCAGTGC AGAGCCTTCA ATAAAGGAGA AAAGAAAGAC ACATGCACAC AGGAATGTTC 2100 

CTATTTTAAC ATT ACCAAG G TAGAAAGTCG GGACAAATTA CCCCAGCCGG TCCAACCTGA 2160 

TCCTGTGTCC CATTGTAAGG AGAAGGATGT TGACGACTGT TGGTTCTATT TTACGTATTC 2220 

45 AGTGAATGGG AACAACGAGG TCATGGTTCA TGTTGTGGAG AATCCAGAGT GTCCCACTGG 2280 

TCCAGACATC ATTCCAATTG TAGCTGGTGT GGTTGCTGGA ATTGTTCTTA TTGGCCTTGC 2340 

ATTACTGCTG ATATGGAAGC TTTTAATGAT AATTCATGAC AGAAGGGAGT TTGCTAAATT 2400 

TGAAAAGGAG AAAATGAATG CCAAATGGGA CACGGGTGAA AATCCTATTT ATAAGAGTGC 2460 

CGTAACAACT GTGGTCAATC CGAAGTATGA GGGAAAA TGA GTACTGCCCG TGCAAATCCC 2520 

50 ACAACACTGA ATGCAAAGTA GCAATTTCCA TAGTCACAGT TAGGTAGCTT TAGGGCAATA 2580 

TTGCCATGGT TTTACTCATG TGCAGGTTTT GAAAATGTAC AATATGTATA ATTTTTAAAA 264 0 

TGTTTTATTA TTTTGAAAAT AATGTTGTAA TTCATGCCAG GGACTGACAA AAGACTTGAG 2700 

ACAGGATGGT TATTCTTGTC AGCTAAGGTC ACATTGTGCC TTTTTGACCT TTTCTTCCTG 2760 

GACTATTGAA ATCAAGCTTA TTGGATTAAG TGATATTTCT ATAGCGATTG AAAGGGCAAT 2820 

55 AGTTAAAGTA ATGAGCATGA TGAGAGTTTC TGTTAATCAT GTATTAAAAC TGATTTTTAG 2880 

CTTTACATAT GTCAGTTTGC AGTTATGCAG AATCCAAAGT AAATGTCCTG CTAGCTAGTT 294 0 

AAGGATTGTT TTAAATCTGT TATTTTGCTA TTTGCCTGTT AGACATGACT GATGACATAT 3000 

CTGAAAGACA AGTATGTTGA GAGTTGCTGG TGTAAAATAC GTTTGAAATA GTTGATCTAC 3060 

AAAGGCCATG GGAAAAATTC AGAGAGTTAG GAAGGAAAAA CCAATAGCTT TAAAACCTGT 3120 

60 GTGCCATTTT AAGAGTTACT TAATGTTTGG TAACTTTTAT GCCTTCACTT TACAAATTCA 3180 

AGCCTTAGAT AAAAGAACCG AGCAATTTTC TGCTAAAAAG TCCTTGATTT AGCACTATTT 3240 

ACATACAGGC CATACTTTAC AAAGTATTTG CTGAATGGGG ACCTTTTGAG TTGAATTTAT 3300 

TTTATTATTT TTATTTTGTT TAATGTCTGG TGCTTTCTAT CACCTCTTCT AATCTTTTAA 3360 

TGTATTTGTT TGCAATTTTG GGGTAAGACT TTTTTATGAG TACTTTTTCT TTGAAGTTTT 3420 

65 AGCGGTCAAT TTGCCTTTTT AATGAACATG TGAAGTTATA CTGTGGCTAT GCAACAGCTC 3480 

TCACCTACGC GAGTCTTACT TTGAGTTAGT GCCATAACAG ACCACTGTAT GTTTACTTCT 3540 

CACCATTTGA GTTGCCCATC TTGTTTCACA CTAGTCACAT TCTTGTTTTA AGTGCCTTTA 3600 
GTTTTAACAG TTCA 

70 Seq ID NO: 8 Protein sequence: 
Protein Accession #: NP_002202 



1 11 21 31 41 51 

75 | | I I I I 

MNLQPIFWIG LISSVCCVFA QTDENRCLKA NAKSCGECIQ AGPNCGWCTN STFLQEGMPT 60 
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SARCDDLEAL KKKGCPPDDI ENPRGSKDIK KNKNVTNRSK GTAEKLKPED ITQIQPQQLV 120 

LRLRSGEPQT FTLKFKRAED YPIDLYYLMD LSYSMKDDLE NVKSLGTDLM NEMRRITSDF 180 

RIGFGSPVEK TVMPYISTTP AKLRNPCTSE QNCTSPFSYK NVLSLTNKGE VFNELVGKQR 240 

ISGNLDSPEG GFDAIMQVAV CGSLIGWRNV TRLLVFSTDA GFHFAGDGKL GGIVLPNDGQ 300 

5 CHLENNMYTM SHYYDYPSIA HLVQKLSENN IQTIFAVTEE FQPVYKELKN LIPKSAVGTL 360 

SANSSNVIQL IIDAYNSLSS EVILENGKLS EGVTISYKSY CKNGVNGTGE NGRKCSNISI 420 

GDEVQFEISI TSNKCPKKDS DSFKIRPLGF TEEVEVILQY ICECECQSEG IPESPKCHEG 480 

NGTFECGACR CNEGRVGRHC ECSTDEVNSE DMDAYCRKEN SSEICSNNGE CVCGQCVCRK 540 

RDNTNEIYSG KFCECDNFNC DRSNGLICGG NGVCKCRVCE CNPNYTGSAC DCSIiDTSTCE 600 

10 ASNGQICNGR GICECGVCKC TDPKFQGQTC EMCQTCLGVC AEHKECVQCR AFNKGEKKDT 660 

CTQECSYFNI TKVESRDKLP QPVQPDPVSH CKEKDVDDCW FYFTYSVNGN NEVMVHWEN 720 

PECPTGPDII PIVAGWAGI VLIGI1ALI1LI WKLLMIIHDR REFAKFEKEK MNAKWDTGEN 780 
PIYKSAVTTV VNPKYEGK 

15 Seq ID NO: 9 Nucleotide sequence: " 
Nucleic Acid Accession #:NM_002425 

Coding sequence: 23.. 1453 (underlined sequences correspond to start and stop codons) 



20 1 11 21 31 41 51 

I I I i I I 

AAAGAAGGTA AGGGCAGTGA GAATGATGCA TCTTGCATTC CTTGTGCTGT TGTGTCTGCC 60 

• AGTCTGCTCT GCCTATCCTC TGAGTGGGGC AGCAAAAGAG GAGGACTCCA ACAAGGATCT 120 

TGCCCAGCAA TACCTAGAAA AGTACTACAA CCTCGAAAAG GATGTGAAAC AGTTTAGAAG 180 

25 AAAGGACAGT AATCTCATTG TTAAAAAAAT CCAAGGAATG CAGAAGTTCC TTGGGTTGGA 24 0 

GGTGACAGGG AAGCTAGACA CTGACACTCT GGAGGTGATG CGCAAGCCCA GGTGTGGAGT 300 

TCCTGACGTT GGTCACTTCA GCTCCTTTCC TGGCATGCCG AAGTGGAGGA AAACCCACCT 360 

TACATACAGG ATTGTGAATT ATACACCAGA TTTGCCAAGA GATGCTGTTG ATTCTGCCAT 420 

TGAGAAAGCT CTGAAAGTCT GGGAAGAGGT GACTCCACTC ACATTCTCCA GGCTGTATGA 480 

30 AGGAGAGGCT GATATAATGA TCTCTTTCGC AGTTAAAGAA CATGGAGACT TTTACTCTTT 540 

TGATGGCCCA GGACACAGTT TGGCTCATGC CTACCCACCT GGACCTGGGC TTTATGGAGA 600 

TATTCACTTT GATGATGATG AAAAATGGAC AGAAGATGCA TCAGGCACCA ATTTATTCCT 660 

CGTTGCTGCT CATGAACTTG GCCACTCCCT GGGGCTCTTT CACTCAGCCA ACACTGAAGC 720 

TTTGATGTAC CCACTCTACA ACTCATTCAC AGAGCTCGCC CAGTTCCGCC TTTCGCAAGA 780 

35 TGATGTGAAT GGCATTCAGT CTCTCTACGG ACCTCCCCCT GCCTCTACTG AGGAACCCCT 840 

GGTGCCCACA AAATCTGTTC CTTCGGGATC TGAGATGCCA GCCAAGTGTG ATCCTGCTTT 900 

GTCCTTCGAT GCCATCAGCA CTCTGAGGGG AGAATATCTG TTCTTTAAAG ACAGATATTT 96 0 

TTGGCGAAGA TCCCACTGGA ACCCTGAACC TGAATTTCAT TTGATTTCTG CATTTTGGCC 1020 

CTCTCTTCCA TCATATTTGG ATGCTGCATA TGAAGTTAAC AGCAGGGACA CCGTTTTTAT 1080 

40 TTTTAAAGGA AATGAGTTCT GGGCCATCAG AGGAAATGAG GTACAAGCAG GTTATCCAAG 114 0 

AGGCATCCAT ACCCTGGGTT TTCCTCCAAC CATAAGGAAA ATTGATGCAG CTGTTTCTGA 1200 

CAAGGAAAAG AAGAAAACAT ACTTCTTTGC AGCGGACAAA TACTGGAGAT TTGATGAAAA 1260 

TAGCCAGTCC ATGGAGCAAG GCTTCCCTAG ACTAATAGCT GATGACTTTC CAGGAGTTGA 1320 

GCCTAAGGTT GATG CTGT AT TACAGGCATT TGGATTTTTC TACTTCTTCA GTGGATCATC 1380 

45 ACAGTTTGAG TTTGACCCCA ATGCCAGGAT GGTGACACAC ATATTAAAGA GTAACAG CTG 144 0 

GTTACATTGC TAGGCGAGAT AGGGGGAAGA CAGATATGGG TGTTTTTAAT AAATCTAATA 1500 

ATTATTCATC TAATGTATTA TGAGCCAAAA TGGTTAATTT TTCCTGCATG TTCTGTGACT 156 0 

GAAGAAGATG AGCCTTGCAG ATATCTG CAT GTGTCATGAA GAATGTTTCT GGAATTCTTC 1620 

ACTTGCTTTT GAATTGCACT GAACAGAATT AAGAAATACT CATGTGCAAT AGGTGAGAGA 1680 

50 ATGTATTTTC ATAGATGTGT TATTACTTCC TCAATAAAAA GTTTTATTTT GGGCCTGTTC 1740 
CTT 



Seq ID NO: 10 Protein sequence: 
55 Protein Accession #: NP 002416 

1 11 21 31 41 51 

I I I I I I 

MMHLAFLVLL CLPVCSAYPL SGAAKEEDSN KDLAQQYLEK YYNLEKDVKQ FRRKDSNLIV 60 

60 KKIQGMQKFL GLEVTGKLDT DTLEVMRKPR CGVPDVGHFS SFPGMPKWRK THLTYRIVNY 120 

TPDLPRDAVD SAIEKALKVW EEVTPLTFSR LYEGEADIMI SFAVKEHGDF YSFDGPGHSL 180 

AHAYPPGPGL YGDIHFDDDE KWTEDASGTN LFLVAAHELG HSLGLFHSAN TEALMYPLYN 24 0 

SFTELAQFRL SQDDVNGIQS LYGPPPASTE EPLVPTKSVP SGSEMPAKCD PALSFDAIST 300 

LRGEYLFFKD RYFWRRSHWN PEPEFHLISA FWPSLPSYLD AAYEVNSRDT VFIFKGNEFW 360 

65 AIRGNEVQAG YPRGIHTLGF PPTIRKIDAA VSDKEKKKTY FFAADKYWRF DENSQSMEQG 42 0 

FPRLIADDFP GVEPKVDAVL QAFGFFYFFS GSSQFEFDPN ARMVTHILKS NSWLHC 



70 Seq ID NO: 11 Nucleotide sequence: 
Nucleic Acid Accession #: XM_058189 

Coding sequence: 169. .774 (underlined sequences correspond to, start and stop codons) 

1 11 21 31 41 51 

75 | | I | | I 

GAAGACCAGC TCAGCTCTTC AGTTGTTGAT CATTGTCTAT TGTTCTCCAA ACAGTAAACC 60 
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AGTATTTCAC ACTGAGATTG TCGGCTGCGG GTATATTCCA ATTCCCCGTC TCCTCATGAA 120 

TATGAAGTGA AGGGCTCTGA CCCTGGAAGT GGTTCTAAGC AGGGCAA AAT GG GGTCTCGG 180 

AAGTGTGGAG GCTGCCTAAG TTGTTTGCTG ATTCCGCTTG CACTTTGGAG TATAATCGTG 240 

AACATATTAT TGTATTTCCC GAATGGGCAA ACTTCCTATG CATCCAGCAA TAAACTCACC 300 

AACTACGTGT GGTATTTTGA AGGAATCTGT TTCTCAGGCA TCATGATGCT TATAGTAACA 360 

ACAGTTCTTC TGGTACTGGA GAATAATAAC AACTATAAAT GTTGCCAGAG TGAAAACTGC 420 

AGCAAAAAAT ATGTGACACT GCTGTCAATT ATCTTTTCTT CCCTCGGAAT TGCTTTTTCT 4 80 

GGATACTGCC TGGTCATCTC TGCCTTGGGT CTTGTCCAAG GGCCATATTG CCGCACCCTT 540 

GATGGCTGGG AGTATGCTTT TGAAGGCACT GCTGGACGTT TCCTTACAGA TTCTAGCATA 600 

TGGATTCAGT GCCTGGAACC TGCACATGTT GTGGAGTGGA ACATCATTTT ATTTTCCATT 660 

CTCATAACCC TCAGTGGGCT TCAAGTGATC ATCTGCCTCA TCAGAGTAGT CATGCAACTA 720 

TCCAAGATAC TGTGTGGAAG CTATTCAGTG ATCTTCCAGC CTGGAATCAT TTGAATAAGG 780 

ACAAAATGTT TTCCATTATC AAGACATGGC CATCTATCTA AATATTATAT CAACTGTGTA 840 

GACTTGAGGG CAATATTGAA ATGATGGTGC TTTCTGCATT TGGTGTTTAT TTGTAAAAAA 900 

TTTGCAGTCC TCACTGCACA TGCAAGTATA CCACCCTTCC ATTTAGTATG TTTTTTAAGT 960 

AATATGCATC AGAAACTTCA GAAATACTTC TGCCCTTTGA TCAAACAAAT CCATTTCCAA 1020 

GAATCTGTAC TAGGGAAGTA AATAAGAATA TGAGAGAAAC CTTTATGCAA ATATGTATAT 1080 

TGCAACATTA TTTAATATTC TGGAAAATTG GAAACACCCC AAAATTCTAA ACTCAGAGGA 1140 

AGGATTAAGT AAAGAGTGGT ACATACTGTA AATGTTTTCT GATATTAAAA AAAAAATTAA 1200 
ATAAAAAATA AAGAGTACTA CATGGTTGTA AAA 

Seq ID NO: 12 Protein sequence: 
Protein Accession #: XP 058189 



1 11 21 31 41 51 

I I I I I I 

MGSRKCGGCL SCLLIPLALW SIIVNILLYF PNGQTSYASS NKLTNYVWYF EGICFSGIMM 

LIVTTVLIiVL ENNNNYKCCQ SENCSKKYVT LLSIIFSSLG IAFSGYCLVI SALGLVQGPY 

CRTLDGWEYA FEGTAGRFLT DSSIWIQCLE PAHWEWNII LFSILITLSG LQVIICLIRV 

VMQLSKILCG SYSVIFQPGI I 



60 
120 
180 



Seq ID NO: 13 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005397 

Coding sequence: 251.. 1837 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

AAACGCCGCC CAGGACGCAG CCGCCGCCGC CGCCGCTCCT CTGCCACTGG CTCTGCGCCC 60 

CAGCCCGGCT CTGCTGCAGC GGCAGGGAGG AAGAGCCGCC GCAGCGCGAC TCGGGAGCCC 120 

CGGGCCACAG CCTGGCCTCC GGAGCCACCC ACAGGCCTCC CCGGGCGGCG CCCACGCTCC 180 

TACCGCCCGG ACGCGCGGAT CCTCCGCCGG CACCGCAGCC ACCTGCTCCC GGCCCAGAGG 240 

CGACGACACG ATG CGCTGCG CGCTGGCGCT CTCGGCGCTG CTGCTACTGT TGTCAACGCC 300 

GCCGCTGCTG CCGTCGTCGC CGTCGCCGTC GCCGTCGCCG TCGCCCTCCC AGAATGCAAC 360 

CCAGACTACT ACGGACTCAT CTAACAAAAC AG C ACCG ACT CCAGCATCCA GTGTCACCAT 420 

CATGGCTACA GATACAGCCC AG C AG AG C AC AGTCCCCACT TCCAAGGCCA ACGAAATCTT 480 

GGCCTCGGTC AAGGCGACCA CCCTTGGTGT ATCCAGTGAC TCACCGGGGA CTACAACCCT 540 

GGCTCAGCAA GTCTCAGGCC CAGTCAACAC TACCGTGGCT AGAGGAGGCG GCTCAGGCAA 600 

CCCTACTACC ACCATCGAGA GCCCCAAGAG CACAAAAAGT GCAGACACCA CTACAGTTGC 660 

AACCTCCACA GCCACAGCTA AACCTAACAC CACAAGCAGC CAGAATGGAG CAGAAGATAC 720 
AACAAACTCT GGGGGGAAAA GCAGCCACAG TGTG AC CAC A GACCTCACAT CCACTAAGGC 780 

AGAACATCTG ACGACCCCTC ACCCTACAAG TCCACTTAGC CCCCGACAAC CCACTTTGAC 840 

GCATCCTGTG GCCACCCCAA CAAGCTCGGG ACATGACCAT CTTATGAAAA TTTCAAGCAG 900 

TTCAAGCACT GTGGCTATCC CTGGCTACAC CTTCACAAGC CCGGGGATGA CCACCACCCT 960 

ACCGTCATCG GTTATCTCGC AAAGAACTCA ACAGACCTCC AGTCAGATGC CAGCCAGCTC 1020 

TACGGCCCCT TCCTCCCAGG AGACAGTGCA GCCCACGAGC CCGGCAACGG CATTGAGAAC 1080 

ACCTACCCTG CCAGAGACCA TGAGCTCCAG CCCCACAGCA GCATCAACTA CCCACCGATA 1140 

CCCCAAAACA CCTTCTCCCA CTGTGGCTCA TGAGAGTAAC TGGGCAAAGT GTGAGGATCT 1200 

TGAGACACAG ACACAGAGTG AGAAGCAGCT CGTCCTGAAC CTCACAGGAA ACACCCTCTG 1260 

TGCAGGGGGC GCTTCGGATG AGAAATTGAT CTCACTGATA TGCCGAGCAG TCAAAGCCAC 1320 

CTTCAACCCG GCCCAAGATA AGTGCGGCAT ACGGCTGGCA TCTGTTCCAG GAAGTCAGAC 1380 

CGTGGTCGTC AAAGAAATCA CTATTCACAC TAAGCTCCCT GCCAAGGATG TGTACGAGCG 144 0 

GCTGAAGGAC AAATGGGATG AACTAAAGGA GGCAGGGGTC AGTGACATGA AGCTAGGGGA 1500 

CCAGGGGCCA CCGGAGGAGG CCGAGGACCG CTTCAGCATG CCCCTCATCA TCACCATCGT 1560 

CTGCATGGCG TCATTCCTGC TCCTCGTGGC GGCCCTCTAT GGCTGCTGCC ACCAGCGCCT 1620 

CTCCCAGAGG AAGGACCAGC AGCGGCTAAC AGAGGAGCTG CAGACAGTGG AGAATGGTTA 1680 

CCATGACAAC CCAACACTGG AAGTGATGGA GACCTCTTCT GAGATGCAGG AGAAGAAGGT 1740 

GGTCAGCCTC AACGGGGAGC TGGGGGACAG CTGGATCGTC CCTCTGGACA ACCTGACCAA 1800 

GGACGACCTG GATGAGGAGG AAGACACACA CCTCTAGTCC GGTCTGCCGG TGGCCTCCAG 1860 

CAGCACCACA GAGCTCCAGA CCAACCACCC CAAGTGCCGT TTGGATGGGG AAGGGAAAGA 1920 

CTGGGGAGGG AGAGTGAACT CCGAGGGGTG TCCCCTCCCA ATCCCCCCAG GGCCTTAATT I960 

TTTCCCTTTT CAACCTGAAC AAATCACATT CTGTCCAGAT TCCTCTTGTA AAATAACCCA 204 0 

CTAGTGCCTG AGCTCAGTGC TGCTGGATGA TGAGGGAGAT CAAGAAAAAG CCACGTAAGG 2100 

GACTTTATAG ATGAACTAGT GGAATCCCTT CATTCTGCAG TGAGATTGCC GAGACCTGAA 2160 

GAGGGTAAGT GACTTGCCCA AGGTCAGAGC CACTTGGTGA CAGAGCCAGG ATGAGAACAA 2220 
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AGATTCCATT TGCACCATGC CACACTGCTG TGTTCACATG TGCCTTCCGT CCAGAGCAGT 228 0 

CCCGGGCAGG GGTGAAACTC CAGCAGGTGG CTGGGCTGGA AAGGAGGGCA GGGCTACATC 234 0 

CTGGCTCGGT GGGATCTGAC GACCTGAAAG TCCAGCTCCC AAGTTTTCCT TCTCCTACCC 2400 

CAGCCTCGTG TACCCATCTT CCCACCCTCT ATGTTCTTAC CCCTCCCTAC ACTCAGTGTT 2460 

5 TGTTCCCACT TACTCTGTCC TGGGGCCTCT GGGATTAGCA CAGGTTATTC ATAACCTTGA 2520 

ACCCCTTGTT CTGGATTCGG ATTTTCTCAC ATTTGCTTCG TGAGATGGGG GCTTAACCCA 2580 

CACAGGTCTC CGTGCGTGAA CCAGGTCTGC TTAGGGGACC TGCGTGCAGG TGAGGAGAGA 2640 

AGGGGACACT CGAGTCCAGG CTGGTATCTC AGGGCAGCTG ATGAGGGGTC AGCAGGAACA 2700 

CTGGCCCATT GCCCCTGGCA CTCCTTGCAG AGGCCACCCA CGATCTTCTT TGGGCTTCCA 2760 

10 TTTCCACCAG GGACTAAAAT CTGCTGTAGC TAGTGAGAGC AGCGTGTTCC TTTTGTTGTT 2820 

CACTGCTCAG CTGATGGGAG TGATTCCCTG AGACCCAGTA TGAAAGAGCA GTGGCTGCAG 2880 

GAGAGGCCTT CCCGGGGCCC CCCATCAGCG ATGTGTCTTC AGAGACAATC CATTAAAGCA 2940 

GCCAGGAAGG ACAGGCTTTC CCCTGTATAT CATAGGAAAC TCAGGGACAT TTCAAGTTGC 3000 

TGAGAGTTTT GTTATAGTTG TTTTCTAACC CAGCCCTCCA CTGCCAAAGG CCAAAAGCTC 3060 

15 AGACAGTTGG CAGACGTCCA GTTAGCTCAT CTCACTCACT CTGATTCTCC TGTGCCACAG 3120 

GAAAAGAGGG CCTGGAAAGC GCAGTGCATG CTGGGTGCAT GAAGGGCAGC CTGGGGGACA 3180 

GACTGTTGTG GGAACGTCCC ACTGTCCTGG CCTGGAGCTA GGCCTTGCTG TTCCTCTTCT 3240 

CTGTGAGCCT AGTGGGGCTG CTGCGGTTCT CTTGCAGTTT CTGGTGGCAT CTCAGGGGAA 3300 

CACAAAAGCT ATGTCTATTC CCCAATATAG GACTTTTATG GGCTCGGCAG TTAGCTG CCA 3360 

20 TGTAGAAGGC TCCTAAGCAG TGGGCATGGT GAGGTTTCAT CTGATTGAGA AGGGGGAATC 3420 

CTGTGTGGAA TGTTGAACTT TCGCCATGGT CTCCATCGTT CTGGGCGTAA ATTCCCTGGG 3480" 

ATCAAGTAGG AAAATGGGCA GAACTGCTTA GGGGAATGAA ATTGCCATTT TTCGGGTGAA 3540 

ACGCCACACC TCCAGGGTCT TAAGAGTCAG GCTCCGGCTG TAGTAGCTCT GATGAAATAG 3 600 

GCTATCCACT CGGGATGGCT TACTTTTTAA AAGGGTAGGG GGAGGGGCTG GGGAAGATCT 3660 

25 GTCCTGCACC ATCTGCCTAA TTCCTTCCTC ACAGTCTGTA GCCATCTGAT ATCCTAGGGG 3 720 

GAAAAGGAAG GCCAGGGGTT CACATAGGGC CCCAG CGAGT TTCCCAGGAG TTAGAGGGAT 3780 

GCGAGGCTAA CAAGTTCCAA AAACATCTGC CCCGATGCTC TAGTGTTTGG AGGTGGGCAG 3 840 

GATGGAGAAC AGTGCCTGTT TGGGGGAAAA CAGGAAATCT TGTTAGG CTT GAGTGAGGTG 3900 

TTTGCTTCCT TCTTGCCCAG CGCTGGGTTC TCTCCACCCA GTAGGTTTTC TGTTGTGGTC 3 960 

30 CCGTGGGAGA GGCCAGACTG GATTATTCCT CCTTTGCTGA TCCTGGGTCA CACTTCACCA 4 020 

GCCAGGGCTT TTGACGGAGA CAGCAAATAG GCCTCTGCAA ATCAATCAAA GGCTGCAACC 4 080 

CTATGGCCTC TTGGAGACAG ATGATGACTG GCAAGGACTA GAGAGCAGGA GTGCCTGGCC 4140 

AGGTCGGTCC TGACTCTCCT GACTCTCCAT CGCTCTGTCC AAGGAGAACC CGGAGAGGCT 4200 

CTGGGCTGAT TCAGAGGTTA CTGCTTTATA TTCGTCCAAA CTGTGTTAGT CTAGGCTTAG 4260 

35 GACAGCTTCA GAATCTGACA CCTTGCCTTG CTCTTGCCAC CAGGACACCT ATGTCAACAG 4320 

GCCAAACAGC CATGCATCTA TAAAGGTCAT CATCTTCTGC CACCTTTACT GGGTTCTAAA 4380 

TGCTCTCTGA TAATTCAGAG AGCATTGGGT CTGGGAAGAG GTAAGAGGAA CACTAGAAGC 4440 

TCAGCATGAC TTAAACAGGT TGTAGCAAAG ACAGTTTATC ATCAACTCTT TCAGTGGTAA 4500 

ACTGTGGTTT CCCCAAGCTG CACAGGAGGC CAGAAACCAC AAGTATGATG ACTAGGAAGC 4560 

40 CTACTGTCAT GAGAGTGGGG AG ACAGG C AG CAAAGCTTAT GAAGGAGGTA CAGAATATTC 4620 

TTTGCGTTGT AAGACAGAAT ACGGGTTTAA TCTAGTCTAG GCRCCAGATT TTTTTCCCGC 4680 

TTGATAAGGA AAGCTAGCAG AAAGTTTATT TAAACCACTT CTTGAGCTTT ATCTTTTTTG 4 740 

AC AATATAC T GGAGAAACTT TGAAGAACAA GTTCAAACTG ATACATATAC ACATATTTTT 4800 

TTGATAATGT AAATACAGTG ACCATGTTAA CCTACCCTGC ACTGCTTTAA GTGAACATAC 4 860 

45 TTTGAAAAAG CATTATGTTA GCTGAGTGAT GGCCAAGTTT TTTCTCTGGA CAGGAATGTA 4920 

AATGTCTTAC TGGAAATGAC AAGTTTTTGC TTGATTTTTT TTTTTAAACA AAAAATGAAA 4980 

TATAACAAGA CAAACTTATG ATAAAGTATT TGTCTTGTAG ATCAGGTGTT TTGTTTTGTT 5040 

TTTTTAATTT TAAAATGCAA CCCTGCCCCC TCCCCAGCAA AGTCACAGCT CCATTTCAGT 5100 

AAAGGTTGGA GTCAATATGC TCTGGTTGGC AGGCAACCCT GTAGTCATGG AGAAAGGTAT 5160 

50 TTCAAGATCT AGTCCAATCT TTTTCTAGAG AAAAAGATAA TCTGAAGCTC ACAAAGATGA 5220 

AGTGACTTCC TCAAAATCAC ATGGTTCAGG ACAGAAACAA GATTAAAACC TGGATCCACA 5280 

GACTGTGCGC CTCAGAAGGA ATAATCGGTA AATTAAGAAT TGCTACTCGA AGGTGCCAGA 5340 

ATGACACAAA GGACAGAATT CCTTTCCCAG TTGTTACCCT AGCAAGGCTA GGGAGGGCAT 5400 

GAACACAAAC ATAAGAACTG GTCTTCTCAC ACTTTCTCTG AATCATTTAG GTTTAAGATG 5460 

55 TAAGTGAACA ATTCTTTCTT TCTGCCAAGA AACAAAGTTT TGGATGAGCT TTTATATATG 5520 

GAACTTACTC CAACAGGACT GAGGGACCAA GGAAACATGA TGGGGGAGGC AAGAGAGGGC 5580 

AAAGAGTAAA ACTGTAGCAT AGCTTTTGTC ACGGTCACTA GCTGATCCCT CAGGTCTGCT 5640 

GCAAACACAG CATGGAGGAC ACAGATGACT CTTTGGTGTT GGTCTTTTTG TCTGCAGTGA 5700 

ATGTTCAACA GTTTGCCCAG GAACTGGGGG ATCATATATG TCTTAGTGGA CAGGGGTCTG 5760 

60 AAGTACACTG GAATTTACTG AGAAACTTGT TTGTAAAAAC TATAGTTAAT AATTATTGCA 5820 
TTTTCTTACA AAAATATATT TTGGAAAATT GTATACTGTC AATTAAAGT 



65 



Seq ID NO: 14 Protein sequence: 
Protein Accession #: NP_005388 



11 21 31 41 51 

I I I I I 

MRCALALSAL LLLLSTPPLL PSSPSPSPSP SPSQNATQTT TDSSNKTAPT PASSVTIMAT 60 

DTAQQSTVPT SKANEILASV KATTLGVSSD SPGTTTLAQQ VSGPVNTTVA RGGGSGNPTT 120 

70 TIESPKSTKS ADTTTVATST ATAKPNTTSS QNGAEDTTNS GGKSSHSVTT DLTSTKAEHL 180 

TTPHPTSPLS PRQPTLTHPV ATPTSSGHDH LMKISSSSST VAIPGYTFTS PGMTTTLPSS 240 

VISQRTQQTS SQMPASSTAP SSQETVQPTS PATALRTPTL PETMSSSPTA ASTTHRYPKT 300 

PSPTVAHESN WAKCEDLETQ TQSEKQLVLN LTGNTLCAGG ASDEKLISLI CRAVKATFNP 360 

AQDKCGIRLA SVPGSQTVW KEITIHTKLP AKDVYERLKD KWDELKEAGV SDMKLGDQGP 420 

75 PEEAEDRFSM PLIITIVCMA SFLLLVAALY GCCHQRLSQR KDQQRIjTEEL QTVENGYHDN 4 80 

PTLEVMETSS EMQEKKWSL NGELGDSWIV PLDNLTKDDL DEEEDTHL 
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Seq ID NO: 15 Nucleotide sequence: 
Nucleic Acid Accession #: NM_004105 

Coding sequence: 150.. 1631 (underlined sequences correspond to start and stop codons) 
1 11 21 31 41 51 

l l I I I I 

CTAGTATTCT ACTAGAACTG GAAGATTGCT CTCCGAGTTT TTTTTTTGTT ATTTTGTTAA 60 

AAAATAAAAA GCTTGAGCAG CAATTCATAT TACTGT CACA GGTATTTTTG CTGTGCTGTG 120 

CAAGGTAACT CTGCTAGCTA AGATTCACAA TGTTGAAAGC CCTTTTCCTA ACTATGCTGA 180 

CTCTGGCGCT GGTCAAGTCA CAGGACACCG AAGAAACCAT CACGTACACG CAATGCACTG 240 

ACGGATATGA GTGGGATCCT GTGAGACAGC AATGCAAAGA TATTGATGAA TGTGACATTG 300 

TCCCAGACGC TTGTAAAGGT GGAATGAAGT GTGTCAACCA CTATGGAGGA TACCTCTGCC 360 

TTCCGAAAAC AGCCCAGATT ATTGTCAATA ATGAACAGCC TCAGCAGGAA ACACAACCAG 420 

CAGAAGGAAC CTCAGGGGCA ACCACCGGGG TTGTAGCTGC CAGCAGCATG GCAACCAGTG 480 

GAGTGTTGCC CGGGGGTGGT TTTGTGGCCA GTGCTGCTGC AGTCGCAGGC CCTGAAATGC 540 

AGACTGGCCG AAATAACTTT GTCATCCGGC GGAACCCAGC TGACCCTCAG CGCATTCCCT 600 

CCAACCCTTC CCACCGTATC CAGTGTGCAG CAGGCTACGA GCAAAGTGAA CACAACGTGT 660 

GCCAAGACAT AGACGAGTGC ACTGCAGGGA CGCACAACTG TAGAGCAGAC CAAGTGTGCA 720 

TCAATTTACG GGGATCCTTT GCATGTCAGT GCCCTCCTGG ATATCAGAAG CGAGGGGAGC 780 

AGTGCGTAGA CATAGATGAA TGTACCATCC CTCCATATTG CCACCAAAGA TGCGTGAATA 840 

CACCAGGCTC ATTTTATTGC CAGTGCAGTC CTGGGTTTCA ATTGGCAGCA AACAACTATA 900 

CCTGCGTAGA TATAAATGAA TGTGATGCCA GCAATCAATG TGCTCAGCAG TGCTACAACA 960 

TTCTTGGTTC ATTCATCTGT CAGTGCAATC AAGGATATGA GCTAAGCAGT GACAGGCTCA 1020 

ACTGTGAAGA CATTGATGAA TGCAGAACCT CAAGCTACCT GTGTCAATAT CAATGTGTCA 1080 

ATGAACCTGG GAAATTCTCA TGTATGTGCC CCCAGGGATA CCAAGTGGTG AGAAGTAGAA 1140 

CATGTCAAGA TATAAATGAG TGTGAGACCA CAAATGAATG CCGGGAGGAT GAAATGTGTT 1200 

GGAATTATCA TGGCGGCTTC CGTTGTTATC CACGAAATCC TTGTCAAGAT CCCTACATTC 1260 

TAACACCAGA GAACCGATGT GTTTGCCCAG TCTCAAATGC CATGTGCCGA GAACTGCCCC 1320 

AGTCAATAGT CTACAAATAC ATGAGCATCC GATCTGATAG GTCTGTGCCA TCAGACATCT 13 80 

TCCAGATACA GGCCACAACT ATTTATGCCA ACACCATCAA TACTTTTCGG ATTAAATCTG 144 0 

GAAATGAAAA TGGAGAGTTC TACCTACGAC AAACAAGTCC TGTAAGTGCA ATGCTTGTGC 1500 

TCGTGAAGTC ATTATCAGGA CCAAGAGAAC ATATCGTGGA CCTGGAGATG CTGACAGTCA 1560 

GCAGTATAGG GACCTTCCGC ACAAGCTCTG TGTTAAGATT GACAATAATA GTGGGGCCAT 1620 

TTTCATTTTA GTCTTTTCTA AGAGTCAACC ACAGGCATTT AAGTCAGCCA AAGAATATTG 1680 

TTACCTTAAA GCACTATTTT ATTTATAGAT ATATCTAGTG CATCTACATC TCTATACTGT 174 0 

ACACTCACCC ATAACAAACA ATTACACCAT GGTATAAAGT GGGCATTTAA TATGTAAAGA 1800 

TTCAAAGTTT GTCTTTATTA CTATATGTAA ATTAGACATT AATCCACTAA ACTGGTCTTC 186 0 

TTCAAGAGAG CTAAGTATAC ACTATCTGGT GAAACTTGGA TTCTTTCCTA TAAAAGTGGG 192 0 

ACCAAGCAAT GATGATCTTC TGTGGTGCTT AAGGAAACTT ACTAGAGCTC CACTAACAGT 198 0 

CTCATAAGGA GGCAGCCATC ATAACCATTG AATAGCATGC AAGGGTAAGA ATGAGTTTTT 2040 

AACTGCTTTG TAAGAAAATG GAAAAGGTCA ATAAAGATAT ATTTCTTTAG AAAATGGGGA 2100 

TCTGCCATAT TTGTGTTGGT TTTTATTTTC ATATCCAGCC TAAAGGTGGT TGTTTATTAT 2160 

ATAGTAATAA ATCATTGCTG TACAACATGC TGGTTTCTGT AGGGTATTTT TAATTTTGTC 2220 

AGAAATTTTA GATTGTGAAT ATTTTGTAAA AAACAGTAAG CAAAATTTTC CAGAATTCCC 2280 

AAAATGAACC AGATACCCCC TAGAAAATTA TACTATTGAG AAATCTATGG GGAGGATATG 234 0 

AGAAAATAAA TTCCTTCTAA ACCACATTGG AACTGACCTG AAGAAGCAAA CTCGGAAAAT 24 0 0 

ATAATAACAT CCCTGAATTC AGGCATTCAC AAGATGCAGA ACAAAATGGA TAAAAGGTAT 24 60 

TTCACTGGAG AAGTTTTAAT TTCTAAGTAA AATTTAAATC CTAACACTTC ACTAATTTAT 2520 

AACTAAAATT TCTCATCTTC GTACTTGATG CTCACAGAGG AAGAAAATGA TGATGGTTTT 2580 

TATTCCTGGC ATCCAGAGTG ACAGTGAACT TAAGCAAATT ACCCTCCTAC CCAATTCTAT 264 0 

GGAATATTTT ATACGTCTCC TTGTTTAAAA TCTGACTGCT TTACTTTGAT GTATCATATT 2700 
TTTAAATAAA AATAAATATT CCTTTAGAAG ATCACTCTAA AA 



Seq ID NO: 16 Protein sequence: 
Protein Accession #: NP_004096 



1 11 21 31 41 51 

I I I I I I 

MLKALFLTML TLALVKSQDT EETITYTQCT DGYEWDPVRQ QCKDIDECDI VPDACKGGMK 60 
CVNHYGGYLC LPKTAQIIVN NEQPQQETQP AEGTSGATTG WAASSMATS GVLPGGGFVA 120 
SAAAVAGPEM QTGRNNFVIR RNPADPQRIP SNPSHRIQCA AGYEQSEHNV CQDIDECTAG 180 
THNCRADQVC INLRGSFACQ CPPGYQKRGE QCVDIDECTI PPYCHQRCVN TPGSFYCQCS 240 
PGFQIiAANNY TCVDINECDA SNQCAQQCYN ILGSFICQCN QGYELSSDRL NCEDIDECRT 3 00 
SSYLCQYQCV NEPGKFSCMC PQGYQWRSR TCQDINECET TNECREDEMC WNYHGGFRCY 360 
PRNPCQDPYI LTPENRCVCP VSNAMCRELP QSIVYKYMSI RSDRSVPSDI FQIQATTIYA 420 
NTINTFRIKS GNENGEFYLR QTSPVSAMLV LVKSLSGPRE HIVDLEMLTV SSIGTFRTSS 4 80 
VLRLTIIVGP FSF 

Seq ID NO: 17 Nucleotide sequence : 
Nucleic Acid Accession #: NM_018894 

Coding sequence: 27.. 1967 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

I I I I I I 

AAAACATTCA ACAAATTAAT GGGTGTAAGG AACTGGAAAA CCTGGACTCC TACCACATGC 60 

AGATAAAACC AATAGAGTGC AGAATAAGAC TCAAGTCAAG TAAGTAACGT TAAACACCAT 120 

AAAGACACAT GGCCTTCTTT GTGTACATGA CATGCATTCT CAACAATGCA CTGACGGATA 180 

TGAGTGGGAT CCTGTGAGAC AGCAATGCAA AGATATTGAT GAATGTGACA TTGTCCCAGA 240 

CGCTTGTAAA GGTGGAATGA AGTGTGTCAA CCACTATGGA GGATACCTCT GCCTTCCGAA 300 

AACAGCCCAG ATTATTGTCA ATAATGAACA GCCTCAGCAG GAAACACAAC CAGCAGAAGG 360 

AACCTCAGGG GCAACCACCG GGGTTGTAGC TGCCAGCAGC ATGGCAACCA GTGGAGTGTT 420 

GCCCGGGGGT GGTTTTGTGG CCAGTGCTGC TGCAGTCGCA GGCCCTGAAA TGCAGACTGG 480 

CCGAAATAAC TTTGTCATCC GGCGGAACCC AGCTGACCCT CAGCGCATTC CCTCCAACCC 540 

TTCCCACCGT ATCCAGTGTG CAGCAGGCTA CGAGCAAAGT GAACACAACG TGTGCCAAGA 600 

CATAGACGAG TGCACTGCAG GGACGCACAA CTGTAGAGCA GACCAAGTGT GCATCAATTT 660 

ACGGGGATCC TTTGCATGTC AGTGCCCTCC TGGATATCAG AAGCGAGGGG AGCAGTGCGT 720 

AGACATAGAT GAATGTACCA TCCCTCCATA TTGCCACCAA AGATGCGTGA ATACACCAGG 780 

CTCATTTTAT TGCCAGTGCA GTCCTGGGTT TCAATTGGCA GCAAACAACT ATACCTGCGT 840 

AGATATAAAT GAATGTGATG CCAGCAATCA ATGTGCTCAG CAGTGCTACA ACATTCTTGG 900 

TTCATTCATC TGTCAGTGCA ATCAAGGATA TGAGCTAAGC AGTGACAGGC TCAACTGTGA 960 

AGACATTGAT GAATGCAGAA CCTCAAGCTA CCTGTGTCAA TATCAATGTG TCAATGAACC 1020 

TGGGAAATTC TCATGTATGT GCCCCCAGGG ATACCAAGTG GTGAGAAGTA GAACATGTCA 1080 

AGATATAAAT GAGTGTGAGA CCACAAATGA ATGCCGGGAG GATGAAATGT GTTGGAATTA 1140 

TCATGGCGGC TTCCGTTGTT ATCCACGAAA TCCTTGTCAA GATCCCTACA TTCTAACACC 1200 

AGAGAACCGA TGTGTTTGCC CAGTCTCAAA TGCCATGTGC CGAGAACTGC CCCAGTCAAT 1260 

AGTCTACAAA TACATGAGCA TCCGATCTGA TAGGTCTGTG CCATCAGACA TCTTCCAGAT 1320 

ACAGGCCACA ACTATTTATG CCAACACCAT CAATACTTTT CGGATTAAAT CTGGAAATGA 1380 

AAATGGAGAG TTCTACCTAC GACAAACAAG TCCTGTAAGT GCAATGCTTG TGCTCGTGAA 1440 

GTCATTATCA GGACCAAGAG AACATATCGT GGACCTGGAG ATGCTGACAG TCAGCAGTAT 1500 

AGGGACCTTC CGCACAAGCT CTGTGTTAAG ATTGACAATA ATAGTGGGGC CATTTTCATT 1560 

TTAGTCTTTT CTAAGAGTCA ACCACAGGCA TTTAAGTCAG CCAAAGAATA TTGTTACCTT 1620 

AAAGCACTAT TTTATTTATA GATATATCTA GTGCATCTAC ATCTCTATAC TGTACACTCA 1680 

CCCATAACAA ACAATTACAC CATGGTATAA AGTGGGCATT TAATATGTAA AGATTCAAAG 1740 

TTTGTCTTTA TTACTATATG TAAATTAGAC ATTAATCCAC TAAACTGGTC TTCTTCAAGA 1800 

GAGCTAAGTA TACACTATCT GGTGAAACTT GGATTCTTTC CTATAAAAGT GGGACCAAGC 1860 

AATGATGATC TTCTGTGGTG CTTAAGGAAA CTTACTAGAG CTCCACTAAC AGTCTCATAA 1920 

GGAGGCAGCC ATCATAACCA TTGAATAGCA TGCAAGGGTA AGAATGAGTT TTTAACTGCT 1980 

TTGTAAGAAA ATGGAAAAGG TCAATAAAGA TATATTTCTT TAGAAAATGG GGATCTGCCA 2040 

TATTTGTGTT GGTTTTTATT TTCATATCCA GCCTAAAGGT GGTTGTTTAT TATATAGTAA 2100 

TAAATCATTG CTGTACAACA TGCTGGTTTC TGTAGGGTAT TTTTAATTTT GTCAGAAATT 2160 

TTAGATTGTG AATATTTTGT AAAAAACAGT AAGCAAAATT TTCCAGAATT CC CAAAATG A 2220 

ACCAGATACC CCCTAGAAAA TTATACTATT GAGAAATCTA TGGGGAGGAT ATGAGAAAAT 2280 

AAATTCCTTC TAAACCACAT TGGAACTGAC CTGAAGAAGC AAACTCGGAA AATATAATAA 2340 

CATCCCTGAA TTCAGGCATT CACAAGATGC AGAACAAAAT GGATAAAAGG TATTTCACTG 2400 

GAGAAGTTTT AATTTCTAAG TAAAATTTAA ATCCTAACAC TTCACTAATT TATAACTAAA 2460 

ATTTCTCATC TTCGTACTTG ATGCTCACAG AGGAAGAAAA TGATGATGGT TTTTATTCCT 2520 

GGCATCCAGA GTGACAGTGA ACTTAAGCAA ATTACCCTCC TACCCAATTC TATGGAATAT 2580 

TTTATACGTC TCCTTGTTTA AAATCTGACT GCTTTACTTT GATGTATCAT ATTTTTAAAT 2640 
AAAAATAAAT ATTCCTTTAG AAGATCACTC TAAAA 

Seq ID NO: 18 Protein sequence: 
Protein Accession #: NP_061489.1 

1 11 21 31 41 51 

| I I I I I 

MHSQQCTDGY EWDPVRQQCK DIDECDIVPD ACKGGMKCVN HYGGYLCLPK TAQIIVNNEQ 60 

PQQETQPAEG TSGATTGWA ASSMATSGVL PGGGFVASAA AVAGPEMQTG RNNFVIRRNP 120 

ADPQRIPSNP SHRIQCAAGY EQSEHNVCQD IDECTAGTHN CRADQVCINL RGSFACQCPP 180 

GYQKRGEQCV DIDECTIPPY CHQRCVNTPG SFYCQCSPGF QLAANNYTCV DINECDASNQ 240 

CAQQCYNILG SFICQCNQGY ELSSDRLNCE DIDECRTSSY LCQYQCVNEP GKFSCMCPQG 3 00 

YQWRSRTCQ DINECETTNE CREDEMCWNY HGGFRCYPRN PCQDPYILTP ENRCVCPVSN 360 

AMCRELPQSI VYKYMSIRSD RSVPSDIFQI QATTIYANTI NTFRIKSGNE NGEFYLRQTS 420 
PVSAMLVLVK SLSGPREHIV DLEMLTVSSI GTFRTSSVLR LTIIVGPFSF 

Seq ID NO: 19 Nucleotide sequence; 
Nucleic Acid Accession #: NM_006500 

Coding sequence: 27.. 1967 (underlined sequences correspond to start and 



1 11 21 31 41 51 

1 I I I I I 

ACTTGCGTCT CGCCCTCCGG CCAAGCATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 

TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 300 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 
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GCATCTTCTT GTGCCAGGGC AAGCGCCCTC GGTCCCAGGA 
TCTACAAAGC TCCGGAGGAG CCAAACATCC AGGTCAACCC 
GTAAGGAGCC TGAGGAGGTC GCTACCTGTG TAGGGAGGAA 
TCATCTGGTA CAAGAATGGC CGGCCTCTGA AGGAGGAGAA 
5 CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT ACACCTTGCA 
TGGTTAAAGA AGACAAAGAT GCCCAGTTTT ACTGTGAGCT 
GGAACCACAT GAAGGAGTCC AGGGAAGTCA CCGTCCCTGT 
TGTGGCTGGA AGTGGAGCCC GTGGGAATGC TGAAGGAAGG 
GTTTGGCTGA TGGCAACCCT CCACCACACT TCAGCATCAG 

10 GGGAGGCAGA GGAAGAGACA ACCAACGACA ACGGGGTCCT 
AGGAACACAG TGGGCGCTAT GAATGTCAGG CCTGGAACTT 
TGAGTGAACC ACAGGAACTA CTGGTGAACT ATGTGTCTGA 
CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA CCCTGACCTG 
ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA CAGACCAGGT 

15 TTCAGTTGCA TGACCTGAAA CGGGAGGCAG GAGGCGGCTA 
CCAGCATACC CGGCCTGAAC CGCACACAGC TGGTCAAGCT 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG TGAAAGAGAA 
GTGAAGCGTC AGGGCACCCC CGGCCCACCA TCTCCTGGAA 
AACAAGACCA AGAT CCACAG CGAGTCCTGA GCACCCTGAA 

20 TGTTGGAGAC AGGTGTTGAA TGCACGGCCT CCAACGACCT 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC TCACACCAGA 
TCAGCACTTC CACTGCCAGT CCTCATACCA GAGCCAACAG 
TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA TCGTGGCTGT 
TGGCGGTGCT GGGCGCTGTC CTCTATTTCC TCTATAAGAA 

25 GCTCAGGGAA GCAGGAGATC ACGCTGCCCC CGTCTCGTAA 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG GCCTCCTGCA 
GGGCTCCGGG AGACCAGGGA GAGAAATACA TCGATCTGAG 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA GCTCCCTGCT 
CCTCCAAAGG GACTAGAGAG AAGCCTCCTG CTCCCCTCAC 

30 GGCCACTGGG TTAGGACCTG AGGACCTCAC TTGGCCCTGC 
GTCCACCACC ATCTCCTCCA CGTTGAGTGA AGCTCATCCC 
CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC GTGTTTTTTC 
AAATACCTGG CTCCTGCCAG CAGCTGAGCT GGGTAGCCTC 
CAAAGGCTGG CTTCCACCAT CCAGGTGCAC CACTGAAGTG 

35 GCCTGCTCAT GTTGAAGTGC GCTGTTCACA CCCGCTCCGG 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC CTCCTGCTCG 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC TGGTGTCATT 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC CCAGCACTTT 
TCACAAAGTC AGGACGAGAC CATCCTGGCT AACACGGTGA 

40 TACAAAAAAA AATTAGCTAG GCGTAGTGGT TGGCACCTAT 
CTGAAGCAGG AGAATGGTAT GAATCCAGGA GGTGGAGCTT 
CACTGCACTC CAGCCTGGGC " AACACAGCGA GACTCCGTCT 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC TGTTTTCGAG 
TCCCCGTGTT CACTTGCTCC CATAGCCCTC TTGATGGATC 

45 GGGGAGCAGA CAAAGATGAG GTCTACACTG TCCTTCATGG 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA GGGCCCCAAC 
AGAATGGTAC TTAGGGATGG AAAACGGGGC CTGGCTAGAG 
CTGTGTGTAT GCATACATAT GTGTGTATAT ATGGTTTTGT 
TTGTTTCCTT TATATATGTA TGTATATATA TATATGAAAA 

50 AAAGCTTAAT TGTCCCAGAA AATCATACAT TGCTTTTTTA 
AACCTGGGGG CCTGTGAAAC TACAACCAAA AGGCACACAA 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC AAATGGCTCA 
CTACCCTACT TTTCAGCAGC AAAACGTCCC GTATGACGCA 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG TTTCCGTCCA 

55 

Seq ID NO: 20 Protein sequence : 
Protein Accession #: NP_006491 

60 

1 11 21 31 

I I I I 

MGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV 
DWFSVHKEKR TLIFRVRQGQ GQSEPGEYEQ RLSLQDRGAT 

65 PRSQEYRIQL RVYKAPEEPN IQVNPLGIPV NSKEPEEVAT 
LKEEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ 
VTVPVFYPTE KVWLEVEPVG MLKEGDRVEI RCIiADGNPPP 
DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV 
LTLTCEAESS QDLEFQWLRE ETDQVLERGP VLQLHDLKRE 

70 QLVKLAIFGP PWMAFKERKV WVKENMVLNL SCEASGHPRP 
LSTi*NVLVTP ELLETGVECT ASNDLGKNTS ILFLELVNLT 
TRANSTSTER KLPEPESRGV VIVAVIVCIL VLAVLGAVLY 
PPSRKTELW EVKSDKLPEE MGhLQGSSGD KRAPGDQGEK 
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GTACCGCATC 


CAGCTCCGCG 


420 


CCTGGGCATC 


CCTGTGAACA 


480 


CGGGTACCCC 


ATTCCTCAAG 


540 


GAACCGGGTC 


CACATTCAGT 


600 


GAGTATTCTG 


AAGGCACAGC 


660 


CAACTACCGG 


CTGCCCAGTG 


720 


TTTCTACCCG 


ACAGAAAAAG 


780 


GGACCGCGTG 


GAAATCAGGT 


840 


CAAGCAGAAC 


CCCAGCACCA 


900 


GGTGCTGGAG 


CCTGCCCGGA 


960 


GGACACCATG 


ATATCGCTGC 


1020 


CGTCCGAGTG 


AGTCCCGCAG 


1080 


TGAGGCAGAG 

X XvJ^XV XV WjWfaw 


AGTAGCCAGG 

JlW* X /4WV v*f*w w 


1140 


GCTGGAAAGG 


GGGCCTGTGC 


1200 


TCGCTGCGTG 

X *w W X XV VriVJ X XV 


GCGTCTGTGC 

V WW X X^ X XJ ^ V^J w 


1260 


GGCCATTTTT 

VJWwf*^ X X X X 


GGCCCCCCTT 


1320 


TATGGTGTTG 

X #* » XJ X^ ^ A A XV 


AATCTGTCTT 


1380 


CGTCAACGGC 


ACGGCAAGTG 

4 XJ XJ X^ S *V » %J 


1440 


TGTC CTCGTG 


ACCCCGGAGC 


1500 


GGGCAAAAAC 


ACCAGCATCC 


1560 


CTCCAACACA 


ACCACTGGCC 


1620 


CACCTCCACA 


GAGAGAAAGC 


1680 


GATTGTGTGC 


ATCCTGGTCC 


174 0 


GGGCAAGCTG 


CCGTGCAGGC 

X^ X*» XV * XJ X^* *X^ x^ 


1800 


GACCGAACTT 


GTAGTTGAAG 


1860 


GGGCAGCAGC 

^V XV ^V wt>W %»flw X^ 


GGTGACAAGA 


1920 


GCATTAGCCC 


CGAATCACTT 

x^ x^-*** * x x^ rax* * x 


1980 


CACTCTTCTC 

V^X^x* X w X X X*» X V* 


TCAGCCAAAG 


2040 

X* W V 


CTGCACACCC 


CCTTTCAGAG 


2100 


AAGCCGCTTT 

fV^Xv WWW Vrf XXX 


TCAGGGACCA 


2160 


AAGCAAGGAG 


CCCCAGTCTC 

xx v^nu ^ x^ a x-» 


2220 


TTTACACACA 


TTATGGCTGT 

x im w xjj x 


2280 


TCTGAGCTGG 

X X* X XV**XV X* -X \J W 


TTTCCTGCCC 

<A. X X X* Xp^ A XJ X X 


2340 


AGGACACACC 


GGAGCCAGGC 

xv wiw w wnv xv w 


2400 


AGAGCACCCC 


AGCGGCATCC 


2460 


CCTCTTCAAA 

X^X* A X^ V» A VlMlil 


GTCTCCTGTG 

XV A X* ^ X^ X^ A- XV * XV 


2520 


CCTTAAAAGA 

X^ X* X A AM Uwiv^t 


TACGTGCCGG 

* +^^XV XV X^ Vi XV 


2580 


GGGAGGCCGA 


GGCGGGCGGA 


2640 


AACCCTGTCT 

rui^^ x* x \j x x 


CTACTAAAAA 

X* V* «* X^ X ♦*#**a**4 _ * 


2700 


AGTCCCAGCT 


ACTCGGAAGG 

i* X X XV*J* u *wU 


2760 


r r AGTG AG re 


G AG ACCGTG C 

WO\jr^x N« X XJ Vw» 


2820 

£j IV *-4 w 


PGAGGAAAAA 


AAAAGAAAACJ 


2880 


TTCAGGTGAA 


TTAGCCTCAA 

x x *^>VT x>» x— v. x* *A4r* 


2940 








GGATTAAAGC 


TATGGTTATA 


3060 


CCTAGAAGGG 


CCCAAATGAG 


3120 


CTTCGGGTGT 


GTGTGTCTGT 


3180 


CAGGTGTGTA 


AATTTG C AAA 


3240 


TATATATATA 


TATGAAAAAT 


3300 


TTCTACATGG 


GTACCACAGG 


3360 


AACCGTTTCC 


AGTTGGCAGC 


3420 


AGCTCTACCA 


GAGCAGACAG 


3480 


GCACGAAGGG 


CCTGGCAGGC 


3540 


CTT 







41 


51 




1 

GSTALLKCGL 


! 

SQSQGNLSHV 


60 


LALTQVTPQD 


ERIFLCQGKR 


120 


CVGRNGYPIP 


QVIWYKNGRP 


180 


FYCELNYRLP 


SGNHMKESRE 


240 


HFSISKQNPS 


TREAEEETTN 


300 


NYVSDVRVSP 


AAPERQEGSS 


360 


AGGGYRCVAS 


VPSIPGLNRT 


420 


TISWNVNGTA 


SEQDQDPQRV 


480 


TLTPDSNTTT 


GLSTSTASPH 


540 


FLYKKGKLPC 


RRSGKQEITL 


600 


YIDLRH 
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Seq ID NO: 21 Nucleotide sequence; 
Nucleic Acid Accession #: NM_002421 

Coding sequence: 72.. 1481 {underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGGATATTGG AGTAGCAAGA GGCTGGGAAG CCATCACTTA CCTTGCACTG AGAAAGAAGA 60 

CAAAGGCCAG TATGCACAGC TTTCCTCCAC TGCTGCTGCT GCTGTTCTGG GGTGTGGTGT 120 

CTCACAGCTT CCCAGCGACT CTAGAAACAC AAGAGCAAGA TGTGGACTTA GTCCAGAAAT 180 

ACCTGGAAAA ATACTACAAC CTGAAGAATG ATGGGAGGCA AGTTGAAAAG CGGAGAAATA 240 

GTGGCCCAGT GGTTGAAAAA TTGAAGCAAA TGCAGGAATT CTTTGGGCTG AAAGTGACTG 300 

GGAAACCAGA TGCTGAAACC CTGAAGGTGA TGAAGCAGCC CAGATGTGGA GTGCCTGATG 360 

TGGCTCAGTT TGTCCTCACT GAGGGGAACC CTCGCTGGGA GCAAACACAT CTGACCTACA 420 

GGATTGAAAA TTACACGCCA GATTTGCCAA GAGCAGATGT GGACCATGCC ATTGAGAAAG 480 

CCTTCCAACT CTGGAGTAAT GTCACACCTC TGACATTCAC CAAGGTCTCT GAGGGTCAAG 540 

CAGACATCAT GATATCTTTT GTCAGGGGAG ATCATCGGGA CAACTCTCCT TTTGATGGAC 600 

CTGGAGGAAA TCTTGCTCAT GCTTTTCAAC CAGGCCCAGG TATTGGAGGG GATGCTCATT 660 

TTGATGAAGA TGAAAGGTGG ACCAACAATT TCAGAGAGTA CAACTTACAT CGTGTTGCGG 720 

CTCATGAACT CGGCCATTCT CTTGGACTCT CCCATTCTAC TGATATCGGG GCTTTGATGT 780 

ACCCTAGCTA CACCTTCAGT GGTGATGTTC AGCTAGCTCA GGATGACATT GATGGCATCC 840 

AAG CCATATA TGGACGTTCC CAAAATCCTG TCCAGCCCAT CGGCCCACAA ACCCCAAAAG 900 

CGTGTGACAG TAAGCTAACC TTTGATGCTA TAACTACGAT TCGGGGAGAA GTGATGTTCT 960 

TTAAAGACAG ATTCTACATG CGCACAAATC CCTTCTACCC GGAAGTTGAG CTCAATTTCA 1020 

TTTCTGTTTT CTGGCCACAA CTGCCAAATG GGCTTGAAGC TGCTTACGAA TTTGCCGACA 1080 

GAGATGAAGT CCGGTTTTTC AAAGGGAATA AGTACTGGGC TGTTCAGGGA CAGAATGTGC 1140 

TACACGGATA CCCCAAGGAC ATCTACAGCT CCTTTGGCTT CCCTAGAACT GTGAAGCATA 1200 

TCGATGCTGC TCTTTCTGAG GAAAACACTG GAAAAACCTA CTTCTTTGTT GCTAACAAAT 1260 

ACTGGAGGTA TGATGAATAT AAACGATCTA TGGATCCAGG TTATCCCAAA ATGATAGCAC 1320 

ATGACTTTCC TGGAATTGGC CACAAAGTTG ATGCAGTTTT CATGAAAGAT GGATTTTTCT 1380 

ATTTCTTTCA TGGAACAAGA CAATACAAAT TTGATCCTAA AACGAAGAGA ATTTTGACTC 1440 

TCCAGAAAGC TAATAGCTGG TTCAACTGCA GGAAAAATTG AACATTACTA ATTTGAATGG 1500 

AAAACACATG GTGTGAGTCC AAAGAAGGTG TTTTCCTGAA GAACTGTCTA TTTTCTCAGT 1560 

CATTTTTAAC CTCTAGAGTC ACTGATACAC AGAATATAAT CTTATTTATA CCTCAGTTTG 1620 

CATATTTTTT TACTATTTAG AATGTAGCCC TTTTTGTACT GATATAATTT AGTTCCACAA 1680 

ATGGTGGGTA CAAAAAGTCA AGTTTGTGGC TTATGGATTC ATATAGGCCA GAGTTGCAAA 1740 

GATCTTTTCC AGAGTATGCA ACTCTGACGT TGATCCCAGA GAGCAGCTTC AGTGACAAAC 1800 

ATATCCTTTC AAGACAGAAA GAGACAGGAG ACATGAGTCT TTGCCGGAGG AAAAGCAGCT 1860 

CAAGAACACA TGTGCAGTCA CTGGTGTCAC CCTGGATAGG CAAGGGATAA CTCTTCTAAC 1920 
ACAAAATAAG TGTTTTATGT TTGGAATAAA GTCAACCTTG TTTCTACTGT TTT 



Seq ID NO: 22 Protein sequence : 
Protein Accession #: NP 002412 



1 11 21 31 41 51 

I I I I I I 

MHSFPPLLLL LFWGWSHSF PATLETQEQD VDLVQKYLEK YYNLKNDGRQ VEKRRNSGPV 60 

VEKLKQMQEF FGLKVTGKPD AETLKVMKQP RCGVPDVAQF VLTEGNPRWE QTHLTYRIEN 120 

YTPDLPRADV DHAIEKAFQL WSNVTPLTFT KVSEGQADIM I SFVRGDHRD NSPFDGPGGN 180 

LAHAFQPGPG IGGDAHFDED ERWTNNFREY NLHRVAAHEL GHSLGLSHST DIGALMYPSY 24 0 

TFSGDVQLAQ DDIDGIQAIY GRSQNPVQPI GPQTPKACDS KLTFDAITTI RGEVMFFKDR 300 

FYMRTNPFYP EVELNFISVF WPQLPNGLEA AYEFADRDEV RFFKGNKYWA VQGQNVLHGY 3 60 

PKDIYSSFGF PRTVKHIDAA LSEENTGKTY FFVANKYWRY DEYKRSMDPG YPKMIAHDFP 420 
GIGHKVDAVF MKDGFFYFFH GTRQYKFDPK TKRILTLQKA NSWFNCRKN 



Seq ID NO: 23 Nucleotide sequence: 

Nucleic Acid Accession #: FGENESH predicted ORF 

Coding sequence: 141-1580 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

| I I I I I 

TCTGCGTGTG CCGGGGCTAG GGGCTGGAAG TCCTGGCTCT AGTTGCACCT CGGAAGGAAA 60 

AGGCAAACAG AGGAGGGAAG GCGTCTTAGG ACTGCCTGGA TCCAGAGCAC TTTCCTCGGC 120 

CTCTACAGGC CTGTGTCGCT ATGG GTTCCC CCGCCGCCCC GGAGGGAGCG CTGGGCTACG 180 

TCCGCGAGTT CACTCGCCAC TCCTCCGACG TGCTGGGCAA CCTCAACGAG CTGCGCCTGC 240 

GCGGGATCCT CACTGACGTC ACGCTGCTGG TTGGCGGGCA ACCCCTCAGA GCACACAAGG 300 

CAGTTCTCAT CGCCTGCAGT GGCTTCTTCT ATTCAATTTT CCGGGGCCGT GCGGGAGTCG 360 

GGGTGGACGT GCTCTCTCTG CCCGGGGGTC CCGAAGCGAG AGGCTTCGCC CCTCTATTGG 420 

ACTTCATGTA CACTTCGCGC CTGCGCCTCT CTCCAGCCAC TGCACCAGCA GTCCTAGCGG 4 80 

CCGCCACCTA TTTGCAGATG GAGCACGTGG TCCAGGCATG CCACCGCTTC ATCCAGGCCA 540 

GCTATGAACC TCTGGGCATC TCCCTGCGCC CCCTGGAAGC AGAACCCCCA ACACCCCCAA 600 

CGGCCCCTCC ACCAGGTAGT CCCAGGCGCT CCGAAGGACA CCCAGACCCA CCTACTGAAT 660 
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CTCGAAGCTG CAGTCAAGGC CCCCCCAGTC CAGCCAGCCC TGACCCCAAG GCCTGCAACT 720 

GGAAAAAGTA CAAGTACATC GTGCTAAACT CTCAGGCCTC CCAAGCAGGG AGCCTGGTCG 780 

GGGAGAGAAG TTCTGGTCAA CCTTGCCCCC AAGCCAGGCT CCCCAGTGGA GACGAGGCCT 840 

CCAGCAGCAG CAGCAGCAGC AGCAGCAGCA GTGAAGAAGG ACCCATTCCT GGTCCCCAGA 900 

GCAGGCTCTC TCCAACTGCT GCCACTGTGC AGTTCAAATG TGGGGCTCCA GCCAGTACCC 960 

CCTACCTCCT CACATCCCAG GCTCAAGACA CCTCTGGATC ACCCTCTGAA CGGGCTCGTC 1020 

CACTACCGGG AAGTGAATTT TTCAGCTGCC AGAACTGTGA GGCTGTGGCA GGGTGCTCAT 1080 

CGGGGCTGGA CTCCTTGGTT CCTGGGGACG AAGACAAACC CTATAAGTGT CAGCTGTGCC 1140 

GGTCTTCGTT CCGCTACAAG GGCAACCTTG CCAGTCATCG TACAGTGCAC ACAGGGGAAA 1200 

AGCCTTACCA CTGCTCAATC TGCGGAGCCC GTTTTAACCG GCCAGCAAAC CTGAAAACGC 1260 

ACAGCCGCAT CCATTCGGGA GAGAAGCCGT ATAAGTGTGA GACGTGCGGC TCGCGCTTTG 1320 

TACAGGTGGC ACATCTGCGG GCGCACGTGC TGATCCACAC CGGGGAGAAG CCCTACCCTT 13 80 

GCCCTACCTG CGGAACCCGC TTCCGCCACC TGCAGACCCT CAAGAGCCAC GTTCGCATCC 1440 

ACACCGGAGA GAAGCCTTAC CACTGCGACC CCTGTGGCCT GCATTTCCGG CACAAGAGTC 1500 

AACTGCGGCT GCATCTGCGC CAGAAACACG GAGCTGCTAC CAACACCAAA GTGCACTACC 1560 

ACATTCTCGG GGGGCCC TAG CTGAGCGCAG GCCCAGGCCC CACTTGCTTC CTGCGGGTGG 1620 

GAAAGCTGCA GGCCCAGGCC TTGCTTCCCT ATCAGGCTTG GGCATAGGGG TGTGCCAGGC 1680 

CACTTTGGTA TCAGAAATTG CCACCCTCTT AATTTCTCAC TGGGGAGAGC AGGGGTGGCA 1740 

GATCCTGGCT AGATCTGCCT CTGTTTTGCT GGTCAAAACC TCTTCCCCAC AAGCCAGATT 1800 

GTTTCTGAGG AG AG AG CT AG CTAGGGGCTG GGAAAGGGGA GAGATTGGAG TCCTGGTCTC 1860 

CCTAAGGGAA TAGCCCTCCA CCTGTGGCCC CCATTGCATT CAGTTTATCT GTAAATATAA 1920 

TTTATTGAGG CCTTTGGGTG GCACCGGGGC CTTCATTCGA TTGCATTTCC CACTCCCCTC 1980 

TTCCACAAGT GTGATTAAAA GTGACCAGAA ACACAGAAGG TGAGATCACA GCTCTGCTGG 2040 

CAGAGATTAC TAGCCCTTGG CTCTCTCGTT TGGCTTGGGT ATTTTATATT ATTTCTGT C A 2100 

TAACTTTTAT CTTTAGAATT GTTCTTTCTC CTGTTTGTTT GCTTGTTAGT TTGTTTAAAA 2160 

TGGAAAAAGG GGTTCTCTGT GTTCTGCCCC TGTAATTCTA GGTCTGGAAC CTTTATTTGT 2220 

TCTAGGGCAG CTCTGGGAAC ATGCGGGATT GTGGAATTGG GTCAGGAACC CTCTCTGGTA 2280 

TTCTGGATGT TGTAGGTTCT CTAGCAGTCT AGAAATGGAT ACAGACATTT CTCTGTTCTT 2340 

CAAGGGTGAT AGGAACCATT ATGTTGAGCC CAAAATGGAA GTAATAATAA ATGCCTCCTG 2400 

GAGGCTGTGG GTGTGGGGGA TTCTGTATCT GGATTCCGTA TCACTCCAAC TGGAGGCTGT 2460 

GGGTGTGGGG GATTCTGTAT CTGGATTCCG TATCACTCCA AGTGGAGGCT GGCAGGTTTT 2520 

TCTGCAAGAT GGTCCAGAAT CTAAAATGTC CCATTAATCT GGTCACTTGG GTTTGGCTCT 2580 

GCTGTATCCA TCTATAGTGG TAGAGACCCA CCAGGGCTCA AGTGGAGTCC ATCATCCTCC 2640 

CACGGGGGCC TGTTCTTAGC ACTGAGTTGA TCGCTCCATG GGGGAGAGAT CAGACATTCC 2700 

TTATCAGAGA TGATGTGACC TTTTCTGACT CTGCCCAGTC TCTATGAATG TTATGGCCTA 2760 

GGGAAGAATC ATGAAACTCT TTAGCTTGAT TAGATGGTAA ACAGTGTTAA CCCATCCTTT 2820 

ACTACAGAGG CATATGGGTT TGAATGTTAC CTGGGGTTCT CTCTATTGAG TTGAGCCCCT 2880 

TCTTCCTTTA GTGGGTTTTG GACATCTTCT GGCAAGTGTC CAGATGCCAG AACCTTCTTT 2940 

TCCTCTAGAA GGGATGGTGC TTGGTAACCT TACCTTTTAA AAGCTGGGTC TGTGACCTGG 3000 

TCTTCCCATC CCTGCATTCC TGTCTGGAAC CAGTGAATGC ATTAGAACCT TCCATAGGAA 3060 

AAGAAAAGGG GCTGAGTTCC ATTCTGGGTT TGCTGTAGTT TGGTTGGGAT TATTGTTGGC 3120 

ATTACAGATG TAAAAGATTG ACTAGCCCAT AGGCCAAAGG CCTGTTCTAG TTGACCAAGT 3180 

TTCAAGTAGG ATTAAGAGGT TGGTTGAGGG GTGCAGTTTC TGGTGTAGGC CAGGTAGGTA 3240 

GAAAGTGAGG AACAGGGTTG CCTCTTGGCT GGGTGGAGTC TCTGAAATGT TAGAAGAAGC 3300 

GCTGAAGCCT TGATTGATAG TTCTGCCCCT TGTTGCCCTG GGG CTTATCT GATTATGGGA 3360 

CGAGGGTAGA AAGTAAGAAG CACTTTTGAA TTTGTGGGGT AGAACTTCAA CAATAAGTCA 3420 

GTTCTAGTGG CTGTCGCCTG GGGACTAGTG AGAAAGCTAC TCTTCTCCCT CTTCCCTCTT 34 80 

TCTCCCCATG GCCCCACTGC AGAATTAAAG AAGGAAGAAG GGAAGGCGGA GGAGTCTATA 3 540 

AGAAGGAATC ATGATTTCTA TTTAGCAGAT TGGATGGGCA GGTGGAGAAT GCCTGGGGGT 3 600 

AGAAATGTTA GATCTTGCAA CATCAGATCC TTGGAATAAA GAAGCCTCTC TGYGCWRAAA 3660 
AAAAAAAAAA AAAAAA 



Seq ID NO: 24 Protein sequence; 
Protein Accession #: FGENESH predicted 



1 11 21 31 41 51 

I I I I I I 

MGSPAAPEGA LGYVREFTRH SSDVLGNLNE LRLRGILTDV TLLVGGQPLR AHKAVLIACS 60 

GFFYSIFRGR AGVGVDVLSL PGGPEARGFA PLLDFMYTSR LRLSPATAPA VLAAATYLQM 120 

EHWQACHRF IQASYEPLGI SLRPLEAEPP TPPTAPPPGS PRRSEGHPDP PTESRSCSQG 180 

PPSPASPDPK ACNWKKYKYI VLNSQASQAG SLVGERSSGQ PCPQARLPSG DEASSSSSSS 24 0 

SSSSEEGPIP GPQSRLSPTA ATVQFKCGAP ASTPYLLTSQ AQDTSGSPSE RARPLPGSEF 300 

FSCQNCEAVA GCSSGLDSLV PGDEDKPYKC QIjCRSSFRYK GNLASHRTVH TGEKPYHCSI 360 

CGARFNRPAN LKTHSRIHSG EKPYKCETCG SRFVQVAHLR AHVLIHTGEK PYPCPTCGTR 420 
FRHLQTLKSH VRIHTGEKPY HCDPCGLHFR HKSQLRLHIiR QKHGAATNTK VHYHILGGP 



Seq ID NO: 25 Nucleotide sequence; 
Nucleic Acid Accession #: U21551 

Coding sequence: 1..1155 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

| I I I I I 

ATGGATTGCA GTAACGGATC GGCAGAGTGT ACCGGAGAAG GAGGATCAAA AGAGGTGGTG 60 
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GGGACTTTTA AGGCTAAAGA CCTAATAGTC ACACCAGCTA CCATTTTAAA GGAAAAACCA 120 

GACCCCAATA ATCTGGTTTT TGGAACTGTG TTCACGGATC ATATGCTGAC GGTGGAGTGG 180 

TCCTCAGAGT TTGGATGGGA GAAACCTCAT ATCAAGCCTC TTCAGAACCT GTCATTGCAC 240 

CCTGGCTCAT CAGCTTTGCA CTATGCAGTG GAATTATTTG AAGGATTGAA GGCATTTCGA 300 

GGAGTAGATA ATAAAATTCG ACTGTTTCAG CCAAACCTCA ACATGGATAG AATGTATCGC 360 

TCTGCTGTGA GGGCAACTCT GCCGGTATTT GACAAAGAAG AGCTCTTAGA GTGTATTCAA 420 

CAGCTTGTGA AATTGGATCA AGAATGGGTC CCATATTCAA CATCTGCTAG TCTGTATATT 4 80 

CGTCCTGCAT TCATTGGAAC TGAGCCTTCT CTTGGAGTCA AGAAGCCTAC CAAAGCCCTG 540 

CTCTTTGTAC TCTTGAGCCC AGTGGGACCT TATTTTTCAA GTGGAACCTT TAATCCAGTG 600 

TCCCTGTGGG CCAATCCCAA GTATGTAAGA GCCTGGAAAG GTGGAACTGG GGACTGCAAG 660 

ATGGGAGGGA ATTACGGCTC ATCTCTTTTT GCCCAATGTG AAGACGTAGA TAATGGGTGT 720 

CAGCAGGTCC TGTGGCTCTA TGGCAGAGAC CATCAGATCA CTGAAGTGGG AACTATGAAT 780 

CTTTTTCTTT ACTGGATAAA TGAAGATGGA GAAGAAGAAC TGGCAACTCC TCCACTAGAT 840 

GGCATCATTC TTCCAGGAGT GACAAGGCGG TGCATTCTGG ACCTGGCACA TCAGTGGGGT 900 

GAATTTAAGG TGTCAGAGAG ATACCTCACC ATGGATGACT TGACAACAGC CCTGGAGGGG 960 

AACAGAGTGA GAGAGATGTT TAGCTCTGGT ACAGCCTGTG TTGTTTGCCC AGTTTCTGAT 1020 

ATACTGTACA AAGGCGAGAC AATACACATT CCAACTATGG AGAATGGTCC TAAGCTGGCA 1080 

AGCCGCATCT TGAGCAAATT AACTGATATC CAGTATGGAA GAGAAGAGAG CGACTGGACA 1140 
ATTGTGCTAT C CTGA 



Seq ID NO: 26 Protein sequence: 
Protein Accession #: AAB08528 

1 11 21 31 41 51 

I I I I I I 

MDCSNGSAEC TGEGGSKEW GTFKAKDLIV TPATILKEKP DPNNLVFGTV FTDHMLTVEW 60 
SSEFGWEKPH IKPLQNLSLH PGSSALHYAV ELFEGLKAFR GVDNKIRLFQ PNLNMDRMYR 120 
SAVRATLPVF DKEELLECIQ QLVKLDQEWV PYSTSASLYI RPAFIGTEPS LGVKKPTKAL 180 
LFVLLSPVGP YFSSGTFNPV SLWANPKYVR AWKGGTGDCK MGGNYGSSLF AQCEDVDNGC 240 
QQVLWLYGRD HQITEVGTMN LFLYWINEDG EEELATPPLD GIILPGVTRR CILDLAHQWG 300 
EFKVSERYLT MDDLTTALEG NRVREMFSSG TACWCPVSD ILYKGETIHI PTMENGPKLA 360 
SRILSKLTDI QYGREESDWT IVLS 



Seq ID NO: 27 Nucleotide sequence: 
Nucleic Acid Accession #: XM_039209 

Coding sequence: 656.. 2758 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

TCGCGCGGGG GCCGCCCCCT CCCCTTCCCT CCACCCTGGG CGGGGGCGCG CGAGAAGCGG 60 

TGACGTCAAG GGGCGCGCTG TGGCAGCACC TCCCCGCGCG CTAGTTAAAA AGAAGAAGAA 120 

AAGAGGGAAC GAAACATGAG AGGCTGTGTG AGAAGCTGCA GCCGCCGGCA GAGGAGACCT 180 

CAGCATCATC TAGAGCCCAG CGCTGGCCCT GCCTCCGCCT GCCCCGCCGC CGCCGTCGCC 24 0 

GTTTCTGTTC CTGCTACTGT CCCACCTAAA CAACTCCCGT TACACGGACA AGTGAACATC 300 

TGTGGCTGTC CTCTCCTTTT CTTCCTCCTC TTCCAACTCC TTCTCCTCCT CCCACTTCCC 360 

AGCCGCAGCA GAAAGCCCCC AACCCAACTG ACACTGGCAC AACTGCAAAC GGTGTCATCC 420 

GCACAACTTT ATCTCGCTCC TCGGGCTCCC CTAAGGCATT GGACCCATCG CCGCGTCTTT 4 80 

TATTTTTTGC AAAGTTGCAT CGCTGTACAT ATTTTTGTCC CCGCCACCTC CCTCTGTCTC 540 

TGGAGTGCCC TACAGCCCCG CAAACTCCTC CTGGAGCTGC GCCCTAGTGC CCCTGCTGGG 600 

CAGTGGCGTT CCCCCCCATC CTCCCGCGCC CAGCCCCTGC TGCTCTGGGC AG ACG ATG CT 660 

GAAGATGCTC TCCTTTAAGC TGCTGCTGCT GGCCGTGGCT CTGGGCTTCT TTGAAGGAGA 720 

TGCTAAGTTT GGGGAAAGAA ACGAAGGGAG CGGAGCAAGG AGGAGAAGGT GCCTGAATGG 780 

GAACCCCCCG AAGCGCCTGA AAAGGAGAGA CAGGAGGATG ATGTCCCAGC TGGAGCTGCT 840 

GAGTGGGGGA GAGATGCTGT GCGGTGGCTT CTACCCTCGG CTGTCCTGCT GCCTGCGGAG 900 

TGACAGCCCG GGGCTAGGGC GCCTGGAGAA TAAGATATTT TCTGTTACCA ACAACACAGA 960 

ATGTGGGAAG TTACTGGAGG AAATCAAATG TGCACTTTGC TCTCCACATT CTCAAAGCCT 1020 

GTTCCACTCA CCTGAGAGAG AAGTCTTGGA AAGAGACCTA GTACTTCCTC TGCTCTGCAA 1080 

AGACTATTGC AAAGAATTCT TTTACACTTG CCGAGGCCAT ATTCCAGGTT TCCTTCAAAC 114 0 

AACTGCGGAT GAGTTTTGCT TTTACTATGC AAGAAAAGAT GGTGGGTTGT GCTTTCCAGA 1200 

TTTTCCAAGA AAACAAGTCA GAGGACCAGC ATCTAACTAC TTGGACCAGA TGGAAGAATA 1260 

TGACAAAGTG GAAGAGATCA GCAGAAAGCA CAAACACAAC TGCTTCTGTA TTCAGGAGGT 1320 

TGTGAGTGGG CTGCGGCAGC CCGTTGGTGC CCTGCATAGT GGGGATGGCT CGCAACGTCT 13 80 

CTTCATTCTG GAAAAAGAAG GTTATGTGAA GATACTTACC CCTGAAGGAG AAATTTTCAA 144 0 

GGAGCCTTAT TTGGACATTC ACAAACTTGT TCAAAGTGGA ATAAAGGGAG GAGATGAAAG 1500 

AGGACTGCTA AGCCTCGCAT TCCATCCCAA TTACAAGAAA AATGGAAAGT TGTATGTGTC 1560 

CTATACCACC AACCAAGAAC GGTGGGCTAT CGGGCCTCAT GACCACATTC TTAGGGTTGT 1620 

GGAATACACA GTATCCAGAA AAAATCCACA CCAAGTTGAT TTGAGAACAG CCAGAGTCTT 1680 

TCTTGAAGTT GCAGAACTCC ACAGAAAGCA TCTGGGAGGA CAACTGCTCT TTGGCCCTGA 1740 

CGGCTTTTTG TACATCATTC TTGGTGATGG GATGATTACA CTGGATGATA TGGAAGAAAT 1800 

GGATGGGTTA AGTGATTTCA CAGGCTCAGT GCTACGGCTG GATGTGGACA CAGACATGTG 1860 

CAACGTGCCT TATTCCATAC CAAGGAGCAA CCCACACTTC AACAGCACCA ACCAGCCCCC 1920 

CGAAGTGTTT GCTCATGGGC TCCACGATCC AGGCAGATGT GCTGTGGATA GACATCCCAC 1980 

TGATATAAAC ATCAATTTAA CGATACTGTG TTCAGACTCC AATGGAAAAA AC AG AT CATC 204 0 
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AGCCAGAATT CTACAGATAA TAAAGGGGAA AGATTATGAA AGTGAGCCAT CACTTTTAGA 2100 

ATTCAAGCCA TTCAGTAATG GTCCTTTGGT TGGTGGATTT GTATACCGGG GCTGCCAGTC 2160 

AGAAAGATTG TATGGAAGCT ACGTGTTTGG AGATCGTAAT GGGAATTTCC TAACTCTCCA 2220 

GCAAAGTCCT GTGACAAAGC AGTGGCAAGA AAAACCACTC TGTCTCGGCA CTAGTGGGTC 2280 

CTGTAGAGGC TACTTTTCCG GTCACATCTT GGGATTTGGA GAAGATGAAC TAGGTGAAGT 2340 

TTACATTTTA TCAAGCAGTA AAAGTATGAC CCAGACTCAC AATGGAAAAC TCTACAAAAT 2400 

TGTAGATCCC AAAAGACCTT TAATGCCTGA GGAATGCAGA GCCACGGTAC AACCTGCACA 2460 

GACACTGACT TCAGAGTGCT CCAGGCTCTG TCGAAACGGC TACTGCACCC CCACGGGAAA 2520 

GTGCTGCTGC AGTCCAGGCT GGGAGGGGGA CTTCTGCAGA ACTGCAAAAT GTGAGCCAGC 2580 

ATGTCGTCAT GGAGGTGTCT GTGTTAGACC GAACAAGTGC CTCTGTAAAA AAGGATATCT 2640 

TGGTCCTCAA TGTGAACAAG TGGACAGAAA CATCCGCAGA GTGACCAGGG CAGGTATTCT 2700 

TGATCAGATC ATTGACATGA CATCTTACTT GCTGGATCTA ACAAGTTACA TTGTATAGTT 2760 

TCTGGGACTG TTTGAATATT CTATTCCAAT GGGCATTTAT TTTTTATCCT GTCATTAAAA 2820 

AAAAAAGACT GTTATCCTGC TACACACTCC TGTGATTTCA TTCTCTTTTA TTAATTTAAA 2880 

AATAATTTCC AGAAATGTGC AGATCCTCTG TGTGTATGTC AGCATGTTTG TTCACATATG 2940 

CACATACACA TACTCATAAC CCCTATATGC GTTGTTGCAT AACAGATGAT TTTTTAAAAT 3000 

ATATACTTCC TTATGCAAAG TAATTTACAC AGAAATTCCA TTGTAAATTG ATAATGGATT 3060 

TTTTATGTTA CTAGAAGAGA TTATTTGACT TCCCAGGAAT TTTCTGTCTG TAATCACTAA 3120 

AGTCAACTTT AATAGAGTTT TGAAACAGTA CTGTGCAATC CGATGGATCT AATTAAAAAA 3180 

AAGGCAATAT TTTTATATTA AAGTACTATA CTAGGAGAGA ATGTTTCAGA ACTCCCTGAT 3240 

GAATTTCTAA GTGAGCAACT TGATATAAAA TTGTAATCTT CATTTTTGTC AGTGTATCCA 3300 

GTTACAGAAT GCTACACACT TACCTTTTTA TTGGCTGAGA AATCTGGTTA TTTCATCTTA 3360 

ATCTCAAGAT TGTTTTCAAG TGTTTTATAA TTAAATCATA ATAG CATATT TTAAAATCAA 3420 

TCTTCCTAAA AGGTCTGCTT TTATTGTATA TTTTATTTAA CAATAGGCAC TGGGTTTGTG 3480 
TTACATATTT ATATATTTTA TTTTATTTTT ATAATATAGA CATCACCTAG 

Seq ID NO: 28 Protein sequence: 
Protein Accession #: XP 039209 



1 11 21 31 41 51 

I I I I I I 

MLKMLSFKLL LLAVALGFFB GDAKFGERNE GSGARRRRCL NGNPPKRLKR RDRRMMSQLE 60 

LLSGGEMLCG GFYPRLSCCL RSDSPGLGRL ENKIFSVTNN TECGKLLEEI KCALCSPHSQ 120 

SIiFHSPEREV LERDLVLPLL CKDYCKEFFY TCRGHIPGFL QTTADEFCFY YARKDGGLCF 180 

PDFPRKQVRG PASNYLDQME EYDKVEEISR KHKHNCFCIQ EWSGLRQPV GALHSGDGSQ 240 

RLFILEKEGY VKILTPEGEI FKEPYLDIHK LVQSGIKGGD ERGLLSLAFH PNYKKNGKLY 300 

VSYTTNQERW AIGPHDHILR WEYTVSRKN PHQVDLRTAR VFLEVAELHR KHLGGQLLFG 360 

PDGFLYIILG DGMITLDDME EMDGLSDFTG SVLRLDVDTD MCNVPYSIPR SNPHFNSTNQ 420 

PPEVFAHGLH DPGRCAVDRH PTDININLTI LCSDSNGKNR SSARILQIIK GKDYESEPSL 4 80 

LEFKPFSNGP LVGGFVYRGC QSERLYGSYV FGDRNGNFLT LQQSPVTKQW QEKPLCLGTS 540 

GSCRGYFSGH ILGFGEDELG EVYILSSSKS MTQTHNGKLY KIVDPKRPLM PEECRATVQP 600 

AQTLTSECSR LCRNGYCTPT GKCCCSPGWE GDFCRTAKCE PACRHGGVCV RPNKCLCKKG 660 
YLGPQCEQVD RNIRRVTRAG ILDQIIDMTS YLLDLTSYIV 



Seq ID NO: 29 Nucleotide sequence: 
Nucleic Acid Accession #: NM_024756 

Coding sequence: 75.. 2924 (underlined sequences correspond to start and 

1 11 21 31 41 51 

I I I I 1 1 

AAGACAACGT -CACTAGCAGT TTCTGGAGCT ACTTGCCAAG GCTGAGTGTG AGCTGAGCCT 60 

GCCCCACCAC CAAGATGATC CTGAGCTTGC TGTTCAGCCT TGGGGGCCCC CTGGGCTGGG 120 

GGCTGCTGGG GGCATGGGCC CAGGCTTCCA GTACTAGCCT CTCTGATCTG CAGAGCTCCA 180 

GGACACCTGG GGTCTGGAAG GCAGAGGCTG AGGACACCAG CAAGGACCCC GTTGGACGTA 240 

ACTGGTGCCC CTACCCAATG TCCAAGCTGG TCACCTTACT AGCTCTTTGC AAAACAGAGA 300 

AATTCCTCAT CCACTCGCAG CAGCCGTGTC CGCAGGGAGC TCCAGACTGC CAGAAAGTCA 36 0 

AAGTCATGTA CCGCATGGCC CACAAGCCAG TGTACCAGGT CAAG CAGAAG GTGCTGACCT 420 

CTTTGGCCTG GAGGTGCTGC CCTGGCTACA CGGGCCCCAA CTGCGAGCAC CACGATTCCA 48 0 

TGGCAATCCC TGAGCCTGCA GATCCTGGTG ACAGCCACCA GGAACCTCAG GATGGACCAG 540 

TCAGCTTCAA ACCTGGCCAC CTTGCTGCAG TGATCAATGA GGTTGAGGTG CAACAGGAAC 600 

AG CAGG AAC A TCTGCTGGGA GATCTCCAGA ATGATGTGCA CCGGGTGGCA GACAGCCTGC 660 

CAGGCCTGTG GAAAGCCCTG CCTGGTAACC TCACAGCTGC AGTGATGGAA GCAAATCAAA 720 

CAGGGCACGA GTTCCCTGAT AGATCCTTGG AGCAGGTGCT GCTACCCCAC GTGGACACCT 78 0 

TCCTACAAGT GCATTTCAGC CCCATCTGGA GGAGCTTTAA CCAAAGCCTG CACAGCCTTA 840 

CCCAGGCCAT AAGAAACCTG TCTCTTGACG TGGAGGCCAA CCGCCAGGCC ATCTCCAGAG 900 

TC CAGG AC AG TGCCGTGGCC AGGGCTGACT TCCAGGAGCT TGGTGCCAAA TTTGAGGCCA 960 

AGGTCCAGGA GAACACTCAG AGAGTGGGTC AGCTGCGACA GGACGTGGAG GACCGCCTGC 1020 

ACGCCCAGCA CTTTACCCTG CACCGCTCGA TCTCAGAGCT CCAAGCCGAT GTGGACACCA 1080 

AATTGAAGAG GCTGCACAAG GCTCAGGAGG CCCCAGGGAC CAATGGCAGT CTGGTGTTGG 1140 

CAACGCCTGG GGCTGGGGCA AGGCCTGAGC CGGACAGCCT GCAGGCCAGG CTGGGCCAGC 1200 

TGCAGAGGAA CCTCTCAGAG CTGCACATGA CCACGGCCCG CAGGGAGGAG GAGTTGCAGT 1260 

ACACCCTGGA GGACATGAGG GCCACCCTGA CCCGGCACGT GGATGAGATC AAGGAACTGT 1320 

ACTCCGAATC GGACGAGACT TTCGATCAGA TTAGCAAGGT GGAGCGGCAG GTGGAGGAGC 1380 

TGCAGGTGAA CCACACGGCG CTCCGTGAGC TGCGCGTGAT CCTGATGGAG AAGTCTCTGA 1440 
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TCATGGAGGA GAACAAGGAG GAGGTGGAGC GGCAGCTCCT GGAGCTCAAC CTCACGCTGC 1500 

AGCACCTGCA GGGTGGCCAT GCCGACCTCA TCAAGTACGT GAAGGACTGC AATTGCCAGA 1560 

AGCTCTATTT AGACCTGGAC GTCATCCGGG AGGGCCAGAG GGACGCCACG CGTGCCCTGG 1620 

AGGAGACCCA GGTGAGCCTG GACGAGCGGC GGCAGCTGGA CGGCTCCTCC CTGCAGGCCC 1680 

TGCAGAACGC CGTGGACGCC GTGTCGCTGG CCGTGGACGC GCACAAAGCG GAGGGCGAGC 1740 

GGGCGCGGGC GGCCACGTCG CGGCTCCGGA GCCAAGTGCA GGCGCTGGAT GACGAGGTGG 1800 

GCGCGCTGAA GGCGGCCGCG GCCGAGGCCC GCCACGAGGT GCGCCAGCTG CACAGCGCCT 1860 

TCGCCGCCCT GCTGGAGGAC GCGCTGCGGC ACGAGGCGGT GCTGGCCGCG CTCTTCGGGG 1920 

AGGAGGTGCT GGAGGAGATG TCTGAGCAGA CGCCGGGACC GCTGCCCCTG AGCTACGAGC 1980 

AGATCCGCGT GGCCCTGCAG GACGCCGCTA GCGGGCTGCA GGAGCAGGCG CTCGGCTGGG 2040 

ACGAGCTGGC CGCCCGAGTG ACGGCCCTGG AGCAGGCCTC GGAGCCCCCG CGGCCGGCAG 2100 

AGCACCTGGA GCCCAGCCAC GACGCGGGCC GCGAGGAGGC CGCCACCACC GCCCTGGCCG 2160 

GGCTGGCGCG GGAGCTCCAG AGCCTGAGCA ACGACGTCAA GAATGTCGGG CGGTGCTGCG 2220 

AGGCCGAGGC CGGGGCCGGG GCCGCCTCCC TCAACGCCTC CCTTGACGGC CTCCACAACG 2280 

CACTCTTCGC CACTCAGCGC AGCTTGGAGC AGCACCAGCG GCTCTTCCAC AGCCTCTTTG 2340 

GGAACTTCCA AGGGCTCATG GAAGCCAACG TCAGCCTGGA CCTGGGGAAG CTGCAGACCA 2400 

TGCTGAGCAG GAAAGGGAAG AAGCAGCAGA AAGACCTGGA AGCTCCCCGG AAGAGGGACA 2460 

AGAAGGAAGC GGAGCCTTTG GTGGACATAC GGGTCACAGG GCCTGTGCCA GGTGCCTTGG 2520 

GCGCGGCGCT CTGGGAGGCA GGATCCCCTG TGGCCTTCTA TGCCAGCTTT TCAGAAGGGA 2580 

CGGCTGCCCT GCAGACAGTG AAGTTCAACA CCACATACAT CAACATTGGC AGCAGCTACT 264 0 

TCCCTGAACA TGGCTACTTC CGAGCCCCTG AGCGTGGTGT CTACCTGTTT GCAGTGAGCG 2700 

TTGAATTTGG CCCAGGGCCA GGCACCGGGC AGCTGGTGTT TGGAGGTCAC CATCGGACTC 2760 

CAGTCTGTAC CACTGGGCAG GGGAGTGGAA GCACAGCAAC GGTCTTTGCC ATGGCTGAGC 282 0 

TGCAGAAGGG TGAG CGAGTA TGGTTTGAGT TAACCCAGGG ATCAATAACA AAGAGAAGCC 2880 

TGTCGGGCAC TGCATTTGGG GGCTTCCTGA TGTTTAAGAC C TGAA CCCCA GCCCCAATCT 2940 

GATCAGACAT CATGGACTCG CCCAGCTCTC CTCGGCCTGG GGCTCTGGCC AAGGATGGGC 3000 

TGGAGGTCAT TCAGTTGGTC TGTCTCTTCC CTGGAAACCT TCTGCAAAGA TGGTGTGGTG 3060 

TACGTGGCTT CCCTGTAACC ACATGGGGCT TGGCCATTTC TCCATGATGA GAAGGACTGG 3120 

AATGCTTCTC CGGGCAGGAC ATGGTCCTAG GAAGCCTGAA CCTTGGCTTG GCATGCCTTC 3180 

TCAGACAGCA CGGCCTGGGC TCCAACTCTT CACCACACCC TGTATTCTAC AACTTCTTTG 3240 

GTGTTTTGCT CCTCCTGTGG TTGGAAACTT CTGTACAACA CTTTAAACTT TTCTCTTGCT 3300 

TCCTCTTCTC TTCTCCCTTA TCGTATGATA GAAAGACATT CTTCCCCAGG AGGAATGTTT 3360 

AAAATGGAGG CAACATTTTG GCCAACATTG GAAAGCACTA GAGGGCAATG GGATTAAACC 3420 

AACCTGCTTG GTCTCTATTA GTCAGTAATG AAGACGACAG CCTGGCCAAC CAAGGGAAAG 3480 

GAAATTAGTA TCTTTAGTTT CAGTCATTCC TTGTAGGATA TGGTTTAGCT GTGCCCCCAC 3540 

CTAAAATATC ATCTTGAATT GTAATCCCTA TAATCCCCAC ATCAAGGGAG AGATCAGGTG 3600 

GAGGTAATTG GATCTTGGGG GCGGTTCCCC CATGCTGTTC TTGTGATAGT TCTCACGAGA 3660 

TCTGATGATT TTATAAGTTT GATAGTTCCT CCTGTGTTCA TTCTCCTTCC TGCCACCTTG 3720 

TGAAGATGCC TTGGTTCCTC TTCACTGTCT GCCATGATTG TAAGTTTCCT GAGGCCTCCC 378 0 
CAGCCATGTG GAACAGTGAG TCAATTAAAC CTCTTTCCTT TATAAATT 



Seq ID NO: 3 0 Protein sequence: 
Protein Accession #: NP_079032 

1 11 21 31 41 51 

I I I I I I 

MILSLLFSLG GPLGWGLLGA WAQASSTSLS DLQSSRTPGV WKAEAEDTSK DPVGRNWCPY 60 

PMSKLVTLLA LCKTEKFLIH SQQPCPQGAP DCQKVKVMYR MAHKPVYQVK QKVLTSLAWR 120 

CCPGYTGPNC EHHDSMAIPE PADPGDSHQE PQDGPVSFKP GHLAAVINEV EVQQEQQEHL 180 

LGDLQNDVHR VADSLPGLWK ALPGNLTAAV MEANQTGHEF PDRSLEQVLL PHVDTFLQVH 240 

FSPIWRSFNQ SLHSLTQAIR NLSLDVEANR QAISRVQDSA VARADFQELG AKFEAKVQEN 300 

TQRVGQLRQD VEDRLHAQHF TLHRSISELQ ADVDTKLKRL HKAQEAPGTN GSLVLATPGA 360 

GARPEPDSLQ ARLGQLQRNL SELHMTTARR EEELQYTLED MRATLTRHVD EIKELYSESD 420 

ETFDQISKVE RQVEELQVNH TALRELRVIL MEKSLIMEEN KEEVERQLLE LNLTLQHLQG 480 

GHADLIKYVK DCNCQKLYLD LDVIREGQRD ATRALEETQV SLDERRQLDG SSLQALQNAV 54 0 

DAVSLAVDAH KAEGERARAA TSRLRSQVQA LDDEVGALKA AAAEARHEVR QLHSAFAALL 600 

EDALRHEAVL AALFGEEVLE EMSEQTPGPL PLSYEQIRVA LQDAASGLQE QALGWDELAA 660 

RVTALEQASE PPRPAEHLEP SHDAGREEAA TTALAGLARE LQSLSNDVKN VGRCCEAEAG 720 

AGAASLNASL DGLHNALFAT QRSLEQHQRL FHSLFGNFQG LMEANVSLDL GKLQTMLSRK 78 0 

GKKQQKDLEA PRKRDKKEAE PLVDIRVTGP VPGALGAALW EAGSPVAFYA SFSEGTAALQ 840 

TVKFNTTYIN IGSSYFPEHG YFRAPERGVY LFAVSVEFGP GPGTGQLVFG GHHRTPVCTT 900 
GQGSGSTATV FAMAELQKGE RVWFELTQGS ITKRSIiSGTA FGGFLMFKT 

Seq ID NO: 31 Nucleotide sequence: 
Nucleic Acid Accession #: AB037715 

Coding sequence: 370.. 3489 (underlined sequences correspond to start and 



1 11 21 31 41 51 

I I I I I I 

GAACGCTCAC AGAACAGGCA GTGCAATTCC ATGTTCCTCT TAAGTATGTT AGCCCTACCG 60 

GGAGCTGAGC TGGCCAGTCT ACTTGGAGAG GAAAAGTAGA TCTGGGGAAG GTGGAAGGGT 120 

CAGTTCCTAA GTGACTTCCT CCTCGGGGAT GGTAAGGGCA TTTGCTGATC TCCAGTGACT 180 

GCCTGGTGCC TCATGGTCAG ACTCGGCTGT CTCACTCCCA GATATCTGAT TTTGCAAAAA 240 

GGGACACACC TATCTGCAGC AAAGAAGACA CTGACCAGAT TGCGAGCGGT GCTTTTGGAT 300 
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GCTCTGTAGC CACCCGGGGC CCAGGAGGAC 
TCGGAGACCA TGGCAGTGCA GCTGGTGCCC 
GAGGGCCGCC GATGTCAAGT ACATCTTCTT 
CCCAAGCTGT TGGCCAAGGA GCTTCTTGAC 
5 AAGGAGTACT TTGGAATAGC ATTCACAGAT 
GATCGAAGAG TATTGGAACA TGACTTCCCT 
TGTGTCAGGT TCTATATAGA AAGCATTTCA 
TTCTTTCTGA ACGCGAAGTC CTGCATCTAC 
GTGTTTGAAT TAGCTTCCTA TATTTTACAG 

10 GTTGTGAGGA GTGACTTGAA GAAGCTGCCA 
CCTTCCCTGG CCTACTGTGA AGACAGAGTC 
ACAAGAGGTC AAGCAATCGT AAACTACATG 
GTTCACTATT ATGCAGTGAA GGACAAGCAG 
AAAGGGATCT TCCAGTATGA CTACCATGAT 

15 AGACAGTTGG AAAACCTGTA CTTCAGAGAA 
CGCAGGGCTT CAGTGACAAG GAGGACGTTT 
TATGCATGTC CGGCATTGAT CAAGTCCATC 
TATCTGGACA GAAAGCAGAG TAAGTCCAAA 
GCCATCGACC TGACCGAGAC GGGGACGCTG 

20 AAGGGGAAGA TCATCAGCGG CAGCAGCGGC 
GATAGCTCGC AGTCGGCCAA GAAGGACATG 
CTGGAGGAAA CCCTGCGTCA GAGGCTGGAG 
GAGCTCACGG GCAAGCTGCC AGTAGAATAT 
GTTCGGAGAA GAATAGGAAC AGCCTTCAAA 

25 GAGGAAGCTG AGCTGGAACG CCTGGAACGA 
GCCGCCCGCC GCCTAGCCAG TGACCCCAAC 
ACCTCGT AT C TGAATGCACT GAAGAAACTG 
CGCATCAAGT CTGGGAAGAA ACCCACCCAG 
ATTGCCAGTG AAGACAGCTC CCTCTCAGAT 

30 GTTACCAGCA CAATATCCCC CCTACATTCT 
TCGCACAACA GGCCTCCTCC TCCCCAGTCC 
CGCAACGACT ATGACAAGTC ACCCATCAAG 
GAACCCTATG AGAAGGTCAA GAAGCGCTCC 
TTCCCCAGCA CAGGAAGCTG TGCGGAAGCC 

35 CCCATCCGCG GCCTCCCGCA CTGGAACTCC 
CGGGTCCGGA GTCCCCACTA CGTCCATTCC 
CTGCACAGCC TCGCACTGCA CTTTAGGCAC 
CTCCTGGGCT CGGAAAACGA CACCGGGAGC 
AGCAACGGCT CAGACCCCAT GGACGACTGC 

40 CACTACTACC CGGCGCAGAT GAACGCCAAC 
AAGGCGCGCC AGAGGCAGAG GCAGCGGCAG 
TCGGGCAGCA TGCCCAACCT GGCGGCGCGC 
GGCGGTGTGT ACCTGCACAG CCAGAGCCAG 
CCGCTGTACA TCGAGGGCGG CGCCACGCCC 

45 GAGTGCCACT ACAGCGTCAA GGCTCAGTTC 
CTGTTCAAGG AGAGCTGGCG CGGCGGCGGC 
CCGTCGCGAT CGCAGATCCT GCGGACTCCG 
GGCGCGGGCC GTGCCGCCGT CTCAGACGAG 
TCGCACAAGG AGCACAGCCG CCTGTCGCAC 

50 CAGTACAGCA CCTCCTCCCA GAGCACCTTC 
CAGATGTGCA AGGCCACGTC AGCTGCCTTA 
AGTGAAATTG GAGCCACCCC CCCAAGCAGC 
GAAGCAACAG AAAACTCACC CATTCTGGAT 
GATGAATAGA GGAGCTACAA TGATAGCTGT 

55 GCTGATGTCC AGTGGTACGG GCAGGAAAAA 
CCGGCCTAAT CTGACCGCCT CAACGCCATT 
TTACCCAGAC GCACCGTCAC CCTGCACCAG 
CTCCGCATTC CCTCCCCCTT GAAAACCTGA 
CACTGTGTGT CCCCTGGCGC TCTTGCCCAT 

60 • CTTGGTGGCT TCCCTCTGCC ATGACAGCCC 
GGCATCCAAT TCCTGCGGAT AAGTAGCGTT 
CAGGGTGACC CAGAAAGACG ATTCAG CTGT 
CAAGCACTTC ATGAAGAGGA GGCCTCGTGG 
GATGGGACAG CTTGTGGGGA TGGCTATGGG 

65 ACACCAGAAA TGCATCGGAG GACCACAATC 
TAAAAACATA AAAAATTAAG AGGGGCCAAG 
TTTTAAATTC TGAACTGCTA CTACACACAA 
CTCTCTCTAG CCCTCTCCCT TACTGGCCCA 
GCCCCAATGC CACGGTAAAG GCGAGGAAGT 

70 ATCCATCTGG ACACAAAGAG AGACCTGTGG 
CATGCAGGGG GTTCAGCCGA GCCCAAGACT 
AACGTAAGGT GATAATGGCC AAAAGTGGTT 
ATCCTATTTT TTTGCATAAG GTGTTTCATT 
ACATTGCGAT CCATTCAGTG TTTAACTGTC 

75 GTGACAAAAG AGCTCAGATC CGACTTCTCC 
TGCCCTTAGG TAGAAAGATT TGACTCGTGT 



TGACTCGGCA GCAGGATTCG TGCATGGGAA 360 

GACTCAGCTC TCGGCCTGCT GATGATGACG 42 0 

GATGACAGGA AGCTGGAACT CCTAGTACAG 48 0 

CTTGTGGCTT CTCACTTCAA TCTGAAGGAA 54 0 

GAAACGGGAC ACTTAAACTG GCTTCAGCTA 600 

AAAAAGTCAG GACCCGTGGT TTTATACTTT 660 

TACCTGAAGG ATAATGCTAC CATTGAGCTT 720 

AAGGAGCTTA TTGACGTTGA CAGCGAAGTG 780 

GAGGCAAAGG GAGATTTTTC TAGCAATGAA 840 

GCCCTTCCCA CCCAAGCCCT GAAGGAGCAC 900 

ATTGAGCACT ACAAGAAACT GAACGGTCAG 960 

AGCATCGTGG AGTCTCTCCC AACCTACGGG 1020 

GGCATACCAT GGTGGCTGGG CCTGAGCTAC 1080 

AAAGTGAAGC CAAGAAAGAT ATTCCAATGG 1140 

AAGAAGTTTT CCGTGGAAGT TCATGACCCA 1200 

GGGCACAGCG GCATTGCAGT GCACACGTGG 1260 

TGGGCTATGG CCATAAGCCA ACACCAGTTC 1320 

ATCCATGCAG CACGCAGCCT GAGTGAGATC 1380 

AAGACCTCGA AGCTGGCCAA CATGGGTAGC 1440 

AGCCTGCTGT CTTCAGGTTC TCAGGAATCA 1500 

CTGGCTGCCT TGAAGTCCAG GCAGGAAGCT 1560 

GAACTGAAGA AGCTGTGTCT CCGAGAAGCT 1620 

CCCCTGGATC CAGGGGAGGA ACCACCCATT 1680 

CTGGATGAAC AGAAAATCCT GCCCAAAGGA 174 0 

GAGTTTGCCA TTCAGTCCCA GATTACGGAG 1800 

GTCAGCAAAA AACTGAAGAA ACAAAGGAAA 1860 

CAGGAGATTG AAAATGCAAT CAATGAGAAC 1920 

AGGGCTTCGC TGATCATAGA CGATGGAAAC 1980 

GCCCTTGTTC TTGAGGATGA AGACTCTCAG 2040 

CCTCACAAGG GACTCCCTCC TCGGCCACCG 2100 

CTGGAGGGAC TCCGACAGAT GCACTATCAC 2160 

CCCAAAATGT GGAGTGAGTC CTCTTTAGAT 2220 

TCTCACAGCC ATTCCAGCAG CCACAAGCGC 2280 

GGCGGAGGAA GCAACTCCTT GCAGAACAGC 2340 

CAGTCCAGCA TGCCGTCCAC GCCAGACCTG 24 00 

ACGAGGTCGG TGGACATCAG CCCCACCCGA 2460 

CGGAGCTCCA GCCTGGAGTC CCAGGGCAAG 2520 

CCCGACTTCT ACACCCCGCG GACTCGTAGC 2580 

TCGTCGTGCA CCAGCCACTC GAGCTCGGAG 2640 

TACTCCACGC TGGCCGAGGA CTCGCCGTCC 2700 

CGGGCGGCGG GCGCACTGGG CTCAGCCAGC 2760 

GGGGGTGCGG GGGGCGCGGG GGGCGCGGGG 2820 

CCCAGCTCGC AGTACCGCAT CAAGGAGTAC 2880 

GTGGTGGTGC GCAGCCTGGA GAGCGACCAG 2 940 

AAGACGTCCA ACTCCTACAC GGCGGGCGGC 3000 

GGCGACGAGG GCGACACGGG CCGCCTGACG 3060 

TCGCTGGGCC GCGAGGGCGC CCACGACAAG 3120 

CTGCGCCAGT GGTACCAGCG TTCCACCGCC 3180 

ACCAGCTCCA CCTCCTCGGA CAGCGGCTCG 3240 

GTGGCGCACA GCAGGGTCAC CAGGATGCCC 33 00 

CCTCAAAGCC AGAGAAGCTC GACACCGTCA 3360 

CCCCACCACA TCCTAACCTG GCAGACTGGA 3420 

GGGTCTGAGT CTCCACCTCA CCAAAGTACT 34 80 

TTCCTGGATT CCTCCCTCTA TCCAGAACTA 3540 

GCCAAGCCCG GGACCCTCGT GTGAGCCAGC 3600 

CTGAGATCAC CTCACTGCCT CTCATTTGCC 3660 

CTTTGGCCCT CAGCACTTTT TTTCTCCTGT 3720 

CTGAGGAGAC ATTCTGGAAG GTTCCGGTCC 3780 

AGAGAGCCAG ACACCAATCC TCAATGGCAC 3 84 0 

CTAGGCCAGG AACCATCAGG GGGGCCAGCC 3900 

GGGAGAGAAC GGGAAAGGGG ACTTGGGTTA 3 960 

GTCCAGCCTG CCACCCATAC GTAGGCCAAC 4020 

CATATTCAGT TTACACCTGA AATATTCCTT 4080 

GGAAGGGGAG GTTGAGAAAG GAAGTTCTCG 4140 

AGTTCTATGC TGCCAAAGAT TAAAAATAAA 4200 

AGGAAGACAT TCTTTCTGCA AGGAAATTTC 4260 

GTGAAAGTCA ACCCTATGTA AACTGGTGTC 4320 

CTTCTCTCTC CGTAGAGAGC CTGAAAAACT 4380 

CTTGGCTGGC GTTGCTGACT CACAGTCGCC 4440 

GAGTCATAGA GGGTACTGTT AGCCCCGGTC 4500 

CAAAGCTGCT TTCCTTTCAG GATTTGTAGT 4560 

CTCTCTCATT AAACCAACCA GTAAAAGCGT 4620 

TTCGTTTTTA TGGGAAACCA AGGGAAAAGC 468 0 

GTGGCTCATT TTCTGTTCGT TAGCACTTGT 4 74 0 

TATGTGTCAC TTATTCCAAG AACCCAACTA 48 00 

GTCTACTAGC CAACAGGCAG AGCAGGGTTG 4860 
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AAAAAAATAT CAGCTCCCAA AGGGCCCATG TGTCTACATC ATCAGTTACT GTCATGCACC 4920 

ACATTTGTGT GCAGATACCA AAAGAGGAGG AAAGAAGAAA AAAATTAATG TGTGGGAGCT 4 980 

GCACGTTTAC ATGTTTTGAG CTATGCTTCA AACACAACTG GAAAGCCATC AATCTTCAAA 5040 

GGCCTCAAAA ATACTTTTAT AGTAACAAGT GCACGACTTT AGTTGGGTTA TTCAAGATGG 5100 

CACAAAAAGG TTTCCGCAGA GGTGGTATGC TGTGCTTTTG GCGCAAGTGG TGGGGGGATG 5160 

GGGGTGGGGG TGGAATTTTT TTCTCACTCT AATGACTTCC TATTGGAAAG GCATTGACAG 5220 

CCAGGGACAG GAGCCAGGGT GGGGGTAGTT TTGTGGGAAA GCAGAACTGA AGTTAGCTTA 5280 

AGCATAAAAA CAAAGAAAAA TCTTCGCTTT TCATGTATGT GGAATCCAAG AATAACCATA 5340 

GGCTCTACCA GACCAGGAGG GTAAGGATGG ACACTAAAAT GAAACAAATA CCAAGGTATT 5400 

CCTTCTGCTG CAGCCTGGAG ACCACCGAGA GTCGAGCTGG GGCACACACA CACCTGGCCG 5460 

GGACCCGGCA GGGACAAGGC GGGCCGTGGC CTCCTCCACC AAGTCTCTCT AGACAATTCA 5520 

GGGCCTGCTT TCCCCAGCTC CATGCATGGC TGGACTGGTG ATTCCAGGGT GCAGAAGGGA 5580 

TTCATATTCC CAGAACGCTT TAAGTGTACA CCTGCAGGAT AAAGAGATAC CGGTTACATT 5640 

ATTAAATGAT TCTAGGGATT CACTGGGGGA TATTTTTGTT GCTTTTACTT TCATGGTTAG 5700 

AGCTACAAAG AACAGTGATT TTTTTTTTTT CTCCCTTCCC CATTCAGAAA CATTATACAT 5760 

TGGGCCATTT TTCTTTCTCC CAAAGAAGAT TCATGGATAG TCAGACTGAA CTGTGTGCAA 5820 

CAGGAAAAGT CAAAAGGGAA AAGGCAGCTG ATGAGGTTAC ATGGTTACAT GTTCTACATC 5880 

ATGCAGAGTA GCTTGAAATC TAGTCTGGAG AAAACTGGAT CAAGATTCTA GCCCACTGGA 5940 

GTTGCAAGGA ATGAGAGGCA AAAATTCTAA AGATTTGGGT TATATTTTCA ACTTGGGGGA 6000 

CAGAGAGAAA TGGAGAGCAG GAATTACAGT TCCAACAAAC ATCATGATAG TCTGGTAGTC 6060 

AAGACAGAGA TTAAGTAAAA CAGGTTTTAC TGTTTAGCTG AGTTCAGTTA ATACAAAATG 6120 

TACATAAAAC GTTAGTCCTT TGAGACTGAC ATGATTAATG ATCAGTGTGG TGGGAAATGA 6180 

TGTAGTTATT GTACACAAGC ACTTGCAAAC TCTTTATCCC TATTTCTTTA AAACAAAATA 6240 

AGGTGAAATA CGAAGTCCTT GGTCTGATAT AAAGCCCCTA TTGGATTCTT CGGATGCGTA 6300 

AAAGAAATTG CCTGTTTCAG CCAGAAGACT GGTGAAAACA CATACATCAG ACTATGTTGT 6360 

GAGCCAGGTT GATTTTTTAT TTTATTATAT GCAGGTGAGT GTTGAAACTG TTAAAATTCC 6420 

AATTTGTTTT CATTCAGTAT TAGTTTAGTT CTAAATATAG CAAACCCCAT CCAGGTGCTA 6480 

TCAGATGACC AGTTACTGCT TAGTTAACTA GGTGTAAAGT TTTACATATA CATTAATTTC 6540 

AATAGTTTAT TACAAGTTGT GTAAAATGGA CTCTAGTTTA ATAATGGGGG AAAAAAGATT 6600 

AGGTTGCTCC TGAAACTGAC TGTAGAGCAT GTAAAATGAT TTTACTGGAT TCTGTTCAAC 6660 

TGTAATCAAT GAAAAAGATG TACGTTGTAG ACAAAGTTGC AGAATTAAAA AAAGAAATCT 6720 

GCTTTTAATT TATTCTTTTT GTATTAAGAA TTTGTATAGT ATCTTTACAT TTTGCAAAAC 6780 

AGTGTTGTCA ACACTTATTA AAGCATTTTC AAAATG 



Seq ID NO: 32 Protein sequence: 
Protein Accession #: BAA92532 



1 ll 21 31 41 51 

I I I I I I 

MAVQLVPDSA LGLLMMTEGR RCQVHLLDDR KLELLVQPKL LAKELLDLVA SHFNLKEKEY 60 

FGIAFTDETG HLNWLQLDRR VLEHDFPKKS GPWLYFCVR FYIESISYLK DNATIELFFL 120 

NAKSCIYKEL IDVDSEWFE LASYILQEAK GDFSSNEWR SDLKKLPALP TQALKEHPSL 180 

AYCEDRVIEH YKKLNGQTRG QAIVNYMSIV ESLPTYGVHY YAVKDKQGIP WWLGLSYKGI 240 

FQYDYHDKVK PRKIFQWRQL ENLYFREKKF SVEVHDPRRA SVTRRTFGHS GIAVHTWYAC 3 00 

PALIKSIWAM AISQHQFYLD RKQSKSKIHA ARSLSEIAID LTETGTLKTS KLANMGSKGK 360 

IISGSSGSLL SSGSQESDSS QSAKKDMLAA LKSRQEALEE TLRQRLEELK KLCLREAELT 420 

GKLPVEYPLD PGEEPPIVRR RIGTAFKLDE QKILPKGEEA ELERLEREFA IQSQITEAAR 4 80 

RLASDPNVSK KLKKQRKTSY LNALKKLQEI ENAINENRIK SGKKPTQRAS LIIDDGNIAS 540 

EDSSLSDALV LEDEDSQVTS TISPLHSPHK GLPPRPPSHN RPPPPQSLEG LRQMHYHRND 600 

YDKSPIKPKM WSESSLDEPY EKVKKRSSHS HSSSHKRFPS TGSCAEAGGG SNSLQNSPIR 660 

GLPHWNSQSS MPSTPDLRVR SPHYVHSTRS VDISPTRLHS IiALHFRHRSS SLESQGKLLG 720 

SENDTGSPDF YTPRTRSSNG SDPMDDCSSC TSHSSSEHYY PAQMNANYST LAEDSPSKAR 780 

QRQRQRQRAA GALGSASSGS MPNLAARGGA GGAGGAGGGV YLHSQSQPSS QYRIKEYPLY 840 

IEGGATPVW RSLESDQECH YSVKAQFKTS NSYTAGGLFK ESWRGGGGDE GDTGRLTPSR 900 

SQILRTPSLG REGAHDKGAG RAAVSDELRQ WYQRSTASHK EHSRLSHTSS TSSDSGSQYS 960 

TSSQSTFVAH SRVTRMPQMC KATSAALPQS QRSSTPSSEI GATPPSSPHH ILTWQTGEAT 1020 
ENSPILDGSE SPPHQSTDE 



Seq ID NO: 33 Nucleotide sequence: 
Nucleic Acid Accession #: NM_014331 

Coding sequence: 1..1506 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATG GTCAGAA agcctgttgt gtccaccatc TCCAAAGGAG GTTACCTGCA GGGAAATGTT 60 

AACGGGAGGC TGCCTTCCCT GGGCAACAAG GAGCCACCTG GGCAGGAGAA AGTGCAGCTG 120 

AAGAGGAAAG TCACTTTACT GAGGGGAGTC TC CATTATC A TTGGCACCAT CATTGGAGCA 18 0 

GGAATCTTCA TCTCTCCTAA GGGCGTGCTC CAGAACACGG GCAGCGTGGG CATGTCTCTG 240 

ACCATCTGGA CGGTGTGTGG GGTCCTGTCA CTATTTGGAG CTTTGTCTTA TGCTGAATTG 300 

GGAACAACTA TAAAGAAATC TGGAGGTCAT TACACATATA TTTTGGAAGT CTTTGGTCCA 360 

TTACCAGCTT TTGTACGAGT CTGGGTGGAA CTCCTCATAA TACGCCCTGC AGCTACTGCT 420 

GTGATATCCC TGGCATTTGG ACGCTACATT CTGGAACCAT TTTTTATTCA ATGTGAAATC 480 
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CCTGAACTTG CGATCAAGCT CATTACAGCT GTGGGCATAA CTGTAGTGAT GGTCCTAAAT 540 

AGCATGAGTG TCAGCTGGAG CGCCCGGATC CAGATTTTCT TAACCTTTTG CAAGCTCACA 600 

GCAATTCTGA TAATTATAGT CCCTGGAGTT ATGCAGCTAA TTAAAGGTCA AACGCAGAAC 660 

TTTAAAGACG CGTTTTCAGG AAGAGATT C A AGTATTACGC GGTTGCCACT GGCTTTTTAT 720 

5 TATGGAATGT ATGCATATGC TGGCTGGTTT TACCTCAACT TTGTTACTGA AGAAGTAGAA 780 

AACCCTGAAA AAACCATTCC CCTTGCAATA TGTATATCCA TGGCCATTGT CACCATTGGC 840 

TATGTGCTGA CAAATGTGGC CTACTTTACG ACCATTAATG CTGAGGAGCT GCTGCTTTCA 900 

AATGCAGTGG CAGTGACCTT TTCTGAGCGG CTACTGGGAA ATTTCTCATT AGCAGTTCCG 960 

ATCTTTGTTG CCCTCTCCTG CTTTGGCTCC ATGAACGGTG GTGTGTTTGC TGTCTCCAGG 1020 

10 TTATTCTATG TTGCGTCTCG AGAGGGTCAC CTTCCAGAAA TCCTCTCCAT GATTCATGTC 1060 

CGCAAGCACA CTCCTCTACC AGCTGTTATT GTTTTGCACC CTTTGACAAT GATAATGCTC 114 0 

TTCTCTGGAG ACCTCGACAG TCTTTTGAAT TTCCTCAGTT TTGCCAGGTG GCTTTTTATT 1200 

GGGCTGGCAG TTGCTGGGCT GATTTATCTT CGATACAAAT GCCCAGATAT GCATCGTCCT 1260 

TTCAAGGTGC CACTGTTCAT CCCAGCTTTG TTTTCCTTCA CATGCCTCTT CATGGTTGCC 1320 

15 CTTTCCCTCT ATTCGGACCC ATTTAGTACA GGGATTGGCT TCGTCATCAC TCTGACTGGA 1380 

GTCCCTGCGT ATTATCTCTT TATTATATGG GACAAGAAAC CCAGGTGGTT TAGAATAATG 1440 

TCAGAGAAAA TAACCAGAAC ATTACAAATA ATACTGGAAG TTGTACCAGA AGAAGATAAG 1500 

TTATGAACTA ATGGACTTGA GAT CTTGGC A ATCTGCCCAA GGGGAGACAC AAAATAGGGA 1560 

TTTTTACTTC ATTTTCTGAA AGTCTAGAGA ATTACAACTT TGGTGATAAA CAAAAGGAGT 1620 

20 CAGTTATTTT TATTCATATA TTTTAGCATA TTCGAACTAA TTTCTAAGAA ATTTAGTTAT 1680 

AACTCTATGT AGTTATAGAA AGTGAATATG CAGTTATTCT ATGAGTCGCA CAATTCTTGA 1740 

GTCTCTGATA CCTACCTATT GGGGTTAGGA GAAAAGACTA GACAATTACT ATGTGGTCAT 1800 

TCTCTACAAC ATATGTTAGC ACGGCAAAGA ACCTTCAAAT TGAAGACTGA GATTTTTCTG 1860 

TATATATGGG TTTTGTAAAG ATGGTTTTAC ACACTACAGA TGTCTATACT GTGAAAAGTG 1920 

25 TTTTCAATTC TGAAAAAAAG CATACATCAT GATTATGGCA AAGAGGAGAG AAAGAAATTT 1980 

ATTTTACATT GACATTGCAT TGCTTCCCCT ' TAGATACCAA TTTAGATAAC AAACACTCAT 2040 

GCTTTAATGG ATTATACCCA GAGCACTTTG AACAAAGGTC AGTGGGGATT GTTGAATACA 2100 

TTAAAGAAGA GTTTCTAGGG GCTACTGTTT ATGAGACACA TCCAGGAGTT ATGTTTAAGT 2160 

AAAAATCCTT GAGAATTTAT TATGTCAGAT GTTTTTTCAT TCATTATCAG GAAGTTTTAG 2220 

30 TTATCTGTCA TTTTTTTTTT TCACATCAGT TTGATCAGGA AAGTGTATAA CACATCTTAG 2280 

AGCAAGAGTT AGTTTGGTAT TAAATCCTCA TTAGAACAAC CACCTGTTTC ACTAATAACT 2340 

TACCCCTGAT GAGTCTATCT AAACATATGC ATTTTAAGCC TTCAAATTAC ATTATCAACA 2400 

TGAGAGAAAT AACCAACAAA GAAGATGTTC AAAATAATAG TCCCATATCT GTAATCATAT 2460 

CTACATGCAA TGTTAGTAAT TCTGAAGTTT TTTAAATTTA TGGCTATTTT TACACGATGA 2520 

35 TGAATTTTGA CAGTTTGTGC ATTTTCTTTA TACATTTTAT ATTCTTCTGT TAAAATATCT 2580 

CTTCAGATGA AACTGTCCAG ATTAATTAGG AAAAGG CATA TATTAACATA AAAATTGCAA 264 0 

AAGAAATGTC GCTGTAAATA AGATTTACAA CTGATGTTTC TAGAAAATTT CCACTTCTAT 2700 

ATCTAGGCTT TGTCAGTAAT TTCCACACCT TAATTATCAT TCAACTTGCA AAAGAGACAA 2760 

CTGATAAGAA GAAAATTGAA ATGAGAATCT GTGGATAAGT GTTTGTGTTC AGAAGATGTT 2820 

40 GTTTTGCCAG TATTAGAAAA TACTGTGAGC CGGGCATGGT GGCTTACATC TGTAATCCCA 2880 

GCACTTTGGG AGGCTGAGGG GGTGGATCAC CTGAGGTCGG GAGTTCTAGA CCAGCCTGAC 2940 

CAACATGGAG AAACCCCATC TCTACTAAAA ATACAAAATT AGCTGGGCAT GGTGGCACAT 3000 

GCTGGTAATC TCAGCTATTG AGGAGGCTGA GGCAGGAGAA TTGCTTGAAC CCGGGAGGCG 3060 

GAGGTTGCAG TGAGCCAAGA TTGCACCACT GTACTCCAGC CTGGGTGACA AAGTCAGACT 3120 

45 CCATCTCCAA AAAAAAAAAA AAAA 



Seq ID NO: 34 Protein sequence: 
Protein Accession ft: NP 055146 

50 
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MVRKPWSTI SKGGYLQGNV NGRLPSLGNK EPPGQEKVQL KRKVTLLRGV SIIIGTIIGA 60 
55 GIFISPKGVL QNTGSVGMSL TIWTVCGVLS LFGALSYAEL GTTIKKSGGH YTYIIiEVFGP 120 
LPAFVRVWVE LLrl I RPAATA VISLAFGRYI LEPFFIQCEI PELAIKIjITA VGITWMVLN 180 
SMSVSWSARI QIFLTFCKLT AILIIIVPGV MQLIKGQTQN FKDAFSGRDS SITRLPLAFY 24 0 
YGMYAYAGWF YLNFVTEEVE NPEKTIPLAI CISMAIVTIG YVLTNVAYFT TINAEELLLS 300 
NAVAVTFSER LLGNFSLAVP IFVALSCFGS MNGGVFAVSR LFYVASREGH LPEILSM1HV 360 
60 RKHTPLPAVI VLHPLTMIML FSGDLDSLLN FLSFARWLFI GLAVAGLIYL RYKCPDMHRP 420 
FKVPLFIPAL FSFTCLFMVA LSLYSDPFST GIGFVITLTG VPAYYLFIIW DKKPRWFRIM 480 
SEKITRTLQI ILEWPEEDK L 



65 Seq ID NO: 35 Nucleotide sequence: 
Nucleic Acid Accession #: NM_002422 

Coding sequence: 64.. 1497 (underlined sequences correspond to start and stop codons) 



70 1 11 21 31 41 51 

I I I I I I 

ACAAGGAGGC AGGCAAGACA GCAAGGCATA GAGACAACAT AGAGCTAAGT AAAGCCAGTG 60 

GAAATGAAGA GTCTTCCAAT CCTACTGTTG CTGTGCGTGG CAGTTTGCTC AGCCTATCCA 120 

TTGGATGGAG CTGCAAGGGG TGAGGACACC AGCATGAACC TTGTTCAGAA ATATCTAGAA 180 

75 AACTACTACG ACCTCAAAAA AGATGTGAAA CAGTTTGTTA GGAGAAAGGA CAGTGGTCCT 240 

GTTGTTAAAA AAATCCGAGA AATGCAGAAG TTCCTTGGAT TGGAGGTGAC GGGGAAGCTG 300 
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GACTCCGACA CTCTGGAGGT GATGCGCAAG CCCAGGTGTG GAGTTCCTGA TGTTGGTCAC 360 

TTCAGAACCT TTCCTGGCAT CCCGAAGTGG AGGAAAACCC ACCTTACATA CAGGATTGTG 420 

AATTATACAC CAGATTTGCC AAAAGATGCT GTTGATTCTG CTGTTGAGAA AGCTCTGAAA 480 

GTCTGGGAAG AGGTGACTCC ACTCACATTC TCCAGGCTGT ATGAAGGAGA GGCTGATATA 540 

ATGATCTCTT TTGCAGTTAG AGAACATGGA GACTTTTACC CTTTTGATGG ACCTGGAAAT 600 

GTTTTGGCCC ATGCCTATGC CCCTGGGCCA GGGATTAATG GAGATGCCCA CTTTGATGAT 660 

GATGAACAAT GGACAAAGGA TACAACAGGG ACCAATTTAT TTCTCGTTGC TGCTCATGAA 720 

ATTGGCCACT CCCTGGGTCT CTTTCACTCA GCCAACACTG AAGCTTTGAT GTACCCACTC 780 

TATCACTCAC TCACAGACCT GACTCGGTTC CGCCTGTCTC AAGATGATAT AAATGGCATT 840 

CAGTCCCTCT ATGGACCTCC CCCTGACTCC CCTGAGACCC CCCTGGTACC CACGGAACCT 900 

GTCCCTCCAG AACCTGGGAC GCCAGCCAAC TGTGATCCTG CTTTGTCCTT TGATGCTGTC 960 

AGCACTCTGA GGGGAGAAAT CCTGATCTTT AAAGACAGGC ACTTTTGGCG CAAATCCCTC 1020 

AGGAAGCTTG AACCTGAATT GCATTTGATC TCTTCATTTT GGCCATCTCT TCCTTCAGGC 1080 

GTGGATGCCG CATATGAAGT TACTAGCAAG GACCTCGTTT TCATTTTTAA AGGAAATCAA 1140 

TTCTGGGCCA TCAGAGGAAA TGAGGTACGA GCTGGATACC CAAGAGGCAT CCACACCCTA 1200 

GGTTTCCCTC CAACCGTGAG GAAAATCGAT GCAGCCATTT CTGATAAGGA AAAGAACAAA 1260 

ACATATTTCT TTGTAGAGGA CAAATACTGG AGATTTGATG AGAAGAGAAA TTCCATGGAG 1320 

CCAGGCTTTC CCAAGCAAAT AGCTGAAGAC TTTCCAGGGA TTGACTCAAA GATTGATGCT 1380 

GTTTTTGAAG AATTTGGGTT CTTTTATTTC TTTACTGGAT CTTCACAGTT GGAGTTTGAC 1440 

CCAAATGCAA AGAAAGTGAC ACACACTTTG AAGAGTAACA GCTGGCTTAA TTGTTGAAAG 1500 

AGATATGTAG AAGGCACAAT ATGGGCACTT TAAATGAAGC TAATAATTCT TCACCTAAGT 1560 

CTCTGTGAAT TGAAATGTTC GTTTTCTCCT GCCTGTGCTG TGACTCGAGT CACACTCAAG 1620 

GGAACTTGAG CGTGAATCTG TATCTTGCCG GTCATTTTTA TGTTATTACA GGGCATTCAA 1680 

ATGGGCTGCT GCTTAGCTTG CACCTTGTCA CATAGAGTGA TCTTTCCCAA GAGAAGGGGA 1740 

AGCACTCGTG TGCAACAGAC AAGTGACTGT ATCTGTGTAG ACTATTTGCT TATTTAATAA 1800 
AGACGATTTG TCAGTTGTTT T 



Seq ID NO: 36 Protein sequence: 
Protein Accession #: NP 002413 



1 11 21 31 41 51 

I I I I I I 

MKSLPILLLL CVAVCSAYPL DGAARGEDTS MNLVQKYLEN YYDLKKDVKQ FVRRKDSGPV 60 

VKKIREMQKF LGLEVTGKLD SDTLEVMRKP RCGVPDVGHF RTFPGIPKWR KTHLTYRIVN 120 

YTPDLPKDAV DSAVEKALKV WEEVTPLTFS RLYEGEADIM ISFAVREHGD FYPFDGPGNV 180 

LAHAYAPGPG INGDAHFDDD EQWTKDTTGT NLFLVAAHEI GHSLGLFHSA NTEALMYPLY 240 

HSLTDLTRFR LSQDDINGIQ SLYGPPPDSP ETPLVPTEPV PPEPGTPANC DPALSFDAVS 3 00 

TLRGEILIFK DRHFWRKSLR KLEPELHLIS SFWPSLPSGV DAAYEVTSKD LVFIFKGNQF 360 

WAIRGNEVRA GYPRGIHTLG FPPTVRKIDA AISDKEKNKT YFFVEDKYVJR FDEKRNSMEP 420 
GFPKQIAEDF PGIDSKIDAV FEEFGFFYFF TGSSQLEFDP NAKKVTHTLK SNSWLNC 



Seq ID NO: 37 Nucleotide sequence : 
Nucleic Acid Accession #: NM_003246 

Coding sequence: 112.. 3624 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGACGCACAG GCATTCCCCG CGCCCCTCCA GCCCTCGCCG CCCTCGCCAC CGCTCCCGGC 60 

CGCCGCGCTC CGGTACACAC AGGATCCCTG CTGGGCACCA ACAGCTCCAC CATGGGGCTG 120 

GCCTGGGGAC TAGGCGTCCT GTTCCTGATG CATGTGTGTG GCACCAACCG CATTCCAGAG 180 

TCTGGCGGAG ACAACAGCGT GTTTGACATC TTTGAACTCA CCGGGGCCGC CCGCAAGGGG 24 0 

TCTGGGCGCC GACTGGTGAA GGGCCCCGAC CCTTCCAGCC CAGCTTTCCG CATCGAGGAT 300 

GCCAACCTGA TCCCCCCTGT GCCTGATGAC AAGTTCCAAG ACCTGGTGGA TGCTGTGCGG 3 60 

GCAGAAAAGG GTTTCCTCCT TCTGGCATCC CTGAGGCAGA TGAAGAAGAC CCGGGGCACG 42 0 

CTGCTGGCCC TGGAGCGGAA AGACCACTCT GGCCAGGTCT TCAGCGTGGT GTCCAATGGC 48 0 

AAGGCGGGCA CCCTGGACCT CAGCCTGACC GTCCAAGGAA AGCAGCACGT GGTGTCTGTG 54 0 

GAAGAAGCTC TCCTGGCAAC CGGCCAGTGG AAGAGCATCA CCCTGTTTGT GCAGGAAGAC 600 

AGGGCCCAGC TGTACATCGA CTGTGAAAAG ATGGAGAATG CTGAGTTGGA CGTCCCCATC 660 

CAAAGCGTCT TCACCAGAGA CCTGGCCAGC ATCGCCAGAC TCCGCATCGC AAAGGGGGGC 720 

GTCAATGACA ATTTCCAGGG GGTGCTGCAG AATGTGAGGT TTGTCTTTGG AACCACACCA 780 

GAAGACATCC TCAGGAACAA AGGCTGCTCC AGCTCTACCA GTGTCCTCCT CACCCTTGAC 84 0 

AACAACGTGG TGAATGGTTC CAGCCCTGCC ATCCGCACTA ACTACATTGG CCACAAGACA 900 

AAGGACTTGC AAGCCATCTG CGGCATCTCC TGTGATGAGC TGTCCAGCAT GGTCCTGGAA 960 

CTCAGGGGCC TGCGCACCAT TGTGACCACG CTGCAGGACA GCATCCGCAA AGTGACTGAA 1020 

GAGAACAAAG AGTTGGCCAA TGAGCTGAGG CGGCCTCCCC TATGCTATCA CAACGGAGTT 1080 

CAGTACAGAA ATAACGAGGA ATGGACTGTT GATAGCTGCA CTGAGTGTCA CTGTCAGAAC 1140 

TCAGTTACCA TCTGCAAAAA GGTGTCCTGC CCCATCATGC CCTGCTCCAA TGCCACAGTT 1200 

CCTGATGGAG AATGCTGTCC TCGCTGTTGG CCCAGCGACT CTGCGGACGA TGGCTGGTCT 1260 

CCATGGTCCG AGTGGACCTC CTGTTCTACG AGCTGTGGCA ATGGAATTCA GCAGCGCGGC 1320 

CGCTCCTGCG ATAGCCTCAA CAACCGATGT GAGGGCTCCT CGGTCCAGAC ACGGACCTGC 1380 

CACATTCAGG AGTGTGACAA AAGATTTAAA CAGGATGGTG GCTGGAGCCA CTGGTCCCCG 1440 

TGGTCATCTT GTTCTGTGAC ATGTGGTGAT GGTGTGATCA CAAGGATCCG GCTCTGCAAC 1500 
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TCTCCCAGCC CCCAGATGAA TGGGAAACCC TGTGAAGGCG 
TGCAAGAAAG ACGCCTGCCC CATCAATGGA GGCTGGGGTC 
TGTTCTGTCA CCTGTGGAGG AGGGGTACAG AAACGTAGTC 
CCCCAGTTTG GAGGCAAGGA CTGCGTTGGT GATGTAACAG 
5 CAGGACTGTC CAATTGATGG ATGCCTGTCC AATCCCTGCT 
AGCTACCCTG ATGGCAGCTG GAAATGTGGT GCTTGTCCCC 
ATCCAGTGCA CAGATGTTGA TGAGTGCAAA GAAGTGCCTG 
GGAGAGCACC GGTGTGAGAA CACGGACCCC GGCTACAACT 
TTCACCGGCT CACAGCCCTT CGGCCAGGGT GTCGAACATG 

10 TGCAAGCCCC GTAACCCCTG CACGGATGGG ACCCACGACT 
AACTACCTGG GCCACTATAG CGACCCCATG TACCGCTGCG 
GGCAATGGCA TCATCTGCGG GGAGGACACA GACCTGGATG 
GTGTGCGTGG CCAATGCGAC TTACCACTGC AAAAAGGATA 
TCAGGGCAGG AAGACTATGA CAAGGATGGA ATTGGTGATG 

15 AATGATAAAA TTCCAGATGA CAGGGACAAC TGTCCATTCC 
GACTATGACA GAGATGATGT GGGAGACCGC TGTGACAACT 
GATCAGGCAG ACACAGACAA CAATGGGGAA GGAGACGCCT 
GACGGTATCC TCAATGAACG GGACAACTGC CAGTACGTCT 
ACTGATATGG ATGGGGTTGG AGATCAGTGT GACAATTGCC 

20 CAGCTGGACT CTGACTCAGA CCGCATTGGA GATACCTGTG 
GAAGATGGCC ACCAGAACAA TCTGGACAAC TGTCCCTATG 
GACCATGACA AAGATGGCAA ' GGGAGATGCC TGTGACCACG 
CCTGATGACA AGGACAACTG CAGACTCGTG CCCAATCCCG 
GATGGTCGAG GTGATGCCTG CAAAGATGAT TTTGACCATG 

25 GACATCTGTC CTGAGAATGT TGACATCAGT GAGACCGATT 
CCTCTGGACC CCAAAGGGAC ATCCCAAAAT GACCCTAACT 
AAAGAACTCG TCCAGACTGT CAACTGTGAT CCTGGACTCG 
AATGCTGTGG ACTTCAGTGG CACCTTCTTC ATCAACACCG 
GGATTTGTCT TTGGCTACCA GTCCAGCAGC CGCTTTTATG 

30 ACCCAGTCCT ACTGGGACAC CAACCCCACG AGGGCTCAGG 
AAAGTTGTAA ACTCCACCAC AGGGCCTGGC GAGCACCTGC 
GGAAACACCC CTGGCCAGGT GCGCACCCTG TGGCATGACC 
GATTTCACCG CCTACAGATG GCGTCTCAGC CACAGGCCAA 
GTGATGTATG AAGGGAAGAA AATCATGGCT GACTCAGGAC 

35 GCTGGTGGTA GACTAGGGTT GTTTGTCTTC TCTCAAGAAA 
AAATACGAAT GTAGAGATCC C TAAT CATCA AATTGTTGAT 
AATGCTGGTA TTGCACCTTC TGGAACTATG GGCTTGAGAA 
CTTGGCTTCC TTCTTTTCTG TGCTTGCATC AGTGTGGACT 
TCAAGAAAAT GCAGTTTTCA AAAACAGACT CATCAGCATT 

40 ATCTTCCAAG CATATAAACA ATTGCTTTGG TTTCCTTTTG 
AGTTGGGAAG GTGCCCATTC CACTCTGCCT TTGTCACAGA 
CATCTCTGAG CAGTGGACTC AAAAGCATTT TCAGGCATGT 
AGAATTAGCA AACAAAACCA CCCTGACATC CTCCTTCAGG 
AAGCACTAAG GGGAGGGCGC ATACCCGAGA CGATTGTATG 

45 TTACATGTTC GGTACTAAGT CATTTTCAGG GGATTGAAAG 
TGCTGACTGG CGTTAGCTGA TTAACCCATG TAAATAGGCA 
GGAGACAAAG ACTGGCTTCT GGACTTCCTC CCTGATCCCC 
AGTGGCCAGA ATTAGGGAAT CAGAATCAAA CCAGTGTAAG 
CTGGTCACAT TGAAATTGGT GGCTTCATTC TAGATGTAGC 

50 AATAGGAAAA CCTACCATCT CAGTGAGCAC CAGCTGCCTC 
CTTATATTTT TATGGTTACA ATGGCACAAA ATTATTATCA 
TTCTCTTTTT TCCGTAATTA CTAGGTAGTT TTCTAATTCT 
TTTTAAAGTC TTTACGATGT AAAATATTTA TTTTTTACTT 
AGGATTATTC ATGGAACAGG AAGAAGCGTA AAGACTATCC 

55 CTTCGTGACT GTAAGATTGT AAATACAGAT TATTTATTAA 
TAGGCTTCAT ACGGAAAGTG TTTGAGAGCA AGTAGTTGAC 
CAAGAACAGC ACAAGGAAAA TCAGTCTAAT AAGCTGCTCT 
ATGTTATGGG ATTCCTTTTT TCTCTGTTTT ATCTTTTCAA 
ATTTGCAAAT GTTTTAAATT GCAAAGAAAG CCATGAGGTC 

60 TCCCTTGTGC ATATTTCCAG GGAGAAGGAA AGCATATACA 
AAAAGAGAAA AAAATGACAA AAGGTGAAAC TTACATACAA 
TGACTGAGTA AAGAATTTTT GGATCAAG CG GAAAGAGTTT 
GCTACTGTAG TACCTAAAAA GTCAGTGTTG TACATAGCAT 
TTCCCAATAA GGAAATAGCA TTGAAATGTT AAATACAATT 

65 TCTATCATCT GGTATACCAT TGCTTTATTT TTATAAATTA 
TAGAATATTC AGATTGTGTA GATATGCTAT TTAAATAATT 
AGAGTTAGTA TTTCTATTTT TATATAATGT TTGCACACTG 
TTTCTTTTTT TTGTTTTTTT r j M j"j^"j , 'p*p i p , j , Y TTTTTTTTTG 
CTATTTGCCA ATACCTTTTT CTAGGAATGT GCTTTTTTTT 

70 TACATTCTAA AGCAGTGTAA GTTGTATATT ACTGTTTCTT 
AATCATATGG AAATTTATAT TT 

Seq ID NO: 38 Protein secruence: 
Protein Accession #: NP_003237 
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7 



AAGCGCGGGA 


GACCAAAGCC 


1560 


CTTGGTCACC 


ATGGGACATC 


1620 


GTCTCTGCAA 


CAACCCCGCA 


1680 


AAAACCAGAT 


CTGCAACAAG 


1740 


TTGCCGGCGT 


GAAGTGTACT 


1800 


CTGGTTACAG 


TGGAAATGGC 


1860 


ATGCCTGCTT 


CAACCACAAT 


1920 


GCCTGCCCTG 

^a ^» ^» ^a ^a ^a ^ 


CCCCCCACGC 


1980 


CCACGGCCAA 


CAAACAGGTG 

Vji * ftft Aft A^i*« ^ 


2040 


GCAACAAGAA 

w m ft Xa* ftft ft^rf * ft* m 


CGCCAAGTGC 


2100 


AGTGCAAGCC 

«1W X ^A* NAftftftft^A' X* ^A> 


TGGCTACGCT 


2160 


GCTGGCCCAA 

\_> X V^J >^ Vrf 4 ft* ft 


TGAGAACCTG 

^^^ft ft^^ft ftft " 


2220 


ATTGCCCCAA 

■ A X X w Xa Xa Va *a * 


CCTTCCCAAC 


2280 


CCTGTGATGA 

\^ A» A ft X ^»*A ft 


TGACGATGAC 


2340 


ATTACAACCC 

* * X X «* ** ft^A 


AGCTCAGTAT 


2400 


GTCCCTACAA 


CCACAACCCA 


2460 


GTGCTGCAGA 


CATTGATGGA 


2520 


ACAATGTGGA 


CCAGAGAGAC 


2580 


CCTTGGAACA 


CAATCCGGAT 


2640 


ACAACAATCA 

• * VA ♦ ftft A 'i » • ** » X XA* * 


GGATATTGAT 


2700 


TGCCCAATGC 


CAACCAGGCT 


2760 


ATGATGACAA 


CGATGGCATT 

\ri A ^A ^A W« A 


2820 


ACCAGAAGGA 


CTCTGACGGC 


2880 


ACAGTGTGCC 

aTa^ Vaa^\J X VJ x Xa 


AGACATCGAT 

r«wi*wftft x ^» vji a a> 


2 940 


TCCGCCGATT 


CCAGATGATT 

Va> \rf4»V-»ftft X Sa/ftA A- A 


3000 


GGGTTGTACG 


CCATCAGGGT 


3060 


CTGTAGGTTA 

\a- X VJ X *l w \J X X *k 


TGATGAGTTT 

X Wla X xxx 


3120 


AAAGGGACGA 


TGACTATGCT 

X \J J» v— X A* x VJ X 


3180 

X* W V 


TTGTGATGTG 


GAAGCAAGTC 

^Jft^Lftt >to> Aft 1\J X V 


3240 


GATACTCGGG 

VJ.fi. X «*V* X ^V3\J\J 


CCTTTCTGTG 

V-\-t* XXX Va X X v 


3300 


GGAACGCCCT 


GTGGCACACA 

V^J X W\JSM>V ft^^ft ft 


3360 


CTCGTCACAT 

Nv X Sv\J X Va«xVa** X 


AGGCTGGAAA 

**^J Sa X NJ \J ft ftft ftft ft 


3420 


AGACGGGTTT 


CATTAGAGTG 


3480 


CCATCTATGA 

V^ %aJ* X V^ X AA X V-* ^ * 


TAAAACCTAT 

X ft7^4~ftA^ft^ X^ X* X ft ft X 


3540 


TGGTGTT CTT 

X VJVJ X W X X X* x x 


CTCTGACCTG 

Va x ^a x wr*v x* x w 


3600 


TGAAAGACTG 


ATCATAAACC 

«• X %A*xXftTftT^*N^X* 


3660 

—J v w w 


AACCCCCAGG 


ATCACTTCTC 

ft* X \^Aft%A X X w X %a- 


3720 


CCTAGAACGT 


GCGACCTGCC 


3780 


CAGCCTCCAA 


TGAATAAGAC 

X NsTftAft^ X ft maw# 


3840 


AAAAAGCATC 


TACTTGCTTC 

X AftNA" X X W X X >-A 


3900 


GCAGGGTGCT 


ATTGTGAGGC 

ftft X X X Wi V 


3960 


CAGAGAAGGG 


AGGACTCACT 


4020 


AACACGGGGA 


GCAGAGGCCA 


4080 


AAGAAAATAT 


GGAGGAACTG 

^#^ffl ft^A^Aft AA A X^ * ^A 


4140 


ACTATTGCTG 


G ATTT CATG A 


4200 


CTTAAATAGA 


AGCAGGAAAG 


4260 


ACCCTTACTC 


ATCACCTTGC 


4320 


GCAGTGCTGG 


CTGCCATTGC 


4380 


TTGTGCAGAT 


GTAGCAGGAA 

\J X • »V XA* ft^A ftft ft 


4440 


CCAAAGGAGG 

ft ftft ftft ft M ft>«^ 


GGCAGCCGTG 

A 1 hA ^A ^ * * "SA 


4500 


ACCTAACTAA 


AACATTCCTT 

ft ft4 ft^^^ft ft ^» X X^ X ^fe 


4560 


CTCTTTTGGA 


AGTATGATTT 


4620 


ATTCTGG^AG 

X X X«« X ^^ftUft^J 


ATCTGGCTGA 

«ft X «^ X W X w * « 


4680 

x w V V 


ATGTCATCTT 

A A ^ X A ft X X X 


TGTTGAGAGT 

X Va X X X 


4740 


CTCTGTTCTG 


CCTGGAAATT 

^1 ^_|^ A* _ . ft Ai ftft ft 


4800 


ATTTATCAGC 

ft ft X X X ft ft X \pm ft^al ftjtf 


AAATCTCTTG 

AAA AA A X W X ^A X X >-A 


4860 

A V/ \J \J 


GCCCCTTGTG 

W X^ X^ X X W X VJ 


CTCAGAGTGG 


4920 


GTGGAATTAG 

>J X X X ■* ft V_l 


TTGGTTATC C 

X X \J v X A ft ft X X* 


4980 


TTCAATACTG 


TTTTACCCCA 

XXX X ft ft ^a Va « * ft 


5040 


11 1 1 lv,l X 


1 1 1111 




ATATTACCTC 


ATTTGTTGTG 


5160 


AAGTGTCTAA 


CAAACTTAAA 


5220 


AAAAACT CTG 


CAGAGAAGTA 


5280 


TCTGAAAGTT 


ATGTTTTTTT 


5340 


TTTTCTCATT 


GCCATTGGAA 


5400 


TATCAGGAAA 


TACTGCCTGT 


5460 


AATTGAAGAA 


TTGTTGGTTT 


5520 


CTTTTGACCT 


CCCATTTTTA 


5580 


GTACACATTT 


TTATCCATTT 


5640 


ATGTACAAGG 


AACAACAATA 


5700 
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11 21 31 41 51 

I I I I I 

MGLAWGLGVL FLMHVCGTNR IPESGGDNSV FDIFELTGAA RKGSGRRLVK GPDPSSPAPR 60 

IEDANLIPPV PDDKFQDLVD AVRAEKGFLL LASLRQMKKT RGTLLALERK DHSGQVFSW 120 

5 SNGKAGTLDL SLTVQGKQHV VSVEEALLAT GQWKSITLFV QEDRAQLYID CEKMENAELD 180 

VPIQSVFTRD LASIARLRIA KGGVNDNFQG VLQNVRFVFG TTPEDILRNK GCSSSTSVLL 240 

TLDNNWNGS SPAIRTNYIG HKTKDLQAIC GISCDELSSM VLELRGLRTI VTTLQDSIRK 300 

VTEENKELAN ELRRPPLCYH NGVQYRNNEE WTVDSCTECH CQNSVTICKK VSCPIMPCSN 360 

ATVPDGECCP RCWPSDSADD GWSPWSEWTS CSTSCGNGIQ QRGRSCDSLN NRCEGSSVQT 420 

10 RTCHIQECDK RFKQDGGWSH WSPWSSCSVT CGDGVITRIR LCNSPSPQMN GKPCEGEARE 480 

TKACKKDACP INGGWGPWSP WDICSVTCGG GVQKRSRLCN NPAPQFGGKD CVGDVTENQI 540 

CNKQDCPIDG CLSNPCFAGV KCTSYPDGSW KCGACPPGYS GNGIQCTDVD ECKEVPDACF 600 

NHNGEHRCEN TDPGYNCLPC PPRFTGSQPF GQGVEHATAN KQVCKPRNPC TDGTHDCNKN 660 

AKCNYLGHYS DPMYRCECKP GYAGNGIICG EDTDLDGWPN ENLVCVANAT YHCKKDNCPN 720 

15 LPNSGQEDYD KDGIGDACDD DDDNDKI PDD RDNCPFHYNP AQYDYDRDDV GDRCDNCPYN 780 

HNPDQADTDN NGEGDACAAD IDGDGILNER DNCQYVYNVD QRDTDMDGVG DQCDNCPLEH 840 

NPDQLDSDSD RIGDTCDNNQ DIDEDGHQNN LDNCPYVPNA NQADHDKDGK GDACDHDDDN 900 

DGIPDDKDNC RLVPNPDQKD SDGDGRGDAC KDDFDHDSVP DIDDICPENV DISETDFRRF 960 

QMIPLDPKGT SQNDPNWWR HQGKEIjVQTV NCDPGLAVGY DEFNAVDFSG TFFINTERDD 1020 

20 DYAGFVFGYQ SSSRFYWMW KQVTQSYWDT NPTRAQGYSG LSVKWNSTT GPGEHLRNAL 1080 

WHTGNTPGQV RTLWHDPRHI GWKDFTAYRW RLSHRPKTGF IRWMYEGKK IMADSGPIYD 1140 
KTYAGGRLGL FVFSQEMVFF SDLKYECRDP 



25 

Seq ID NO: 39 Nucleotide sequence : 
Nucleic Acid Accession #: BC004299 

Coding sequence: 69.. 1235 (underlined sequences correspond to start and stop codons) 

30 

1 11 21 31 41 51 

I I I I I I 

CCCGACCCGT GCGAGGGCCA GGTCCGCGCC TGCCCCGCCA GGCGAAGCGA GGCGACCCGC 60 

GTGCGGCCAT GGCTTCGCTG CTGGGAGCCT ACCCTTGGCC CGAGGGTCTC GAGTGCCCGG 120 

35 CCCTGGACGC CGAGCTGTCG GATGGACAAT CGCCGCCGGC CGTCCCCCGG CCCCCGGGGG 180 

ACAAGGGCTC CGAGAGCCGT ATCCGGCGGC CCATGAACGC CTTCATGGTT TGGGCCAAGG 240 

ACGAGAGGAA ACGGCTGGCA GTGCAGAACC CGGACCTGCA CAACGCCGAG CTCAGCAAGA 300 

TGCTGGGAAA GTCGTGGAAG GCGCTGACGC TGTCCCAGAA GAGGCCGTAC GTGGACGAGG 360 

CGGAGCGGCT GCGCCTGCAG CACATGCAGG ACTACCCCAA CTACAAGTAC CGGCCGCGCA 420 

40 GGAAGAAGCA GGCCAAGCGG CTGTGCAAGC GCGTGGACCC GGGCTTCCTT CTGAGCTCCC 480 

TCTCCCGGGA CCAGAACGCC CTG CCGGAG A AGAGAAGCGG CAGCCGGGGG GCGCTGGGGG 54 0 

AGAAGGAGGA CAGGGGTGAG TACTCCCCCG GCACTGCCCT GCCCAGCCTC CGGGGCTGCT 600 

ACCACGAGGG GCCGGCTGGT GGTGGCGGCG GCGGCACCCC GAGCAGTGTG GACACGTACC 66 0 

CGTACGGGCT GCCCACACCT CCTGAAATGT CTCCCCTGGA CGTGCTGGAG CCGGAG C AG A 72 0 

45 CCTTCTTCTC CTCCCCCTGC CAGGAGGAGC ATGGCCATCC CCGCCGCATC CCCCACCTGC 78 0 

CAGGGCACCC GTACTCACCG GAGTACGCCC CAAGCCCTCT CCACTGTAGC CACCCCCTGG 84 0 

GCTCCCTGGC CCTTGGCCAG TCCCCCGGCG TCTCCATGAT GTCCCCTGTA CCCGGCTGTC 900 

CCCCATCTCC TGCCTATTAC TCCCCGGCCA CCTACCACCC ACTCCACTCC AACCTCCAAG 960 

CCCACCTGGG CCAGCTTTCC CCGCCTCCTG AGCACCCTGG CTTCGACGCC CTGGATCAAC 102 0 

50 TGAGCCAGGT GGAACTCCTG GGGGACATGG ATCGCAATGA ATTCGACCAG TATTTGAACA 108 0 

CTCCTGGCCA CCCAGACTCC GCCACAGGGG CCATGGCCCT CAGTGGGCAT GTTCCGGTCT 114 0 

CCCAGGTGAC ACCAACGGGT CCCACAGAGA CCAGCCTCAT CTCCGTCCTG GCTGATGCCA 120 0 

CGGCCACGTA CTACAACAGC TACAGTGTGT CA TAG AGCTG GAGGCGCCCC GTCCGGTCAG 1260 

CCCTCGCGCC CTCTCCTTCT TGTGCCTTGA GTGGCAGAGG AGCCGTCCAG CCACACCAGC 1320 

55 TTTCCTCCCA CCGCTCAGGG CAGGGAGGTC TGAACTGCGG CCCCAGAGCC TTTGGCCTAA 1380 

GCTGGACTCT CCTTATCCGA GTGCCGCCTC TATCCCCTTC CCCACGTTCC AGCCCCTGCA 1440 

GCCCACATTT TAAGTATATT CCTTCAAGTG AGTTTTCCTC CAGCCCCTGA GAGTTGCTGT 150 0 

CTCCCAGTGG AATGTTCACT GACGTCTTTT CTTGGTAGCC ATCATCGAAA CTAATGGGGG 156 0 

GACAGACTTG ATAGCCAAGG TCCCTTCTGG TCCAGTTTTC TGATTTAGGG TTCTCTCAAG 162 0 

60 ATTAATAAAG GAAGATGGGG AAATTTGACT CATTAATGAG CTCGCTAACC TACGATCTGG 1680 

TGATAATTTT GTGTGCACAG CCCAAGGACC ACGAGGCTTT CTGCACTTTC TGCACCCCCT 1740 

TCCAAAGTGA CCACAAAATT TCAAAGGGAC TCATACAATT TGAGAAAAAA CAGTCAACCT 1800 

GATTTGAGAA ATTAACCAGT ATGGCTAACT ATATCACAGA AAATGGGATT GAGTTAAAAC 1860 

TATTTTATTT TAAATATACA TTTTAAAGCA GTTCTTTTTT TTTGTTAATT TGTTTATTAT 1920 

65 ACACACACTT CAAGAGCCAC CGCGCCCAGC CTACATTTAT AATTTTCATT CTCTTTTACC 1980 

TATAAAATTC AGTGTATTAG TTTCATTACA TAGGAGAAAT TATATTTCTA AACATTTTAT 2040 
GATGTTTAAA AACAAAACAG GCTGTTGTAA AAAAAAAAAA AAAAAAAAA 

Seq ID NO: 40 Protein sequence : 
70 Protein Accession #: AAH04299 

1 11 21 31 41 51 

| I I I I I 

MASLLGAYPW PEGLECPALD AELSDGQSPP AVPRPPGDKG SESRIRRPMN AFMVWAKDER 60 

75 KRLAVQNPDL HNAELSKMLG KSWKALTLSQ KRPYVDEAER LRLQHMQDYP NYKYRPRRKK 120 

QAKRLCKRVD PGFLLSSLSR DQNALPEKRS GSRGALGEKE DRGEYSPGTA LPSLRGCYHE 180 
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GPAGGGGGGT PSSVDTYPYG LPTPPBMSPL DVLEPEQTFF SSPCQEEHGH PRRIPHLPGH 240 

PYSPEYAPSP LHCSHPLGSL ALGQSPGVSM MSPVPGCPPS PAYYS PATYH PLHSNLQAHL 300 

GQLSPPPEHP GPDALDQLSQ VELLGDMDRN EFDQYLNTPG HPDSATGAMA LSGHVPVSQV 360 

TPTGPTETSL ISVLADATAT YYNSYSVS 



Seq ID NO: 41 Nucleotide sequence: 
Nucleic Acid Accession #: NM__004449 

Coding sequence: 1..1389 (underlined sequences correspond to start and stop codons) 



1 
I 

ATGATTCAGA 
GAGGACCAGT 
ACCGCGTCCT 
CAGGATTGGC 
CAGGTGAATG 
GTGGGCAGCC 
CCACCCCCAA 
TGGAGTACAG 
GACGTCAACA 
GACGACTTCC 
TACCTCAGAG 
AACTCTCCAC 
TCAGCCTGGA 
TCCACAGTGC 
CCAACAAGTA 
CTGGAGCTCC 
GAGTTCAAGA 
CCCAACATGA 
ATGACCAAGG 
GCCCTCCAGC 
ATGGGCTCCT 
CTCCCCGTGA 
GGGGGTATAT 
TACTACTAA 



11 



21 



CTGTCCCGGA 
CGTTGTTTGA 
CCTCCAGCGA 
TGTCTCAACC 
GCTCAAGGAA 
CAGACACCGT 
ACATGACCAC 
ACCATGTGCG 
TCTTGTTATT 
AGAGGCTCAC 
AGACTCCTCT 
GGTTAATGCA 
CCGGTCACGG 
CCAAAACTGA 
GCCGCCTTGC 
TGTCGGACAG 
TGACGGATCC 
ACTACGATAA 
TCCATGGGAA 
CCCACCCCCC 
ATCACGCCCA 
CATCTTCCAG 
ACCCCAACAC 



CCCAGCAGCT 
GTGTGCCTAC 
CTATGGACAG 
CCCAGCCAGG 
CTCTCCTGAT 
TGGGATGAAC 
GAACGAGCGC 
GCAGTGGCTG 
CCAGAACATC 
CCCCAGCTAC 
TCCACATTTG 
TGCTAGAAAC 
CCACCCCACG 
AGACCAGCGT 
AAATCCAGGC 
CTCCAACTCC 
CGACGAGGTG 
GCTCAGCCGC 
GCGCTACGCC 
GGAGTCATCT 
CCCACAGAAG 
TTTTTTTGCT 
TAGGCTCCCC 



31 
I 

CATATCAAGG 
GGAACGCCAC 
ACTTCCAAGA 
GTCACCATCA 
GAATGCAGTG 
TACGGCAGCT 
AGAGTTATCG 
GAGTGGGCGG 
GATGGGAAGG 
AACGCCGACA 
ACTTCAGATG 
ACAGATTTAC 
CCCCAGTCGA 
CCTCAGTTAG 
AGTGGCCAGA 
AGCTGCATCA 
GCCCGGCGCT 
GCCCTCCGTT 
TACAAGTTCG 
CTGTACAAGT 
ATGAACTTTG 
GCCCCAAACC 
ACCAGCCATA 



41 
I 

AAGCCTTATC 
ACCTGGCTAA 
TGAGCCCACG 
AAATGGAATG 
TGGCCAAAGG 
ACATGGAGGA 
TGCCAGCAGA 
TGAAAGAATA 
AACTGTGCAA 
TCCTTCTCTC 
ATGTTGATAA 
CATATGAGCC 
AAGCTGCTCA 
ATC CTTATCA 
TCCAGCTTTG 
CCTGGGAAGG 
GGGGAGAGCG 
ACTACTATGA 
ACTTCCACGG 
ACCCCTCAGA 
TGGCGCCCCA 
CATACTGGAA 
TGCCTTCTCA 



51 
I 

AGTTGTGAGT 60 

GACAGAGATG 120 

CGTCCCTCAG 180 

TAACCCTAGC 240 

CGGGAAGATG 300 

GAAGCACATG 360 

TCCTACGCTA 420 

TGGCCTTCCA 480 

GATGACCAAG 540 

ACATCTCCAC 600 

AGCCTTACAA 660 

CCCCAGGAGA 720 

ACCATCTCCT 780 

GATTCTTGGA 840 

GCAGTTCCTC 900 

CACCAACGGG 960 

GAAGAGCAAA 1020 

CAAGAACATC 1080 

GATCGCCCAG 1140 

CCTCCCGTAC 1200 

CCCTCCAGCC 1260 

TTCACCAACT 1320 

TCTGGGCACT 1380 



Seq ID NO: 42 Protein sequence: 
Protein Accession #: NP 004440 



1 
I 

MIQTVPDPAA 
QDWLSQPPAR 
PPPNMTTNER 
DDFQRLTPSY 
SAWTGHGHPT 
LELLSDSSNS 
MTKVHGKRYA 
LPVTSSSFFA 



11 
I 

HIKEALSWS 
VTIKMECNPS 
RVIVPADPTL 
NADILLSHLH 
PQSKAAQPSP 
SCITWEGTNG 
YKFDFHGIAQ 
APNPYWNSPT 



21 
I 

EDQSLFECAY 
QVNGSRNSPD 
WSTDHVRQWL 
YLRETPLPHL 
STVPKTEDQR 
EFKMTDPDEV 
ALQPHPPESS 
GGIYPNTRLP 



31 
I 

GTPHIiAKTEM 
ECSVAKGGKM 
EWAVKEYGLP 
TSDDVDKALQ 
PQLDPYQILG 
ARRWGERKSK 
LYKYPSDLPY 
TSHMPSHLGT 



41 
I 

TASSSSDYGQ 
VGSPDTVGMN 
DVNILLFQNI 
NSPRLMHARN 
PTSSRLANPG 
PNMNYDKLSR 
MGSYHAHPQK 
YY 



51 
I 

TSKMSPRVPQ 60 

YGSYMEEKHM 120 

DGKELCKMTK 180 

TDLPYEPPRR 240 

SGQIQLWQFL 3 00 

ALRYYYDKNI 360 

MNFVAPHPPA 420 



Seq ID NO: 43 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005100 

Coding sequence: 192.. 5537 (underlined sequences correspond to start and stop codons) 



CCTTCTTTTA 
CAGTCGGCTG 
CTTGCCCCTG 
GCGGAGGAGC 
AGGGGAGCTC 
CGCCAGACAC 
AGAAGAATGG 
AGGAGGGTGA 
AGGAGGAAGA 
AAAGAGACTC 
ATGGGCAGGA 
AGCTAACACA 
TTGTTGGCTT 



11 

I 

AGGAGTTTGC 
GCGGCGAAGG 
TCCCTGCGGC 
CATGGGCGCC 
CACGCCGGCT 
CACCGCGGAC 
TCAGCTGTCC 
CCTAAATGGC 
AGAAGTCATT 
CGATAAAGAG 
G G AG AACCG A 
ACCCACTGAG 
TAAATTCACT 



21 
I 

CGCGAGCGCG 
AAGGCGCTCT 
TTGGGGAAAG 
GGGAGCTCCA 
GAGCCCGAGC 
CCCGCCATCG 
ACC AT CAATG 
CAGAAAGGAG 
GTCACGGAGG 
ATGGCTACTA 
AATATCGAAC 
TCCCAGGCTA 
GTGAAAAAGG 



31 
I 

TCTCCTTCAT 
CGGGACCTCA 
CGTAACCCGG 
CCGAGCAGCG 
CCAGCGGCGG 
CTGCCTCGGA 
GCGTAGCTGA 
CCCTGAACGG 
TTGGACAGAG 
AGTCAGCGGT 
AGATTCCTTC 
ATGATATTGG 
ATAAGACAGA 



41 
I 

TCGCAGGCTG 
CGGGCGCGCG 
CGGCTAGGCG 
CAGCCCGGAG 
CGGCCCCTCG 
CCCCGCCACC 
GCAAGATGAG 
TCAAGGAGCC 
AGACTCTGAA 
TGTTCACGAC 
TTCAGAAAGC 
ATTTAAGAAG 
GAAGCCTGAC 



51 

I 

GGCGCGTTCG 60 

TCTTTTGGCT 120 

CGGGAGAAGT 180 

CAGCCGCCCG 240 

GCCGAGGCGG 300 

AAGCTCCTAC 360 

CTCAGCCTCC 420 

CTAAACAGCC 4 80 

GATGTGAGCG 540 

ATCACAGATG 600 

AATTTAGAAG 660 

GTGTTTAAGT 720 

ACTGTCCAGC 780 
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TACTCACTGT GAAGAAAGAT GAAGGGGAGG 
CCAGCCTTGG GGCTGGAGAA GCAGCATCCA 
AACCCGAAGA GACCCTGAAG CGTGAGCAAA 
CTGGCCAAGC AGTGGAGGAA TGCAAAGAGG 
5 GCAAGTCTGC AGAATCTCCG ACTAGTCCCG 
AATTCTTCAC TCAAGGTTGG GCCGGCTGGC 
AGGATGAAGT GGAAGCTTCA GAGAAGAAAA 
AAGAAGACGG AAAGGCAGAG GTTGCCTCCG 
CACAGGAGCC GGCAGAAAGT GCCCACGAGC 

10 AGCTGCCCTC AGAGGAGCAA GTCAGTGGCT 
CGTTGGCGAC AGAAGTGTTT GATGAGAAAA 
AAGTCCACGT CAGCACCGTG GAGGAGAGAA 
CAGCAGGGTC TGTGCCAGCT GAAGAATTGG 
AACCTGCCAA GGAGCTGGTG AAGCTCAAAG 

15 AGGGAGCTGA CCTCAGTCCT GATGAGAAGG 
GTGAGGTGGA AATGCTGTCA TCACAGGAGA 
AGCTTTTTAC CAGCACTGGC TTAAAAAAGC 
GAGGAGGAGA CGAGGAATCA GGGGAGCACA 
AGGAGGAGCA AAAGGGCGAG AGCTCTGCCT 

20 GTCTGGAAAA GGGCTTAGCC GAGGTGCAGC 
CCGATGGAGA GAAAAAAAGA GAAGGTGTCA 
CGCCCAAGAA GCGTGTTAGA CGGCCTTCGG 
TCAAGAGCGC TACCTTGTCT TCCACCGAGA 
AAGGGAGCGT GGAAGAGCCA AAGCCGGAAG 

25 CTTGGGAAGC TTTAATTTGT GTGGGATCAT 
CTGATGAGGA AGGGGGACCA AAAGCAATGG 
GAAAAGACAA AGAGACGGGG ACAGACGGGA 
GGCAGGGAAG TTCCTCCCCG GAGCAAGCTG 
CCTGGGAGTC ATTTAAAAGG TTAGTCACGC 

30 AGAAAAGCGA AGACTCCATA GCTGGGTCTG 
CCGGTAAAGA AGAATCCTGG GTCTCAATCA 
GGCCAGATGG GAAACAAGAA CAAGCCCCTG 
AAGATGACTC TGATGTCCCG GCCGTGGTCC 
AGAAAATGGA GGCACAGCAA GCCCAAAAAG 

35 CTGAGGTGTC CAAGGAGCTC AGCGAGAGTC 
ACGGGACGAG GGCAGCTACC ATTATTGAAG 
TGACAGAACC TCTTGAACAA GTAGAAGCTG 
AAAGAGAAGT AATTGCAGAA GAAGAACCCC 
GAGAGGCCCG GGGCGACACG GTCGTTAGTG 

40 CTGCAGAAAC TGCAGGGCCA TTGGGTTCCG 
AGACCACAGA AATGGTGTCA GCAGTCTCCC 
AGGCCACTCC GGTGCAGGAG GTGGAAGGTG 
GGACTCAAGA GGTCCTCCAG GCAGTGGCAG 
GCACCGGTGG GCCAGAAGAT GTGCTTCAGC 

45 AAGAGCAGGC TGAAGCGTCG GGTCTGAAGA 
CTCAGGAGGC AAAAACTGAG CCTTTTACAC 
AAAGCTTTGA AAAAGCTCCT CAAGTCACAG 
CTTGTCAAGC CGAAACCTTA GCTGGGGTAA 
TCCCCCCTGA CTCGGTGGAA ACCCCTACAG 

50 CCGACTTTGA CGCACCAGGC ACAACCCAGA 
ATGAGGTCGC ATCTGGTACC CAGTCAGGGG 
AAGAGAGGCC TCCAGCACCT TCCAGTTTTG 
AGATGGAAGA CACTCTAGAG CATACAGATA 
TGTCAAAGAC TGAGGGGACT CAAGAGGCTG 

55 TACCATTTTT CGAAGGACTT GAGGGGTCTA 
AGGTCACTGA AGTTGCCCTT AAAGGTGAAG 
ATGCTCTTGA ACTGCAGAGT CACGCTAAGT 
TAGTTCAAGT CGAAAGGGAG AAAACAGAAG 
TTGAGCACGA AACAGCTGTT ACCGTATCTG 

60 TGAATGTGCC CATCATAGAT GGGGCAAAGG 
CCTGCCTAGG TCAAGAGGAG GCAGTATGCA 
CATTCACTCT AACAGCGGCT GCAGAGGAGG 
TAGAAACAGG TGAAACGTTG GAGCCTGCAG 
CTGAAAAAAA TGAAGACTTT GCCGCTCATC 

65 ACTGTCAGGC AAAATCGACA CCAGTGATAG 
CCGACCTGGA AGGAGAGAAA ACCACATCAC 
AGGTTGCTTG CCAGGAGGTC AAAGTGAGTG 
GGATTTTGGA ACTTGAGACC AAAAGCAGTA 
TTGACCAGTT TGTACGTACA GAAGAAACAG 

70 CACAAGCTCA CGTGATAAAA GCTGACAGCC 
GAGAGGAACC TCAGGCCTCT GCACAGGATG 
CAGAGTCAAC CGCAGTGGGA CAAGCACATT 
CAGAAAAGAC CATGACTGTT GAGGTAGAAG 
AGGTCGTCCT CCCATCTGAG GAAGAGGGAG 

75 ATGATGGTCA TGCCTTGTTA GCAGAAAGAA 
ATGAAAAAGG TGATGATGTT GATGACCCTG 



GAGCAGCAGG GGCTGGCGAC CACCAGGACC 840 

AAGAAAGCGA ACCCAAACAA TCTACAGAGA 900 

GCCACGCAGA AATTTCTCCC CCAGCCGAAT 960 

AAGGAGAAGA GAAACAAGAA AAAGAACCTA 1020 

TGACCAGTGA AACAGGATCA ACCTTCAAAA 1080 

GCAAAAAGAC CAGTTTCAGG AAGCCGAAGG 1140 

AGGAACAAGA GCCAGAAAAA GTAGACACAG 1200 

AGAAACTGAC CGCCTCCGAG CAAGCCCACC 1260 

CCCGGTTATC AGCTGAATAT GAGAAAGTTG 1320 

CGCAGGGACC TTCTGAAGAG AAACCTGCTC 1380 

TAGAAGTCCA CCAAGAAGAG GTTGTGGCCG 144 0 

CCGAAGAGCA GAAAACGGAG GTGGAAGAAA 1500 

TTGGAATGGA TGCAGAACCT CAGGAAGCCG 1560 

AAACGTGTGT TTCCGGAGAG GACCCTACAC 1620 

TGCTGTCCAA ACCCCCCGAA GGCGTTGTGA 168 0 

GAATGAAGGT GCAGGGAAGT CCACTAAAGA 1740 

TTTCTGGAAA GAAACAGAAA GGGAAAAGAG 1800 

CTCAGGTTCC AGCCGATTCT CCGGACAGCC 1860 

CATCCCCTGA GGAGCCCGAG GAGATCACGT 1920 

AGGATGGGGA AGCTGAAGAA GGAGCTACTT 1980 

CTCCCTGGGC ATCATTCAAA AAGATGGTGA 2040 

AAAGTGATAA AGAAGATGAG CTGGACAAGG 2100 

GCACAGCCTC TGAAATGCAA GAAGAAATGA 2160 

AACCAAAGCG CAAGGTGGAT ACCTCAGTAT 2220 

CCAAGAAAAG AGCAAGGAGA AGGTCCTCTT 2280 

GAGGAGACCA CCAGAAAGCT GATGAGGCCG 2340 

TCCTTGCTGG TTCCCAAGAA CATGATCCAG 2400 

GAAGCCCTAC CGAAGGGGAG GGCGTTTCCA 2460 

CAAGAAAAAA ATCAAAGTCC AAGCTGGAAG 2520 

GTGTAGAACA TTCCACTCCA GACACTGAAC 2580 

AGAAGTTTAT TCCTGGACGA AGGAAGAAAA 2640 

TTGAAGACGC AGGGCCAACA GGGGCCAACG 2700 

CTCTGTCTGA GTATGATGCT GTAGAAAGGG 2760 

GCGCAGAGCA GCCCGAGCAG AAGGCAGCCA 2820 

AGGTTCATAT GATGGCAGCA GCTGTCGCTG 2880 

AAAGGTCTCC TTCTTGGATA TCTGCTTCAG 2940 

AAGCCGCACT GTTAACTGAG GAGGTATTGG 3000 

CCACGGTTAC TGAACCTCTG CCAGAGAACA 3060 

AGGCGGAATT GACCCCCGAA GCTGTGACAG 3120 

AAGAAGGAAC CGAAGCATCT GCTGCTGAAG 3180 

AGTTAACCGA CTCCCCAGAC ACCACAGAGG 3240 

GCGTACCTGA CATAGAAGAG CAAGAGAGGC 3300 

AAAAAGTGAA AGAGGAATCC CAGCTGCCTG 3360 

CTGTGCAGAG AGCAGAGGCA GAAAGACCAG 3420 

AAGAGACGGA TGTAGTGTTG AAAGTAGATG 34 80 

AAGGGAAGGT GGTGGGGCAG ACCACCCCAG 354 0 

AGAGCATAGA GTCCAGTGAG CTTGTAACCA 360 0 

AATCACAGGA GATGGTGATG GAACAGGCTA 3660 

ACAGTGAGAC TGATGGAAGC ACCCCCGTAG 372 0 

AAGACGAGAT TGTGGAAATC CATGAGGAGA 378 0 

GCACAGAAGC AGAGGCAGTT CCTGCACAGA 384 0 

TGTTCCAGGA AGAAACTAAA GAACAATCAA 3900 

AAGAGGTGTC AGTGGAAACT GTATCCATTC 3960 

ACCAGTATGC TGATGAGAAA ACCAAAGACG 4020 

TAGACACAGG CATAACAGTC AGTCGGGAAA 4080 

GGACAGAAGA AGCTGAATGT AAAAAGGATG 4140 

CTCCTCCATC CCCCGTGGAG AGAGAGATGG 42 00 

CAGAGCCAAC CCATGTGAAT GAAGAGAAGC 4260 

AAGAGGTCAG TAAGCAGCTC CTCCAGACAG 4320 

AAGTCAGCAG TTTGGAAGGA AGCCCTCCTC 43 80 

CCAAAATTCA AGTTCAGAGC TCTGAGGCAT 4440 

AAAAGGTCTT AGGAGAAACT GCCAACATTT 4500 

GTGCACATTT AGTTCTGGAA GAGAAATCCT 4560 

CAGGGGAAGA TGCTGTGCCC ACAGGGCCCG 4 620 

TATCTGCTAC TACCAAGAAA GGCTTAAGTT 46 80 

TGAAGTGGAA GTCAGATGAA GTCGATGAGC 4740 

TAGCAATTGA GGATTTAGAG CCTGAAAATG 4 800 

AACTTGTCCA AAACATCATC CAGACAGCCG 4860 

CCACCGAAAT GTTGACGTCT GAGTTACAGA 4 920 

AGGACGCTGG ACAGGAAACG GAGAAAGAAG 4 980 

AAACACCAAT TACTTCAGCC AAAGAGGAGT 5040 

CTGATATTTC CAAAGACATG AGTGAAGCCT 5100 

GTTCCACTGT AAATGATCAG CAGCTGGAAG 5160 

GTGGAGCTGG AACAAAGTCT GTGCCAGAAG 5220 

TAGAGAAGTC ACTAGTTGAA CCGAAAGAAG 5280 

AAAACCAGAA CTCAGCCCTG GCTGATACTG 5340 
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ATGCCTCAGG AGGCTTAACC AAAGAGTCCC CAGATACAAA TGGACCAAAA CAAAAAGAGA 5400 

AGGAGGATGC CCAGGAAGTA GAATTGCAGG AAGGAAAAGT GCACAGTGAA TCAGATAAAG 5460 

CGATCACACC CCAAGCACAG GAGGAGTTAC AGAAACAAGA GAGAGAATCT GCAAAGTCAG 5520 

AACTTACAGA ATCT TAA AAC ATCATGCAGT TAAACTCATT GTCTGTTTGG AAGACCAGAA 5580 

TGTGAAGACA AGTAGTAGAA GAAAATGAAT GCTGCTGCTG AGACTGAAGA CCAGTATTTC 564 0 

AGAACTTTGA GAATTGGAGA GCAGGCACAT CAACTGATCT CATTTCTAGA GAGCCCCTGA 5700 

CAATCCTGAG GCTTCATCAG GAGCTAGAGC CATTTAACAT TTCCTCTTTC CAAGACCAAC 5760 

CTACAATTTT CCCTTGATAA CCATATAAAT TCTGATTTAA GGTCCTAAAT TCTTAACCTG 5820 

GAACTGGAGT TGGCAATACC TAGTTCTGCT TCTGAAACTG GAGTATCATT CTTTACATAT 5880 

TTATATGTAT GTTTTAAGTA GTCCTCCTGT ATCTATTGTA TATTTTTTTC TTAATGTTTA 5940 

AGGAAATGTG CAGGATACTA CATGCTTTTT GTATCACACA GTATATGATG GGGCATGTGC 6000 

CATAGTGCAG GCTTGGGGAG CTTTAAGCCT CAGTTATATA ACCCACAAAA AACAGAGCCT 6060 

CCTAGATGTA ACATTCCTGA TCAAGGTACA ATTCTTTAAA ATTCACTAAT GATTGAGGTC 6120 

CATATTTAGT GGTACTCTGA AATTGGTCAC TTTCCTATTA CACGGAGTGT GCCAAAACTA 6180 

AAAAG CATTT TGAAACATAC AGAATGTTCT ATTGTCATTG GGAAATTTTG CTTTCTAACC 6240 

CAGTGGAGGT TAGAAAGAAG TTATATTCTG GTAGCAAATT AACTTTACAT CCTTTTTCCT 6300 

ACTTGTTATG GTTGTTTGGA CCGATAAGTG TGCTTAATCC TGAGGCAAAG TAGTGAATAT 6360 

GTTTTATATG TTATGAAGAA AAGAATTGTT GTAAGTTTTT GATTCTACTC TTATATGCTG 6420 

GACTGCATTC ACACATGGCA TGAAATAAGT CAGGTTCTTT ACAAATGGTA TTTTGATAGA 64 80 

TACTGGATTG TGTTTGTGCC ATATTTGTGC CATTCCTTTA AGAACAATGT TGCAACACAT 6540 

TCATTTGGAT AAGTTGTGAT TTGACGACTG ATTTAAATAA AATATTTGCT TCACTTAAAA 6600 
AAAAAAAA 

Seq ID NO: 44 Protein sequence: 
Protein Accession #: NP 005091 



1 11 21 31 41 51 

I I I I I I 

MGAGSSTEQR SPEQPPEGSS TPAEPEPSGG GPSAEAAPDT TADPAIAASD PATKLLQKNG 60 

QLSTINGVAE QDELSLQEGD LNGQKGALNG QGALNSQEEE EVIVTEVGQR DSEDVSERDS 120 

DKEMATKSAV VHDITDDGQE ENRNIEQIPS SESNLEELTQ PTESQANDIG FKKVFKFVGF 180 

KFTVKKDKTE KPDTVQLLTV KKDEGEGAAG AGDHQDPSLG AGEAASKESE PKQSTEKPEE 240 

TLKREQSHAE ISPPAESGQA VEECKEEGEE KQEKEPSKSA ESPTSPVTSE TGSTFKKFFT 300 

QGWAGWRKKT SFRKPKEDEV EASEKKKEQE PEKVDTEEDG KAEVASEKLT ASEQAHPQEP 360 

AESAHEPRLS AEYEKVEIiPS EEQVSGSQGP SEEKPAPLAT EVFDEKIEVH QEEWAEVHV 420 

STVEERTEEQ KTEVEETAGS VPAEELVGMD AEPQEAEPAK ELVKLKETCV SGEDPTQGAD 480 

LSPDEKVLSK PPEGWSEVE MLSSQERMKV QGSPLKKLFT STGLKKLSGK KQKGKRGGGD 540 

EESGEHTQVP ADSPDSQEEQ KGESSASSPE EPEEITCLEK GLAEVQQDGE AEEGATSDGE 600 

KKREGVTPWA SFKKMVTPKK RVRRPSESDK EDELDKVKSA TLSSTESTAS EMQEEMKGSV 660 

EEPKPEEPKR KVDTSVSWEA LICVGSSKKR ARRRSSSDEE GGPKAMGGDH QKADEAGKDK 72 0 

ETGTDGIliAG SQEHDPGQGS SSPEQAGSPT EGEGVSTWES FKRLVTPRKK SKSKLEEKSE 780 

DSIAGSGVEH STPDTEPGKE ESWVSIKKFI PGRRKKRPDG KQEQAPVEDA GPTGANEDDS 840 

DVPAWPLSE YDAVEREKME AQQAQKGAEQ PEQKAATEVS KELSESQVHM MAAAVADGTR 900 

AATIIEERSP SWISASVTEP LEQVEAEAAL LTEEVLEREV IAEEEPPTVT EPLPENREAR 960 

GDTWSEAEL TPEAVTAAET AGPLGSEEGT EASAAEETTE MVSAVSQLTD SPDTTEEATP 1020 

VQEVEGGVPD IEEQERRTQE VLQAVAEKVK EESQLPGTGG PEDVLQPVQR AEAERPEEQA 1080 

EASGLKKETD WLKVDAQEA KTEPFTQGKV VGQTTPESFE KAPQVTESIE SSELVTTCQA 114 0 

ETLAGVKSQE MVMEQAIPPD SVETPTDSET DGSTPVADFD APGTTQKDEI VEIHEENEVA 1200 

SGTQSGGTEA EAVPAQKERP PAPSSFVFQE ETKEQSKMED TLEHTDKEVS VETVSILSKT 1260 

EGTQEADOYA DEKTKDVPFF EGLEGSIDTG ITVSREKVTE VALKGEGTEE AECKKDDAIiE 1320 

LQSHAKSPPS PVEREMWQV EREKTEAEPT HVNEEKLEHE TAVTVSEEVS KQLLQTVNVP 1380 

IIDGAKEVSS LEGSPPPCLG QEEAVCTKIQ VQSSEASFTL TAAAEEEKVL GETANILETG 144 0 

ETLEPAGAHL VLEEKSSEKN EDFAAHPGED AVPTGPDCQA KSTPVIVSAT TKKGLSSDLE 1500 

GEKTTSLKWK SDEVDEQVAC QEVKVSVAIE DLEPENGILE LETKSSKLVQ NIIQTAVDQF 1560 

VRTEETATEM LTSELQTQAH VIKADSQDAG QETEKEGEEP QASAQDETPI TSAKEESEST 162 0 

AVGQAHSDIS KDMSEASEKT MTVEVEGSTV NDQQLEEWL PSEEEGGGAG TKSVPEDDGH -1680 

ALLAERIEKS LVEPKEDEKG DDVDDPENQN SALADTDASG GLTKESPDTN GPKQKEKEDA 1740 
QEVELQEGKV HSESDKAITP QAQEELQKQE RESAKSELTE S 

Seq ID NO: 45 Nucleotide sequence: 
Nucleic Acid Accession #: NM_001290 

Coding sequence: 110 .. 1231 (underlined sequences correspond to start and 

1 11 21 31 41 51 

I I I I I I 

GTGAGCGTGT GTGCGTGCGT CTACTTTGTA CTGGGAAGAA CACAGCCCAT GTGCTCTGCA 60 

TGGACGTTAC TGATACTCTG TTTAGCTTGA TTTTCGAAAA GCAGGCAAGA TGTCCAGCAC 120 

ACCACATGAC CCCTTCTATT CTTCTCCTTT CGGCCCATTT TATAGGAGGC ATACACCATA 180 

CATGGTACAG CCAGAGTACC GAATCTATGA GATGAACAAG AGACTGCAGT CTCGCACAGA 240 

GGATAGTGAC AACCTCTGGT GGGACGCCTT TGCCACTGAA TTTTTTGAAG ATGACGCCAC 300 

ATTAACCCTT TCATTTTGTT TGGAAGATGG ACCAAAGCGA TACACTATCG GCAGGACCCT 360 

CATCCCCCGT TACTTTAGCA CTGTGTTTGA AGGAGGGGTG ACCGACCTGT ATTACATTCT 420 

CAAACACTCG AAAGAGTCAT ACCACAACTC ATCCATCACG GTGGACTGCG ACCAGTGTAC 480 

CATGGTCACC CAGCACGGGA AGCCCATGTT TACCAAGGTA TGTACAGAAG GCAGACTGAT 540 
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CTTGGAGTTC ACCTTTGATG ATCTCATGAG AATCAAAACA TGGCACTTTA CCATTAGACA 600 

ATACCGAGAG TTAGTCCCGA GAAGCATCCT AGCCATGCAT GCACAAGATC CTCAGGTCCT 660 

GGATCAGCTG TCCAAAAACA TCACCAGGAT GGGGCTAACA AACTTCACCC TCAACTACCT 720 

CAGGTTGTGT GTAATATTGG AGCCAATGCA GGAACTGATG TCGAGACATA AAACTTACAA 780 

CCTCAGTCCC CGAGACTGCC TGAAGACCTG CTTGTTTCAG AAGTGGCAGA GGATGGTGGC 840 

TGCGCCAGCA GAACCCACAA GGCAACCAAC AACCAAACGG AGAAAAAGGA AAAATTCCAC 900 

CAGCAGCACT TCCAACAGCA GCGCTGGGAA CAATGCAAAC AGCACTGGCA GCAAGAAGAA 960 

GACCACAGCT GCAAACCTGA GTCTGTCCAG TCAGGTACCT GATGTGATGG TGGTAGGAGA 1020 

GCCAACTCTG ATGGGAGGTG AGTTTGGGGA CGAGGACGAA AGGCTAATCA CTAGATTAGA 1080 

AAACACGCAA TATGATGCGG CCAACGGCAT GGACGACGAG GAGGACTTCA ACAATTCACC 1140 

CGCGCTGGGG AACAACAGCC CGTGGAACAG TAAACCTCCC GCCACTCAAG AGACCAAATC 1200 

AGAAAACCCC CCACCCCAGG CTTCCCAATA AGATGATCGG CACCAGAATC CACTGTCAAT 1260 

AGGCCCGTGG GTGATCATTA CAATTGCAAA TCTTTACTTA CAGGAGAGGA AACAGAAGAG 1320 

ATAAAAACTT TTCCATGCAA ATATCTATTT CTAAACCACA ATGATCTGAT TTTCTTTCTT 1380 

CTTTCTTTTT TTCTAATTGA GAGGATTATT CCCAGTAAGC TTCCATGACC CTTTCTTGGA 1440 

GGCCTTCACA GGTAATACAG ATACTGGCAC TGATTGTAAT TAAAATGAGA GAAAACTCTA 1500 

GCGCATCTTC TGGCACGGTT TTAACAACGT GTTTGTGTTG AATTTCCTTT TTATGCATCA 1560 

AACGAAGGCC ATATTGTCCA TAAATGCTCA GTGCTCAGGA TCTCATTAAT ATGCCGAACC 1620 

TAA CTACAGA TGACTTTTTA ATATTGTAAA ATATTTTCTG CTTTTTGACT TGCATCTGAG 1680 

AGTTTCTTGT TTCAGTAAAA AAAGAAAAGA CAAAAAAATC AGCTTTGGAA AGTAATTTAA 1740 

ATGTACCTTA TTTTTTTTTT CTTTATGTTT TCTTTCATTG GGCAACAGCT AAGAGGGCCC 1800 

AGCAAGGTAA TTTATGGTTG AGCTGATGTC AATTGGTTCT TGTCTTGAGT CGACTCAATT 1860 

TAGCCCAAGT GCTGAAACAA GAAATGT CAT TTTTTTCATC AAAGACACCA GGGCAGATTT 1920 

TTAAGTAAAG AAAGACAATT GGACCCTTAA GAATTTATGC ATTTGTAAAG TTGCTGTTGA 1980 

TCCAAATATT TTCAAGCCAT GTAATCCATT GGTTTTGTGG GCAGTTTAAT AAACCTGAAC 2040 

CTTTGTGTGT TTTCTAATTG TACCTGAGTT GACCATCCTT TCTTTTTATA GTATATTTCT 2100 

TGTATGATAT TTTGTAAAGC TCTCACCTGG TTCTTTTATG GGGACTTTTC GTTTTTGGGC 2160 

AACTCCAGTG TATTTATGTG AAACTTTATA AGAGAATTAA TTTTTCCATT TGCATATTAA 2220 

TATGTTCCTC CACACATGTA AAGGCACAGT GGCTCCGTGT GTTAAAAAAC AGCTGTATTT 2280 
TATGTATGCT TTACTGATAA GTGTGCCAAT AATAAACTGT GTTAATGACC 



Seq ID NO: 46 Protein sequence: 
Protein Accession #: NP 001281 



1 11 21 31 41 51 

I I I I I I 

MSSTPHDPFY SSPFGPFYRR HTPYMVQPEY RIYEMNKRLQ SRTEDSDNLW WDAFATEFFE 60 

DDATLTLSFC LEDGPKRYTI GRTLIPRYFS TVFEGGVTDL YYILKHSKES YHNSSITVDC 120 

DQCTMVTQHG KPMFTKVCTE GRLILEFTFD DLMRIKTWHF TIRQYRELVP RSILAMHAQD 180 

PQVLDQLSKN ITRMGLTNFT LNYLRLCVIL EPMQELMSRH KTYNLSPRDC LKTCLFQKWQ 240 

RMVAPPAEPT RQPTTKRRKR KNSTSSTSNS SAGNNANSTG SKKKTTAANL SLSSQVPDVM 300 

WGEPTLMGG EFGDEDERLI TRLENTQYDA ANGMDDEEDF NNSPALGNNS PWNSKPPATQ 360 
ETKSENPPPQ ASQ 

Seq ID NO: 47 Nucleotide sequence: 
Nucleic Acid Accession #: NM_004126 

Coding sequence: 108.. 329 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGCACGAGCT CGTGCCGGCC TTCAGTTGTT TCGGGACGCG CCGAGCTTCG CCGCTCTTCC 60 

AGCGGCTCCG CTGCCAGAGC TAGCCCGAGC CCGGTTCTGG GGCGAAA ATG CCTGCCCTTC 120 

ACATCGAAGA TTTGCCAGAG AAGGAAAAAC TGAAAATGGA AGTTGAGCAG CTTCGCAAAG 180 

AAGTGAAGTT GCAGAGACAA CAAGTGTCTA AATGTTCTGA AGAAATAAAG AACTATATTG 240 

AAGAACGTTC TGGAGAGGAT CCTCTAGTAA AGGGAATTCC AGAAGACAAG AACCCCTTTA 300 

AAGAAAAAGG CAGCTGTGTT ATTTCATAAA TAACTTGGGA GAAACTGCAT CCTAAGTGGA 360 

AGAACTAGTT TGTTTTAGTT TTCCCAGATA AAACCAACAT GCTTTTTAAG GAAGGAAGAA 420 

TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAG CAT AT 480 

GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 

ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT 

Seq ID NO: 4 8 Protein sequence: 
Protein Accession #: NP_004117 

1 11 21 31 41 51 

I I I i I I 

MPALHIEDLP EKEKLKMEVE QLRKEVKLQR QQVSKCSEEI KNYIEERSGE DPLVKGIPED 60 
KNPFKEKGSC VIS 



Seq ID NO: 4 9 Nucleotide sequence: 
Nucleic Acid Accession #: XM_051896 

Coding sequence: 139.. 2388 (underlined sequences correspond to start and stop codons) 



203 



WO 02/079492 



PCT/US02/04915 



1 11 21 31 41 51 

I I I I I I 

GTTTTAAAGA CGCTAGAGTG CCAAAGAAGA CTTTGAAGTG TGAAAACATT TCCTGTAATT 60 

GAAACCAAAA TGTCATTTAT AGATCCTTAC C AG C AC ATT A TAGTGGAGCA CCAGTATTCC 120 

CACAAGTTTA CGGTAGTGGT GTTACGTGCC ACCAAAGTGA CAAAGGGGGC CTTTGGTGAC 180 

ATGCTTGATA CTCCAGATCC CTATGTGGAA CTTTTTATCT CTACAACCCC TGACAGCAGG 240 

AAGAGAACAA GACATTTCAA TAATGACATA AACCCTGTGT GGAATGAGAC CTTTGAATTT 300 

ATTTTGGATC CTAATCAGGA AAATGTTTTG GAGATTACGT TAATGGATGC CAATTATGTC 360 

ATGGATGAAA CTCTAGGGAC AGCAACATTT ACTGTATCTT CTATGAAGGT GGGAGAAAAG 420 

AAAGAAGTTC CTTTTATTTT CAACCAAGTC ACTGAAATGG TTCTAGAAAT GTCTCTTGAA 480 

GTTTGCTCAT GCCCAGACCT ACGATTTAGT ATGGCTCTGT GTGATCAGGA GAAGACTTTC 540 

AGACAACAGA GAAAAGAACA CATAAGGGAG AGCATGAAGA AACTCTTGGG TCCAAAGAAT 600 

AGTGAAGGAT TGCATTCTGC ACGTGATGTG CCTGTGGTAG CCATATTGGG TTCAGGTGGG 660 

GGTTTCCGAG CCATGGTGGG ATTCTCTGGT GTGATGAAGG CATTATACGA ATCAGGAATT 720 

CTGGATTGTG CTACCTACGT TGCTGGTCTT TCTGGCTCCA CCTGGTATAT GTCAACCTTG 780 

TATTCTCACC CTGATTTTCC AGAGAAAGGG CCAGAGGAGA TTAATGAAGA ACTAATGAAA 840 

AATGTTAGCC ACAATCCCCT TTTACTTCTC ACACCACAGA AAGTTAAAAG ATATGTTGAG 900 

TCTTTATGGA AGAAGAAAAG CTCTGGACAA CCTGTCACCT TTACTGATAT CTTTGGGATG 960 

TTAATAGGAG AAACACTAAT TCATAATAGA ATGAATACTA CTCTGAGCAG TTTGAAGGAA 1020 

AAAGTTAATA CTGCACAATG CCCTTTACCT CTTTTCACCT GTCTTCATGT CAAACCTGAC 1080 

GTTTCAGAGC TGATGTTTGC AGATTGGGTT GAATTTAGTC CATACGAAAT TGGCATGGCT 114 0 

AAATATGGTA CTTTTATGGC TCCCGACTTA TTTGGAAGCA AATTTTTTAT GGGAACAGTC 1200 

GTTAAGAAGT ATGAAGAAAA CCCCTTGCAT TTCTTAATGG GTGTCTGGGG CAGTGCCTTT 1260 

TCCATATTGT TCAACAGAGT TTTGGGCGTT TCTGGTTCAC AAAGCAGAGG CTCCACAATG 1320 

GAGGAAGAAT TAGAAAATAT TACCACAAAG CATATTGTGA GTAATGATAG CTCGGACAGT 1380 

GATGATGAAT CACACGAACC CAAAGGCACT GAAAATGAAG ATGCTGGAAG TG ACT AT C AA 1440 

AGTGATAATC AAGCAAGTTG GATTCATCGT ATGATAATGG CCTTGGTGAG TGATTCAGCT 1500 

TTATTCAATA CCAGAGAAGG ACGTGCTGGG AAGGTACACA ACTTCATGCT GGGCTTGAAT 1560 

CTCAATACAT CTTATCCACT GTCTCCTTTG AGTGACTTTG CCACACAGGA CTCCTTTGAT 1620 

GATGATGAAC TGGATGCAGC TGTAGCAGAT CCTGATGAAT TTGAGCGAAT ATATGAGCCT 1680 

CTGGATGTCA AAAGTAAAAA GATTCATGTA GTGGACAGTG GGCTCACATT TAACCTGCCG 1740 

TATCCCTTGA TACTGAGACC TCAGAGAGGG GTTGATCTCA TAATCTCCTT TGACTTTTCT 1800 

GCAAGGCCAA GTGACTCTAG TCCTCCGTTC AAGGAACTTC TACTTGCAGA AAAGTGGGCT 1860 

AAAATGAACA AGCTCCCCTT TCCAAAGATT GATCCTTATG TGTTTGATCG GGAAGGGCTG 192 0 

AAGGAGTGCT ATGTCTTTAA ACCCAAGAAT CCTGATATGG AGAAAGATTG CCCAACCATC 1980 

ATCCACTTTG TTCTGGCCAA CATCAACTTC AGAAAGTACA GGGCTCCAGG TGTTCCAAGG 2040 

GAAACTGAGG AAGAGAAAGA AATCGCTGAC TTTGATATTT TTGATGACCC AGAATCACCA 2100 

TTTTCAACCT TCAATTTTCA ATATCCAAAT CAAGCATTCA AAAGACTACA TGATCTTATG 2160 

CACTTCAATA CTCTGAACAA CATTGATGTG ATAAAAGAAG CCATGGTTGA AAGCATTGAA 2220 

TATAGAAGAC AGAATCCATC TCGTTGCTCT GTTTCCCTTA GTAATGTTGA GGCAAGAAGA 2280 

TTTTTCAACA AGGAGTTTCT AAGTAAACCC AAAGCATAGT TCATGTACTG GAAATGGCAG 2340 

CAGTTTCTGA TGCTGAGGCA GTTTGCAATC CCATGACAAC TGGATTTAAA AGTACAGTAC 2400 

AGATAGTCGT ACTGATCATG AGAGACTGGC TGATACTCAA AGTTGCAGTT ACTTAGCTGC 24 60 

ATGAGAATAA TACTATTATA AGTTAGGTTG ACAAATGATG TTGATTATGT AAGGATATAC 2520 

TTAGCTACAT TTTCAGTCAG TATGAACTTC CTGATACAAA TGTAGGGATA TATACTGTAT 2580 

TTTTAAACAT TTCTCACCAA CTTTCTTATG TGTGTTCTTT TTAAAAATTT TTTTTCTTTT 2640 

AAAATATTTA ACAGTTCAAT CTCAATAAGA CCTCGCATTA TGTATGAATG TTATTCACTG 2700 

ACTAGATTTA TTCATACCAT GAGACAACAC TATTTTTATT TATATATGCA TATATATACA 2760 
TACATGAAAT AAATACATCA ATATAAAAAT 

Seq ID NO: 50 Protein sequence ; 
Protein Accession #: XP_051896 

1 11 21 31 41 51 

| I I I I I 

MSPIDPYQHI IVEHQYSHKF TVWLRATKV TKGAFGDMLD TPDPYVELFI STTPDSRKRT 60 

RHFNNDINPV WNETFEFILD PNQENVLEIT LMDANYVMDE TLGTATFTVS SMKVGEKKEV 120 

PFIFNQVTEM VLEMSLEVCS CPDLRFSMAL CDQEKTFRQQ RKEHIRESMK KLLGPKNSEG 180 

LHSARDVPW AILGSGGGFR AMVGFSGVMK ALYESGILDC ATYVAGLSGS TWYMSTLYSH 240 

PDFPEKGPEE INEELMKNVS HNPLLLLTPQ KVKRYVESLW KKKSSGQPVT FTDIFGMLIG 300 

ETLIHNRMNT TLSSLKEKVN TAQCPLPLFT CLHVKPDVSE LMFADWVEFS PYEIGMAKYG 360 

TFMAPDLFGS KFFMGTWKK YEENPIjHFLM GVWGSAFSIL FNRVLGVSGS QSRGSTMEEE 420 

LENITTKHIV SNDSSDSDDE SHEPKGTENE DAGSDYQSDN QASWIHRMIM ALVSDSALFN 4 80 

TREGRAGKVH NFMLGLNLNT SYPLSPLSDF ATQDSFDDDE LDAAVADPDE FERIYEPLDV 540 

KSKKIHWDS GLTFNLPYPL ILRPQRGVDL IISFDFSARP SDSSPPFKEL LLAEKWAKMN 600 

KLPFPKIDPY VFDREGLKEC YVFKPKNPDM EKDCPTIIHF VLANINFRKY KAPGVPRETE 660 

EEKEIADFDI FDDPESPFST FNFQYPNQAF KRLHDLMHFN TLNNIDVIKE AMVESIEYRR 720 
QNPSRCSVSL SNVEARRFFN KEFLSKPKA 



Seq ID NO: 51 Nucleotide sequence: 
Nucleic Acid Accession #: NM_00652S 

Coding sequence: 57 . , 764 (underlined sequences correspond to start and stop codons) 
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10 



15 



20 



25 



30 



GCCGCCAGCG 
ACCCCGCTCG 
GCGATGCTGC 
ACGGACCCTG 
GCCAGTTCCT 
GCGACGATGC 
TGGACGACCA 
GTGAAAAATT 
AAGCTACTTG 
AAGATGAGGG 
CCTGTGATGC 
AGGATTGCAA 
GCTTTGCCAG 
ATCTTGTTTG 
GCATGAGGAA 
TTCAAAAATT 
TTTAATTTAT 
AAATATGACT 
AAACAACATA 
CC 



11 

I 

GCTTTCTCGG 
CCCCCTGGGG 
TCAGGAGCCA 
CCGGGCCCTA 
GTACGGGGGC 
TTGCTGGAGG 
GTGTGAGGGG 
CTTTTCCGGT 
TATGGGCTTC 
ACTGTGCTCT 
TTTCACCTAT 
ACGTGCATGT 
TAGAATCCGG 
TCTTTATGGC 
ACAAATCATT 
TGGATTTTTT 
GGTTCAACTG 
CACTCATTTC 
AGACAATATA 



21 

I 

ACGCCTTGCC 
CTGTCGATTC 
ACAGGAAATA 
CTTCTCCGTT 
TGCGAGGGCA 
ATAGAAAAAG 
TCCACAGAAA 
GGGTGTCACC 
TGCGCACCAA 
GCCAATGTGA 
ACTGGCTGTG 
GCAAAAGCTT 
AAAATTCGGA 
TTATTTGCCT 
GGTGATTTAT 
TATATATAAC 
TTTGTGAGAC 
TTGGGGTCGT 
ATCATGTGCT 



31 

I 

CAGCGGGCCG 
TGCTGCTTTT 
ACGCGGAGAT 
ACTACTACGA 
ACGCCAACAA 
TTCCCAAAGT 
AGTATTTCTT 
GGAACCGGAT 
AGAAAATTCC 
CTCGCTATTA 
GAGGGAATGA 
TGAAAAAGAA 
AGAAGCAATT 
TTATGGTTGT 
TCACCAGTTT 
TAGCTGCTAT 
GAATTCTTGC 
ATTCCTGATT 
TTTAACATAT 



41 

I 

CCCGACCCCC 
CCTGACGGAG 
CTGTCTCCTG 
CAGGTACACG 
TTTCTACACC 
TTGCCGGCTG 
TAATCTAAGT 
TGAGAACAGG 
ATCATTTTGC 
TTTTAATCCA 
CAATAACTTT 
AAAGAAGATG 
TTAA ACATTC 
ATCTGAAGAA 
TTATTAATAC 
TCAAATGTGA 
AATGCATAAG 
TCAGAAGAGG 
TTGAGAATAA 



51 



TGCACCATGG 
GCTGCACTGG 
CCCCTAGACT 
CAGAGCTGCC 
TGGGAGGGTT 
CAAGTGAGTG 
TCCATGACAT 
TTTCCAGATG 
TACAGTCCAA 
AGATACAGAA 
GTTAGCAGGG 
CCAAAGCTTC 
TTAATATGTC 
TAATATGACA 
AAGTCACTTT 
GTCTACCATT 
ATATAAAAGC 
ATCATAACTG 
AAAGGACTAG 



Seq ID NO: 52 Protein sequence: 
Protein Accession #: NP_006519 



31 



41 



51 



1 11 21 

I I I I I I 

MDPARPLGLS ILLLPLTEAA LGDAAQEPTG NNAEICLLPL DYGPCRALLL RYYYDRYTQS 

CRQFLYGGCE GNANNFYTWE ACDDACWRIE KVPKVCRLQV SVDDQCEGST EKYFFNLSSM 

TCEKFFSGGC HRNRIENRFP DEATCMGFCA PKKIPSFCYS PKDEGLCSAN VTRYYFNPRY 

RTCDAFTYTG CGGNDNNFVS REDCKRACAK ALKKKKKMPK LRFASRIRKI RKKQF 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 



60 
120 
180 



35 Seq ID NO: 53 Nucleotide sequence: 
Nucleic Acid Accession #: AA478778 
Coding sequence: no ORF found 



40 



45 



50 



55 



60 



TATTTTTGTA 
TGATCCTTCA 
ACTTTTTTAA 
GTATAAAAGA 
TCACAGATTG 
GGAAAAAAAT 
GAAATATAAA 
CACTTGGATG 
AGACTTAGAC 
GTGCCTTGGT 
TATTGGGAAA 
TTACCAGTAG 
CTGGAGAAAT 
GCTGGTGATG 
CTGCAGTAAT 
TTCCGATTGC 
CATGGTCTTT 
ACACTTGGTT 
CAGTATTCTC 
TGAATAAGTG 
CACCACACCC 



11 

I 

CGTAAAATGA 
TTATCACGGT 
AATGAATTTT 
TATTTTTGGC 
TACCAACTAT 
ATGCTGCCTT 
CACTTTTAAT 
AAATAAGACC 
TTTATCCTTA 
CTCTCCACAA 
GTGAGATCCT 
AAAGACACAG 
TCAGAACCAG 
TGACTTCTCT 
GGACGTTTGT 
TCATTAATTC 
CTGCCCCTCC 
TGAGAAACCC 
CAACTCCAAA 
TTATTCTCCA 
AAAAAAAAAA 



21 
i 

TTCTATTATG 
ACACTATTGT 
TTTAAAACAA 
ATTTCTAGGC 
TAACTATGTT 
GGTGCTAATA 
GAAAGGGAGG 
AGCTCTTTAC 
TTGTTGTTAG 
TCAAATGGAG 
CTCACCATTT 
GATGCACAGA 
GTTCTGAATC 
TCAGGCCATG 
GTGAAGAAAT 
ACTTTTTTGT 
AAGCTGATGA 
TGCCCACTTC 
CAAGCTCTAG 
TTATTAATGT 
AAAAAAAAAA 



31 
I 

ACTGCCTTTG 
TTACTTTTCA 
TCTAGCCATC 
AAGTATCAGC 
AAATAAGTAT 
TTGTATGTAT 
AACGGAAGGA 
CCTTATTTTT 
TGTTGTTAAT 
GATCCCCCAA 
TGCCAAGATA 
ATGGGCATGA 
ATCACGATTG 
AG CCTAAC A Y 
GAACTGTGGA 
TACTTCTTTC 
AGGGAAGCCT 
CAAAGACCAA 
AGTGCTCCAG 
GTTCTGAAAA 
AAAA 



41 



CATGTAGTAA 
TCTGTAAATG 
ATCAAGGTGC 
CAATAAGTAT 
TCAGTTTCAT 
TTAAATGATC 
CAATTTCCAG 
GGATATGCCT 
ATTCGTTGCT 
GCAGCTTCAT 
CTCTAAAATG 
CCTTCAGCTC 
CCTTTTGCAT 
CCTGCCGGTT 
GTACAAAATG 
CAAAATGGAA 
TTGCCAATGG 
AGAGATTAGG 
GAAAAGTTAT 
TATATTATGA 



51 
I 

TATGACAAAG 6 0 

TTTTATTGTT 120 

TATAAGAGTT 180 

GTTAGTGATA 24 0 

GTGATCTCTG 300 

ATCTGACTCA 360 

TGCACAGAAT 420 

TTTTTGGAAG 480 

TCAGCCCACG 54 0 

TACAGAGTGA 600 

ACATCCAAGT 660 

ACGAGCACAC 72 0 

GAAAACATCG 780 

TTCATGCCCG 84 0 

CTTTGAGTCT 900 

GTGCTGAAGC 96 0 

CCCATGGAAG 1020 

AAAAGCCTGG 1080 

ATTCAGTATA 1140 

ATAAATACAT 1200 



65 Seq ID NO: 54 Nucleotide sequence: 
Nucleic Acid Accession #: NM_020663 

Coding sequence: 1..645 (underlined sequences correspond to start and stop codons) 



70 



75 



ATG AACTGCA 
ATGTTGAAGT 
TACGCCAACG 
ACTGTGACTG 
GACTACAACC 



11 
I 

AAGAGGGAAC 
GTGTGGTGGT 
ACGCCTTCCC 
TGGGAGGCAA 
AGCTGAGGCC 



21 
I 

TGACAGCAGC 
GGGGGACGGT 
AGAGGAATAC 
GCAACACTTG 
ACTCTCCTAC 



31 
I 

TGCGGCTGCA 
GCCGTGGGGA 
GTGCCCACTG 
CTCGGACTGT 
CCCAACACGG 



41 
I 

GGGGCAACGA 
AAACCTGCCT 
TGTTTGACCA 
ATGACACCGC 
ATGTGTTTTT 



51 
I 

CGAGAAGAAG 60 

GCTGATGAGC 120 

CTATGCAGTT 18 0 

GGGACAGGAG 24 0 

GATCTGCTTC 300 



205 



WO 02/079492 



PCT/US02/04915 



TCTGTCGTAA ACCCTGCCTC TTACCACAAT GTCCAGGAGG AATGGGTCCC CGAGCTCAAG 360 

GACTGCATGC CTCACGTGCC TTATGTCCTC ATAGGGACCC AGATTGATCT CCGTGATGAC 420 

CCAAAAACCT TGGCCCGTTT GCTGTATATG AAAGAGAAAC CTCTCACTTA CGAGCATGGT 480 

GTGAAGCTCG CAAAAGCGAT CGGAGCACAG TGCTACTTGG AATGTTCAGC TCTGACTCAG 540 

AAAGGTCTCA AAGCGGTTTT TGATGAAGCA ATCCTCACCA TTTTCCACCC CAAGAAAAAG 600 
AAGAAACGCT GTTCTGAGGG TCACAGCTGC TGTTCAATTA TCTGA 



Seq ID NO: 55 Protein sequence: 
Protein Accession #: NP 065714 



1 11 21 31 41 51 

I I I I I I 

MNCKEGTDSS CGCRGNDEKK MLKCWVGDG AVGKTCLLMS YANDAFPEEY VPTVFDHYAV 60 
TVTVGGKQHL LGLYDTAGQE DYNQLRPIjSY PNTDVFLICF SWNPASYHN VQEEWVPELK 120 
DCMPHVPYVL IGTQIDLRDD PKTLARLLYM KEKPLTYEHG VKLAKAIGAQ CYLECSALTQ 180 
KGLKAVFDEA ILTIFHPKKK KKRCSEGHSC CSII 



Seq ID NO: 56 Nucleotide sequence: 

Nucleic Acid Accession #: fgenesh prediction 

Coding sequence: 1-546 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ATGGCCTTGG GCAGCTCCGC CCCTGTGGCT TTGCAGGGTA ATGCCCACTT CCCTGCTGCT 60 

TTCATGGCTG GCATTAAGTG TCTGTGGCTT TTCCAGGTAG TCCCCCTGGG GCTCCCCGAG 120 

TTGGTGCAAA GGCTCCTGGG TGGAGCTCGA ACTGAAACTC GCTTTGTGCC CGCAGCCCTG 180 

CAGCTCGCCG GTGCCCTCGA CCTGCCCGCT GGGTCCTGTG CCTTTGAAGA GAGCACTTGC 240 

GGCTTTGACT CCGTGTTGGC CTCTCTGCCG TGGATTTTAA ATGAGGAAGG CCAGCAACCT 300 

TTCTGGTCCT CAGGAGACAT GTCTGACTGG GACTACTGGG TTGGCTGGCG GAAGTTAATT 360 

CATTCTCCTC TGAGCACTCC AGGGTGGAGC AGGCAGGTTA GGCTCCAGTT GTTCCAGCTT 420 

CAGTTTGTCA AAGGCCAGAA CTTGGACGTA ACAGTGTACT GCAGGCTCCA GGGCAGTGAG 480 

AAACCCTTTG AAACTGGTTC CATGGTTCCA TTCACCTTCA TGTACTGGAT CCACCATGGA 54 0 
AAGTAG 



Seq ID NO: 57 Protein sequence: 

Protein Accession #: fgenesh prediction 



1 11 21 31 41 51 

I I I I l l 

MALGSSAPVA LQGNAHFPAA FMAGIKCLWL FQWPLGLPE LVQRLLGGAR TETRFVPAAL 60 
QLAGALDLPA GSCAFEESTC GFDSVLASLP WILNEEGQQP FWSSGDMSDW DYWVGWRKLI 12 0 
HSPLSTPGWS RQVRLQLFQL QFVKGQNLDV TVYCRLQGSE KPFETGSMVP FTFMYWIHHG 180 

K 



Seq ID NO: 58 Nucleotide sequence: 
Nucleic Acid Accession #: XM_050478 

Coding sequence: 27.. 4508 (underlined sequences correspond to start and stop codons) 
1 ~ 11 21 31 41 51 

I I I I I I 

CCGGCGGCGC CTGAGCCCAG CCGAGG ATG G AGAACCGGCC TGGGTCCTTC CAGTACGTCC 60 

CTGTGCAGCT GCAAGGGGGG GCACCCTGGG GCTTCACCCT TAAGGGGGGT CTGGAACACT 120 

GTGAGCCGCT CACAGTGTCT AAGATTGAAG ATGGAGGCAA GGCAGCTTTG TCCCAGAAGA 180 

TGAGGACTGG TGATGAGCTG GTGAATATCA ATGGCACTCC ATTATATGGC TCCCGCCAAG 240 

AGGCCCTCAT TCTCATCAAA GGCTCCTTCC GG ATTCTCAA GCTGATTGTC AGGAGGAGGA 300 

ACGCCCCTGT CAGTAGGCCG CACTCATGGC ATGTGGCCAA GCTGCTGGAG GGATGCCCTG 360 

AAGCAGCCAC CACCATGCAT TTCCCTTCTG AAGCCTTCAG CTTGTCCTGG CATTCTGGCT 420 

GCAACACAAG TGACGTGTGT GTGCAGTGGT GTCCACTCTC CCGGCATTGC AGCACCGAGA 480 

AAAGCAGCTC CATTGGCAGC ATGGAGAGCC TGGAGCAACC AGGCCAAGCC ACCTATGAGA 540 

GCCATCTGTT GCCTATTGAC CAGAACATGT ACCCTAACCA GCGTGACTCA GCCTACAGCT 600 

CCTTCTCGGC CAGCTCAAAT GCTTCTGACT GTGCCCTTTC CCTCAGGCCA GAGGAGCCAG 660 

CCTCTACAGA CTGCATCATG CAAGGCCCAG GGCCAACTAA GGCCCCCAGT GGCCGGCCTA 720 

ATGTGGCTGA GACCTCAGGA GGTAGTCGGC GCACCAATGG GGGCCACCTG ACCCCCAGCT 780 

CTCAGATGTC ATCCCGTCCA CAGGAGGGAT ACCAGTCAGG GCCCGCCAAA GCAGTCAGGG 840 

GCCCACCACA ACCTCCAGTG AGGCGGGACA GCCTTCAGGC CTCCAGAGCC CAACTCCTCA 900 

ATGGAGAGCA GCGCAGGGCA TCTGAGCCTG TGGTCCCCTT GCCACAGAAG GAGAAACTGA 960 

GCTTAGAGCC TGTGCTACCC GCAAGGAACC CTAATAGGTT CTGTTGCCTC AGTGGGCATG 1020 

ACCAAGTGAC AAGTGAGGGC CATCAGAACT GTGAGTTCAG TCAGCCTCCT GAATCCAGCC 1080 

AACAGGGCTC TGAGCATCTA CTGATGCAGG CCTCAACCAA AGCTGTTGGA TCCCCAAAAG 1140 

CCTGTGACAG AGCTTCCAGC GTGGATTCCA ACCCACTCAA TGAGGCTTCT GCAGAGCTAG 1200 

CTAAGGCTTC TTTTGGCAGA CCTCCACATC TCATAGGACC CACAGGGCAT CGCCATAGTG 1260 

CCCCTGAACA GCTGCTGGCA TCCCACCTGC AGCATGTGCA CCTTGATACC AGGGGCAGCA 1320 

AAGGGATGGA GCTCCCACCC GTACAGGATG GGCACCAGTG GACTCTGTCC CCTTTGCACA 1380 



206 



WO 02/079492 



PCT/US02/04915 



GCAGCCACAA AGGGAAGAAA AGTCCATGCC 
GCAAAGAAAG AAAGACCAGA CAAGTGGATG 
AAAGCAGTCC CCCACATGGA GAGGCTGATG 
CAAACAGAAC AAGCAGAGCA GCCAGTGAAT 
5 CCCTTGTTCA ACAAGCCACG GACTGTTCTT 
CAGGTGAAGA AGGGGACAGC GAGCCCAAGG 
GAGGGACCCG GGGCCGCTCG ATCCAAAACC 
TGCGTAATGA AATTCAGAGG AGGAAGGCCC 
AGCTGTGTGA CACTAAGGAG CCAGTGGAAG 

10 TCACTGCCTC TAACACATCT CTTCTATCTT 
AGCTCTTCAA CAAAAGCATG ATGCTCAGAG 
CTGAGAGCCA TGAATCTAGG ACAGGCTTAG 
GCCAGTCCTC TTTGGGCCTG AACACCTGGT 
CTGAGAAAGC ACATGCTCAC TGTGGAGTCC 

15 ATAATTCACA GCCACTTGTG GCAGCAGCCA 
AGGAATTGAA GGCTTCTACT GCTCAAGCTG 
ACAGAAGAAA GTTCTTTGAA GAGAGTAGCA 
TAACCACTCA TAGCAACAAG ACTTTTACCC 
AGCCAATGAG CTCCAGCTGT AGGGAATTGA 

20 CCGCAGACCA ACCATATCAT GCCACAGACC 
CAGAAACTCC CACTTACTCA GAATGTTTTG 
GTAAGCCACT ACACTGTGGT GATTTTGATT 
TTCAAGGAGC TCTAGTCCAT GATCCTTGCA 
TGCTAAAGAG AAATATGATG CCAAATTGCT 

25 TTCGGTGTTC AGTTTGCTAT CATAATCCTC 
CACCTGGCAA CACTTGGAAA CCCAGGAAGC 
GGAATCCAAT AACAGGAAAC AGGAAGACCA 
AGACTAGCTT TTCATGGGCA ACCCCTTTCC 
TGTCAAGCTA CCGAGCAATT TCTTCTCTTG 

30 AAAAATCAGA GGAAACTTCA GTTTATGAGG 
CACTGCGCAG CCGTGCCTTC TCAGAGAGTC 
CCTGGGGGCA GCATAGGAGG GAGCTCTTTA 
TCGGAGCCAG GAAGAAGGCC TTTCCTCCTC 
ACAGGCTCTT TCGTGCAGCC CAGCAGCAGA 

35 AGGAGGAGGA GGAGGAGGAA GAAGAAGAAG 
CAGAGGAGGA GGAAGAGGAG CTGCCACCCC 
GTGCTCTCAA TCCTGAGGAG GTCCTAGAGC 
AGGGCTCGAG ACAGGGTTCA CAAAGTGTCC 
CCAGTGATTT CTTGCCTCCA ATAAGGGGTC 

40 CCCCTTGCTA CTATGGCATT GGTGGGCTTT 
CCGCCAAACA AGAGTTTCAG CACTTTTCGC 
CTTACTCAGC TTATTACAAT ATTTCTGTGG 
ACCAACCTGA GATGGCAGAG ATTGGCCTAG 
AAAAAAAGAT ACAGCTTATC GAAAGCATCA 

45 AGCGAGGGCT GCTAGAGGAC ATCAATGCCA 
ACTTAAAAGC CGTCTGCAAA TCCAATGAAT 
TGGACAAAGT GGTCAACCTG TTGCTGTCAC 
CTCTGAACAG CATCGATTCA GAGGCCAACC 
AGCAGCTGAC GGGGCAGTTG GCAGATGCCA 

50 AGAAGTTGGT GTTTGGCATG GTCTCCCGCT 
AGCACTTTGT CAAGATGAAA TCTGCTCTCA 
TCAAGCTCGG GGAAGAGCAA CTCAAATGTC 
ATTTCTAATT CTACCAGCAC TCTGCCACAG 
TCAATCTTCT TTGTTAGCAG TTTCTCAGCA 

55 CCTCTACCCT GGATGTCTCT CACTACCCCT 
ACCCTGGGGA AGCCACAAGC TTCTACCCAA 
CACACTCTCC TTCCCACAGT TGCCAAGGGC 
TGCCTTCATT CTGCTTTGTA CTAGGACACC 
ATCATCAACA GCCTCTAAAG GCTCAGAGGG 

60 GGCTTGTGGC CAGCCATTTC T C AC AG AG AG 
CAGTTTCAGG GCCTCACCCA AGCTTTGCAG 
AAAAAATGCA AGCAAAGGTT GAGTACCCCC 
ATAGGCTCTA CCCTTACCTT TCCCAGCAGC 
CTGGCTAGTG TGACCCTCTT CCTGTCCTAA 

65 TTTCCTTTAC ATTGCTGGGG GTTACCGCAG 
CATTAATAGC TCTACTAAAA CTGACTTCTA 
TCTTATTGTT ATATTTTAAA TGGCCTTTTG 
TTTTCTTTTT TAACTAATAA GGCGAGAAGA 
AAGGAAAGCA TTTTCTGCAG ATCAGCCTGA 

70 ' TCTCGTGTTG CTCACAACTA CCTGCCTGGA 
TAAAACACAA GATCAAATGA ACAATCCGAA 
CATGGTGGCT CACGCCTGAA ATCCCAGCAC 
GTCAGGAGAT CAAGACCATC CTGGCTAACA 
AAAAATTAGC CAGGTGTGGT GGCACGCACC 

75 AGGAGAATTG CTTGAACCTG GAAGGCAGAG 
CTCCATCCTG GGCAACAGAG TGAGACTTTG 



CCCCTACAGG AGGAACCCAT GACCAGTCCA 144 0 

ACAGGTCTTT AGTTTTGGGA CACCAGAGCC 1500 

GACACCCCTC AGAAAAAGGT TTCCTGGACC 1560 

TGGCCAACCA GCAACCCTCT GCCTCTGGCT 1620 

CAACCACTAA AGCAGCTAGT GGCACAGAGG 1680 

AGTGCAGCCG GATGGGTGGT AGGCGAAGTG 1740 

GGCGGAAGAG TGAGCGTTTT GCTACCAATC 1800 

AGCTCCAGAA AAGCAAGGGT CCCTTGTCAC 1860 

AGACCCAGGA GCCCCCAGAA AGTCCTCCAC 1920 

CATGTAAAAA ACCTCCCAGC CCCAGAGACA 1980 

CTAGGTCTTC CGAGTGCCTC AGCCAAGCCC 2040 

AGGGACGAAT AAGCCCTGGC CAGAGGCCTG 2100 

GGAAAGCACC TGACCCATCC TCCTCAGACC 2160 

GTGGAGGTCA TTGGAGATGG TCTCCAGAGC 2220 

TGGAAGGCCC TTCCAACCCA GGTGACAACA 2280 

GGGAGGATGC CATCCTCTTG CCTTTTGCAG 234 0 

AATCCTTATC TACATCTCAT TTGCCAGGTT 2400 

AGAGACCAAA ACCTATAGAC CAAAACTTCC 2460 

GGCGCCATCC CATGGACCAA TCATATCATT 2520 

AATCATATCA TTCCATGTCA CCCCTTCAGT 2580 

CAAGCAAAGG TCTAGAAAAT TCCATGTGTT 2640 

ACCACAGGAC CTGCTCTTAC TCCTGCAGTG 2700 

TTTATTGTTC TGGGGAAATC TGCCCTGCCT 2760 

ACAACTGCCG GTGCCACCAC CACCAATGCA 2820 

AGCACAGTGC CCTCGAGGAC AGCAGCTTGG 2880 

TGACAGTGCA GGAATTTCCT GGGGACAAAT 2940 

GCCAGTCAGG GAGGGAAATG GCTCATTCCA 3000 

ATCCTTGCCT TGAGAACCCA GCACTGGACT 3060 

ACCTCCTTGG AGACTTCAAA CATGCTTTGA 3120 

AGGGGAGCTC CCTTGCCTCC ATGCCCCACC 3180 

ACATC AG CTT GGCGCCCCAA AGCACCCGGG 3240 

GCAAAGGTGA TGAGACCCAG TCGGATCTTC 3300 

CTCGCCCTCC TCCTCCCAAC TGGGAGAAGT 3360 

AGCAGCAACA GCAGCAGCAG AAGCAACAGG 3420 

AAGAGGAAGA GGAAGAGGAG GAGGAGGAGG 3480 

AGTATTTCAG TTCAGAAACC TCTGGTTCCT 3540 

AGCCACAACC CCTCAGCTTT GGCCACCTGG 3600 

CAGCAGAGCA AGAATCCTTT GCACTCCATT 3660 

ACTTGGGATC TCAACCTGAG CAGGCTCAGC 3720 

GGAGGACATC GGGACAGGAA GCCACTGAAT 3780 

CTCCTTCAGG GGCCCCAGGA ATCCCTACCT 3840 

CCAAGGCAGA GCTGCTGAAC AAACTGAAAG 3900 

GAGAGGAGGA AGTTGACCAT GAACTGGCTC 3960 

GCAGAAAACT TTCTGTCTTG CGGGAGGCCC 4020 

ATTCTGCCCT TGGGGAGGAG GTGGAGGCCA 4 080 

TTGAAAAGTA CCACTTGTTT GTTGGGGACC 4140 

TCTCTGGACG ACTGGCCCGG GTGGAGAATG 4200 

AGGAGAAGTT GGTACTGATA GAGAAGAAGC 4 260 

AGGAGCTGAA GGAGCACGTG GACCGCCGGG 4320 

ACCTGCCTCA GGACCAGCTC CAAGATTACC 4380 

TCATTGAACA GCGAGAGCTG GAGGAGAAGA 4440 

TCAGGGAGAG TCTACTCCTG GGGCCCAGCA 4500 

CATCCCTGCC CAGCCATGTG GGAAGTGCTT 4560 

AGTAGATAGC AATTAGCAGT TTGTTCCAGC 4620 

TCCCTAGCAG TGGTCCTAAC CAGCTAGGAG 4680 

GGGAGCTGCA GCAAGGTGTG ATCTTAGAAC 474 0 

AAGTACTTGC TGCACAGAGA ACCAAGGAAG 4800 

AAAGACATCA AGTACTCATC ACCCACCCAT 4 860 

AATCTGCCTT GCAGCTCTAC TCTGCCCCAG 4920 

CTGGCTGCCT TGAGGGCATT CACCTGGCAC 4980 

GGGAAAGCAC AGAGGGAGGA ATTACACTGA 5040 

AGGTGCCCCT TAGGAAGGAA CCAGGTTTAA 5100 

AAGTTCAGGG GAAGAGGCCT ACTCTTAGCC 5160 

GACTTTGGTC CTACCACCTC TTGTTTCATC 5220 

GTGCCTACCC CAGGGCTTCA CCATATGGGC 5280 

GATGTAGGTT TCATTATTGG GGGAGGGGGT 534 0 

ATTTTATTTA TTTTTATGTT TTGATTATTT 5400 

GGGAAGTTGG AGAGGGAAAA GTTAGCCCAG 5460 

ATCCACCGTG GCTAGGCATA TTCTTGCTCT 552 0 

TGAATTTAGG AAAGTTGCAG GATACAAGGT 5580 

AATGTTATTA AGAAAACAGT TCCGGCCGGG 564 0 

TTTGGGAGGC CGAGGCAGGT GGATCACGAG 5700 

CGGTGAAACC CTATCTCTAC TAAAAATACA 5760 

AGTAGTCCCA GCTACTCGGG AGGCTGAGGC 5820 

ATTGCAGTGA GCTGAGACCA CACCACTGCA 5880 

TCTCAAAAAG AAAGAAAGAA AGAAAGAAAG 5940 
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AAAGAAAGAA AGAAAAGAAA GAAAGAAAGA AAGAAAGAAA ACAGTTCCAT TTACAATAGC 6000 
ATC 

Seq ID NO: 59 Protein sequence: 
Protein Accession #: XP_050478 

1 11 21 31 41 51 

I I I I I I 

MENRPGSFQY VPVQLQGGAP WGPTLKGGLE HCEPLTVSKI EDGGKAALSQ KMRTGDELVN 60 

INGTPLYGSR QEALILIKGS FRILKLIVRR RNAPVSRPHS WHVAKLLEGC PEAATTMHFP 120 

SEAFSLSWHS GCNTSDVCVQ WCPLSRHCST EKSSSIGSME SLEQPGQATY ESHLLPIDQN 180 

MYPNQRDSAY SSFSASSNAS DCALSLRPEE PASTDCIMQG PGPTKAPSGR PNVAETSGGS 240 

RRTNGGHLTP SSQMSSRPQE GYQSGPAKAV RGPPQPPVRR DSLQASRAQIi LNGEQRRASE 300 

PWPLPQKEK LSLEPVLPAR NPNRFCCLSG HDQVTSEGHQ NCEFSQPPES SQQGSEHLLM 360 

QASTKAVGSP KACDRASSVD SNPLNEASAE LAKASFGRPP HLIGPTGHRH SAPEQLLASH 420 

LQHVHLDTRG SKGMELPPVQ DGHQWTLSPL HSSHKGKKSP CPPTGGTHDQ SSKERKTRQV 480 

DDRSLVLGHQ SQSSPPHGEA DGHPSEKGFL DPNRTSRAAS ELANQQPSAS GSLVQQATDC 540 

SSTTKAASGT EAGEEGDSEP KECSRMGGRR SGGTRGRSIQ NRRKSERFAT NLRNEIQRRK 600 

AQLQKSKGPL SQLCDTKEPV EETQEPPESP PLTASNTSLL SSCKKPPSPR DKLFNKSMML 660 

RARSSECLSQ APESHESRTG LEGRISPGQR PGQSSLGLNT WWKAPDPSSS DPEKAHAHCG 720 

VRGGHWRWSP EHNSQPLVAA AMEGPSNPGD NKELKASTAQ AGEDAILLPF ADRRKFFEES 780 

SKSLSTSHLP GLTTHSNKTF TQRPKPIDQN FQPMSSSCRE LRRHPMDQSY HSADQPYHAT 840 

DQSYHSMSPL QSETPTYSEC FASKGLENSM CCKPLHCGDF DYHRTCSYSC SVQGALVHDP 900 

CIYCSGEICP ALLKRNMMPN CYNCRCHHHQ CIRCSVCYHN PQHSALEDSS LAPGNTWKPR 960 

KLTVQEFPGD KWNPITGNRK TSQSGREMAH SKTSFSWATP FHPCLENPAL DLSSYRAISS 1020 

LDLLGDFKHA LKKSEETSVY EEGSSLASMP HPLRSRAFSE SHISLAPQST RAWGQHRREL 1080 

FSKGDETQSD LLGARKKAFP PPRPPPPNWE KYRLFRAAQQ QKQQQQQQKQ QEEEEEEEEE 1140 

EEEEEEEEEE EAEEEEEELP PQYFSSETSG SCALNPBEVL EQPQPLSFGH liEGSRQGSQS 1200 

VPAEQESFAL HSSDFLPPIR GHLGSQPEQA QPPCYYGIGG LWRTSGQEAT ESAKQEFQHF 1260 

SPPSGAPGIP TSYSAYYNIS VAKAELLNKL KDQPEMAEIG LGEEEVDHEL AQKKIQLIES 1320 

ISRKLSVLRE AQRGLIiEDIN ANSALGEEVE ANLKAVCKSN EFEKYHLFVG DLDKWNLLL 1380 

SLSGRLARVE NALNSIDSEA NQEKLVLIEK KQQLTGQLAD AKELKEHVDR REKLVFGMVS 1440 
RYLPQDQLQD YQHFVKMKSA LIIEQRELEE KIKLGEEQIiK CLRESLLLGP SNF ' 



Seq ID NO: 60 Nucleotide sequence: 
Nucleic Acid Accession #: NM_014705 

Coding sequence: 192.. 2489 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GGGAGAAGCT AGGAAAAAAT GTCTTTGAGC TGTGAGATGC TTGTATATTT TGAAAATATG 60 

ATTATATGCA TGTGTTTGTA TTTTATGACT TGGATAATCT GAAAATCAAT TTGCTTTGTC 120 

AATGCTTCCT GGATTAGAAT TCCACTATTT GGTCCCTATC CTAGTCTACT AAAGAAAATT 180 

GAGCGGGAAA C ATG GCGGGA AAGTGGCGTT TCATTAATTG CTACTGTAAC TCGTCTAATG 240 

GAGAGGTTGT TAGATTACAG AACTTCTATA AGACTGAACT GAACAAGGAG GAGATGTATA 300 

TACGCTACAT TCACAAACTC TATGATCTGC ATCTCAAAGC ACAGAACTTT ACAGAAGCTG 360 

CATATACCCT CCTCTTATAT GACGAGCTAC TGGAATGGTC TGATCGGCCC CTCAGGGAGT 42 0 

TCCTGACCTA CCCCATGCAA ACAGAATGGC AG CG CAAAG A GCACCTGCAC CTCACCATCA 4 80 

TCCAGAACTT TGACAGAGGC AAATGTTGGG AGAATGGCAT TATCTTGTGC CGGAAGATTG 540 

CAGAGCAGTA TGAGAGTTAT TATGACTACA GAAACCTGAG CAAGATGCGG ATGATGGAAG 600 

CCTCTTTGTA TGACAAAATT ATGGACCAGC AACGTCTTGA ACCAGAGTTC TTCAGAGTTG 660 

GATTTTATGG AAAAAAATTT CCATTTTTCT TAAGAAATAA GGAGTTTGTG TGTCGAGGGC 720 

A'TGACTACGA GAGGCTGGAA GCCTTCCAAC AGAGAATGCT GAACGAGTTC CCCCATGCCA 780 

TCGCCATGCA GCACGCCAAC CAGCCCGATG AGACCATCTT CCAGGCAGAA GCTCAGTATT 84 0 

TG CAG AT ATA TGCTGTGACT CCCATTCCAG AGAGCCAGGA GGTCCTGCAG AGAGAGGGTG 900 

TTCCGGACAA CATCAAAAGC TTCTATAAAG TGAATCACAT CTGGAAATTC CGCTATGACC 960 

GACCATTTCA CAAAGGCACA . AAAG AT AAAG AGAATGAATT CAAGAGTCTC TGGGTGGAGA 1020 

GAACGTCATT ATACTTGGTG C AG AGTTTG C CTGGCATCTC TCGCTGGTTT GAAGTGGAAA 1080 

AGCGTGAAGT GGTAGAAATG AGTCCTCTGG AAAATGCAAT TGAAGTGCTA GAAAATAAGA 114 0 

ATCAGCAGCT GAAGACTCTG ATTAGTCAGT GTCAGACAAG ACAGATGCAG AATATTAATC 1200 

CCCTGACTAT GTGCCTGAAT GGAGTTATAG ATGCTGCAGT TAATGGTGGC GTTTCCAGGT 1260 

ATCAAGAGGC ATTCTTTGTC AAAGAATATA TCTTAAGTCA CCCTGAAGAT GGGGAGAAAA 1320 

TTGCACGATT AAGAGAGCTG ATGCTTGAGC AGGCACAGAT TCTGGAATTT GGTTTGGCCG 1380 

TGCATGAGAA GTTTGTACCT CAAGATATGA GACCCCTTCA CAAAAAGCTG GTTGACCAAT 144 0 

TCTTTGTGAT GAAGTCGAGC TTAGGGATAC AGGAGTTCTC TGCTTGTATG CAAGCCAGTC 1500 

CTGTCCATTT TCCTAATGGA AGCCCTCGTG TGTGTAGAAA CTCAGCACCT GCTTCTGTGA 1560 

GCCCAGATGG TACCAGGGTA ATTCCTAGAC GCAGCCCGTT AAGTTACCCA GCTGTCAACC 1620 

GATATTCTTC CTCCTCACTG TCCTCACAAG CTTCTGCTGA AGTAAGCAAT ATTACAGGGC 1680 

AAT CAG AAAG CTCTGATGAA GTCTTTAACA TGCAGCCAAG TCCATCTACC TCAAGCTTGA 1740 

GTTCTACTCA CTCGGCTTCA CCTAATGTGA CAAGTTCTGC TCCATCGAGT GCCAGAGCTT 1800 

CTCCTTTGTT GTCTGACAAA CACAAACATT CC CGAGAAAA CTCTTGCCTG TCACCAAGAG I860 

AG AG AC C ATG CAGTGCCATC TATCCAACAC CTGTGGAGCC TTCGCAGAGG ATGCTGTTTA 1920 

ATCATATTGG AGACGGGGCC TTGCCACGCA GTGACCCAAA TCTCTCTGCA CCTGAAAAAG 1980 

CTTCACCAGC AAGACACACG ACATCAGTAT CCCCCTCGCC TGCCGGGCGA TCTCCATTGA 204 0 
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10 



15 



20 



25 



30 



35 



40 



45 



AGGGCTCTGT 
CCAACTCCCC 
GCACGTCGGA 
TGCCAGTGCC 
AGACTCCGCC 
ACAGCCTCTC 
CGCGATCCAG 
CCCTGCCCCG 
TTGCCCGTTT 
CTTACTCAGC 
ATATTAAAAG 
AAAACGTTTC 
GTTCACTTTT 
TGTGTGACAA 
TCAGTGCACA 
TTTATCAGTG 
GGGGTTGCAA 
GTTGCTTTTG 
CTAAATCCCT 
TGCTTCAGAG 
ATTTTGCGCT 
TGCTGAAGTA 
TACAGACAGT 
TTAGCACGAT 
CACAAAATAA 
CCTGTCGTCT 
ATAGCATCTT 
CTTGTATGTA 
TCAATTGACA 
TGTTGGGAAG 
AGTGACCACT 
TGGTTATAGA 
TAATAAGAGA 
CAGTATGATT 
ATGTGCTTTT 
ATATGTTAAA 
TTTAGCTTTG 
GATAACACTT 
TATTTCTGTT 
AACAAAGCTT 
GGGCACATTA 
AGTGAGCAGT 
AGCAGAACAT 
TAAATATTAA 



GCAGTCTTTC 

TGTCTTGTCG 

AACCTCAGGC 

GGTGCCCGTG 

CCCGTACAGC 

CATCCCCGTC 

CCACCTGGAG 

CAAGGTCTCT 

ACAAAATAAG 

TCCTTCGATG 

TTGCTGATCT 

TCATTTGGAA 

TTTTATATAG 

TGTTGTATCG 

TCCTAACACA 

CAGATCATCA 

AGCTTGGGAC 

CTGTTTGACT 

TACTACCCTG 

TTCCAATCAG 

TCCTTTCTTG 

TTAATGAGGC 

ATTAAATTAA 

TAAATGGCAA 

AAAATTCCTC 

TTCTCTTGAA 

AGGTCTAGAT 

GTAATGTATT 

ATAGATATGA 

CTCATTTTAG 

TTTTTATATT 

GGTTTTCTAT 

TACCATTATG 

GTTATAATTA 

AACAATTCTG 
TATTTATGTT 

TATCTTGCAA 
CGTGTTTGTA 
GCTTTAAAGA 
CCTTGATTTC 
TAAATCAGTG 
TGAATTTATC 
TTTAAGAGAT 
ATAAAATTAC 



ACCCCCTCTC 
GGCAGCTACA 
TTTGAAAATC 
CCGAGCTACG 
GTCTACGAGC 
ACGTCGGAGC 
AATGGGGCCC 
CAGTTATAAG 
AAGTATGATG 
AATGGAATTA 
AAAACGCCAG 
GTGGTAAATA 
TTTAATCTTA 
TTTTTACTGA 
GTGGTCCTTA 
GAATTAAAGT 
TGGAAATTGT 
GCTGTCTACA 
ACACCGTGGT 
CTAGATTAAG 
ATAGTTTCCT 
ACAAATGACT 
TAGCTTAAGT 
AAGGACTTAT 
ACGACTCTCC 
GGAGGATTGC 
AGGGATGCTA 
TTATATCTTT 
ACTGTATTTT 
TTTAACCATG 
CTCTTAATGA 
AATAAATGTT 
TGTAAAAAAA 
TGCCAAATAC 
CCATATTGAC 
TAGTGAAAGT 
GTTTTGCAGT 
ACCACATTCA 
AGTAAAACCT 
CTTTTCCTGT 
TTATTTGCTC 
TTGAATTTAT 
TCTGTTAGCC 
CAGATTAATC 



CAGTGGAGTA 
GCAGTGGGAT 
AGGTGAATGA 
GCGGGGAGGA 
GGACTCTGCG 
CGCCCGCGCT 
GGAGGACTGA 
TCACTTTTCT 
AGAAGACATT 
AAACTTGCTT 
ATGTTAAATG 
GTGATAAAGA 
AAACCAATAC 
ATACTTGATA 
TTTTAGAAGA 
TCAAGCAGGC 
TTTGTTCTTG 
TTCGTAAAAT 
ATCTACTGTA 
CAAGAGGCTC 
ATATAAAATT 
GTGCCCCATT 
GAAGAAAAAA 
AAAAGGCAAG 
ACTTTTACCA 
TGTAGACTTC 
ATGCCAGTTG 
GTTTTTTCTT 
AAATCATACT 
TTTGTTTTGT 
AACCATTCAG 
CAAGTATTTT 
AGTAAAAATA 
TTTACGTATG 
TTTACAATTT 
GTTCATAATT 
CAGAAATTTT 
TATATATATA 
TCCATTTAAA 
GTAATTTAAT 
TTGGAGCCAT 
CATGTGTGTG 
CACATGTTCA 
TT 



CCACTCGCCA 
TTCTTCTCTC 
ACAGTCGGCC 
GCCAGTGCGC 
GCGCCCCGTC 
GCCCCCCAAG 
CCCCGGCCCG 
ATGTACCTGC 
TAGTGTAGGC 
ATTAAATATC 
AAGTATGGCT 
CTCCTTTTGT 
GATATTGTCA 
CTTGGAGAAA 
CTTCTGTAAA 
GAGCAAGACA 
AAACAAAATA 
TCTATTTTGT 
TTTCTTTTCA 
CAGAAGAAAT 
TGTCATTGAA 
AGCAAGAATT 
AAAAACTTAG 
GGCATTAACT 
GTGGAGTTTG 
TCTAGCTTGA 
TAGAAGTGTG 
TTACTGACTG 
GTTAAATATT 
TGGTAGCTTA 
CAGGTATATG 
TGTATATAAC 
AACGCAAACA 
GAAAAAGAAT 
TGAATGTCGG 
GAGAAAAGGA 
TTGAACTAGC 
CATATATATG 
TAAGATGACA 
AGATTTGTTG 
TTTTTAAAAA 
TATTTCTGAA 
TGTTGGTTGC 



GGACTCATCT 2100 

AGCCGGTGCA 2160 

CCCCTGCCGG 2220 

AAGGAGAGCA 2280 

CCGCTACCTC 2340 

CCTCTGGCAG 2400 

CGGCCCAGGC 2460 

GATGCATTCT 2520 

ACTTTAATAA 2580 

ATGTTGCACA 2640 

GAATTTCATT 2700 

ACCTTTTTAT 2760 

AACGATACAA 2820 

GCTTATTAAG 2880 

TAAGGCAAGG 2940 

GTATACTTAA 3000 

CTTCTTTAAG 3060 

GAATTGGTAG 3120 

AGGTGCAATT 3180 

GTTTACTTGA 3240 

CAAGAGCAAA 3300 

CAGGAATCAA 3360 

TGAAAATGTA 3420 

TTCAGTCCTG 34 80 

TCTTAGCTGA 3540 

ATATTGCAAC 3600 

AAAAAAGCAC 3660 

TTTATAACAC 3720 

TTCCCTCTTT 3780 

CCTGGAAGGC 3840 

CTGTTGAGGC 3900 

TGGJTAATTT 3960 

GTTGTTGATG 4020 

ATTTGTACAT 4080 

AAAAATTAAT 4140 

ACATATGCAT 4200 

TTTTGCTTTT 4260 

TGAAGCTCCA 4320 

TGCATAAGAT 4380 

ACTAGTGCTT 4440 

AAATTTTGGC 4500 

GCAGCTACAT 4 560 

TGCTGAATGG 4620 



Seq ID NO: 61 Protein sequence: 
Protein Accession #: NP 055520 



50 | 



55 



60 



65 



MAGKVJRFINC 
LLYDELLEWS 
ESYYDYRNLS 
RLEAFQQRML 
IKSFYKVNHI 
VEMSPLENAI 
FFVKEYILSH 
KSSLGIQEFS 
SSLSSQASAE 
SDKHKHSREN 
RHTTSVSPSP 
TSGFENQVNE 
IPVTSEPPAL 



11 
I 

YCNSSNGEW 
DRPLREFLTY 
KMRMMEASLY 
NEFPHAIAMQ 
WKFRYDRPFH 
EVLENKNQQL 
PEDGEKIARL 
ACMQASPVHF 
VSNITGQSES 
SCLSPRERPC 
AGRSPLKGSV 
QSAPLPVPVP 
PPKPLAARSS 



21 



RLQNFYKTEL 
PMQTEWQRKE 
DKIMDQQRLE 
HANQPDETIF 
KGTKDKENEF 
KTLISQCQTR 
RELMLEQAQI 
PNGSPRVCRN 
SDEVFNMQPS 
SAIYPTPVEP 
QSFTPSPVEY 
VPVPSYGGEE 
HLENGARRTD 



31 
I 

NKEEMYIRYI 
HLHLTIIQNF 
PEFFRVGFYG 
QAEAQYLQIY 
KSLWVERTSL 
QMQNINPLTM 
LEFGLAVHEK 
SAPASVSPDG 
PSTSSLSSTH 
SQRMLFNHIG 
HSPGLISNSP 
PVRKESKTPP 
PGPRPRPLPR 



41 



HKLYDLHLKA 
DRGKCWENGI 
KKFPFFLRNK 
AVTPIPESQE 
YLVQSLPGIS 
CLNGVIDAAV 
FVPQDMRPLH 
TRVIPRRSPL 
SASPNVTSSA 
DGALPRSDPN 
VLSGSYSSGI 
PYSVYERTLR 
KVSQL 



51 

I 

QNFTEAAYTL 60 

IliCRKIAEQY 120 

EFVCRGHDYE 180 

VLQREGVPDN 240 

RWFEVEKREV 300 

NGGVSRYQEA 360 

KKLVDQFFVM 420 

SYPAVNRYSS 480 

PS SARAS PLL 54 0 

LSAPEKASPA 600 

SSLSRCSTSE 660 

RPVPLPHSLS 720 



Seq ID NO: 62 Nucleotide sequence: 

Nucleic Acid Accession #: fgenesh prediction 

Coding sequence: 1..2561 (underlined sequences correspond to start and stop codons) 



70 



75 



I 

ATGGACCGAG 
GAAAGGGAGA 
CGGGGGCTGA 
CGGCTGCCGG 
CTCAGGGTCC 



11 

I 

GCCAGGGTAA 
CTGGACGATC 
GGAACGCTCG 
GACAACCCGC 
CTGAATCTCC 



21 
I 

GAGGGGCCGC 
TGAAGCCGGA 
GAGGGGACTG 
CAGCAACCAG 
CAGTGTGGCA 



31 



GACGCCCGCA 
GAGGAGGAGG 
GGAGACGCGG 
GTACAGCTCT 
GAGAAAGTGA 



41 
! 

CTTGTTGCGG 
GAGAGAGGCG 
CGCTTATGCA 
CAGAGGTTCC 
AACTTGGTCA 



51 
I 

CGCCGGGCGG 60 

GGCGGTGGGG 120 

AAGGTGCCTT 180 

ACAGAGGAAG 240 

CCGATGCCTG 300 
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GAACTGCTGG 
TACCTCAGCG 
CAGCCCCTTC 
GGGCCCAGCT 
GACATGCCAG 
CCACGCCTGA 
GACTACTATG 
CACAATGGCT 
AGCTACCCTG 
ATGAGCAGCT 
CGACACCAGT. 
GCCTTCCTGC 
GAGGACCAGC 
CTGGATACCT 
CGTTTCACCC 
GAGGAGTGGC 
GAGGTCCCCA 
GAGAAGCAGA 
CGCCGGGAGA 
TCAATGAGCA 
ACACTGGCCG 
CTGAACGTGC 
CTGTATTTCC 
GGCTGTGAGG 
AACCGGCAGG 
GGGCTGCTGC 
GTGGATGTGG 
AGATCCTGCC 
ATGCAGGACG 
TGCAGTGAGA 
GCCAATCAAT 
GAAAAAGAGA 
CAGGAGAAGA 
CTGGAAGAAG 
GACCTGGAGC 
CCAGCCCTGG 
ACGCTCACCT 
CTTCTCAACC 



AGCAGCTGCT 
ATACCACCCT 
CAGCAAAGGA 
TCGTGGAATC 
AGGATGATGG 
GAAACGCGGC 
AAGAGGCCCT 
GCAGCCCCTC 
CAACCAGGGT 
CCTATGAGTC 
GGCCCTCAGA 
TGCGGAAAAA 
TCCTGTGTTA 
GCAGCATCAT 
AGGGGGCTAC 
TGAAGGTCAT 
GATCCCCAGT 
CCTCAGATTC 
CCTGTGATCA 
GGGCTGCGGG 
ATGACCTGCA 
TGGTGAACCA 
ACAAGGATCA 
TGGCCCCGGG 
AGGTGGCCAT 
TGGTGGAGAT 
AGACCTTAAC 
AGAATCAGTG 
AGGAGCCCGA 
AGTCCCATCG 
ACAAGTATGG 
AGCTGGAGAA 
GGGAACTGAA 
CCGTGGCCAC 
TGAAGCTGGT 
GGCTCTCCGT 
CCAATGGTGC 
CCAACACTAC 



CCCAGAGCTC 
GGAAAAGAAG 
GGTCTCCTAC 
CCTCTTTGAA 
GGAGCCCAGC 
CGACCTGCCT 
TCCTCTGGGA 
ACACTCGATT 
GAACGGCGAG 
CTACGATGAA 
GGAGGCCTCC 
GCGTTTCGGG 
CAAAAGCTCC 
CTACGTGCCC 
CGAGGTCTTG 
CCGAGAAGTG 
CCTCCTGTGC 
TGACAGCGTG 
CGGCAAAGGG 
CCGCAAGATC 
GACGTCCTCC 
GGGCTGGAAG 
CATGGACCTG 
CTTTGGGCCC 
CTTGGAGGCA 
GGGCTCCAGA 
CAG CATCGTC 
GCCTGAGCCC 
GCGCCCCACA 
TGTGGACCCG 
CAAGAACCGA 
AGAGAAAGAG 
GGAAGCCATT 
CCTGGAAGCT 
GGCTGTGAAG 
GAGCAGCAAG 
TCTCTCAGAG 
TGACATTTTG 



ACCGGGCTGC 
ATGGCCGTGG 
CTGTATGTGA 
GAATTTGACT 
AAAGGAGCCA 
CCACCGCTCC 
CCCGGCAAGT 
GTGGATGGCT 
CTTAAGAGCT 
GAGGAGGAGG 
ATGCACCTGG 
CAGTGGGCCA 
AAGGATCGGC 
AAGGACAGCC 
GTGCTGGCAC 
AGCAAGCCAG 
AAGTTGGACC 
GGTGTGGGTG 
AAGAAGAGCA 
ACCCGTATCA 
ACCGAGGAGG 
GAACGCTGGT 
CGAACCCATG 
CGACACCCAT 
AGCTGTTCAG 
GTCACTCCGG 
AGTGCTGGGC 
CGAGTCTATG 
GGGGCCCAGG 
CAGGTCAAAG 
GCCGAGGAGG 
ACGATTCGGA 
CGGAGCAGCC 
CAGTGTCGGG 
GAGCGCTTGC 
CCCAAGAGTG 
AGAACTTCTC 
GACCAGTAA 



TCAGCCTCCT 
CCTCCATCCT 
ACACAGCAGA 
GTGACCTGAG 
GCCCTGAGCT 
CCAACAAGCC 
CGCCTGAGTA 
ACTATGAGGA 
CCTATAATGA 
AAGGGAAGAG 
TGAGGGAATG 
AGCAGCTGAC 
AGCCACATCT 
GGCACAAGAG 
TGCAGAGCCG 
TTGGGGGAGC 
TGGACAAGAG 
ACAACTGTTC 
GCCTGGCAGA 
TTGGCTTCTC 
AGGTTCCCTG 
GCCGCCTGAA 
TGAACGCCAT 
TTGCCTTCAG 
AGGACATGGG 
AGGCGCTGCA 
GCAACTCCTT 
ATGATGTTCC 
TGAAGCGTCA 
TCAAACGCCA 
ATGCCCGGAG 
CAGAGCTGAT 
CAGGAGCAAA 
CAAAGGAGGA 
AGCAGTCCCT 
GGCAACTCTC 
TGACCTCATC 



GGACCACGAG 360 

GCAGAGCCTG 420 

CCTCCACTCG 480 

TGACCTTCGG 540 

AGCCAAGAGC 600 

TCCCCCTGAG 660 

CATCAGCTCC 720 

CGCAGACAGC 780 

CTCTGACGCA 840 

CCCGCAGCCC 900 

CAGGATATGT 960 

GGTCATCAGG 1020 

GAGGTTGGCA 1080 

GCACGAGCTG 1140 

AGAGCAGGCC 1200 

TGAGGGAGTG 1260 

GCTGTCCCAA 1320 

TACCCTTGGC 1380 

ACTGAAGGGC 1440 

CAAGAAGAAG 1500 

CTGTGGCTAC 1560 

GTGCAACACT 1620 

CGCCCTGCAA 1680 

GATCCTGCGC 1740 

TCGCTGGCTC 1800 

CTATGACTAC 1860 

CCTATATGCA 1920 

TT ATGAAAAG 1980 

CGCCTCCTCC 204 0 

CGCCTCCAGT 2100 

GTACTTGGTA 2160 

AGCACTGAGA 2220 

ATTAAAGGCT 2280 

GCGCCGGATT 234 0 

GGCAGGAGGG 2400 

TGAGGAAGAT 2460 

TACACCAGGG 2520 



Seg ID NO: 63 Protein sequence : 

Protein Accession #: fgenesh prediction 



I 

MDRGQGKRGR 
RLPGQPASNQ 
YLSDTTLEKK 
DMPEDDGEPS 
HNGCSPSHSI 
RHQWPSEEAS 
LDTCSIIYVP 
EVPRSPVLLC 
SMSRAAGRKI 
LYFHKDHMDL 
GLLLVEMGSR 
MQDEEPERPT 
EKEKLEKEKE 
DLELKLVAVK 
LLNPNTTDIL 



11 
I 

DARTCCGAGR 
VQLSEVPQRK 
MAVASILQSL 
KGASPELAKS 
VDGYYEDADS 
MHLVRECRIC 
KDSRHKRHEL 
KLDLDKRLSQ 
TRIIGFSKKK 
RTHVNAIALQ 
VTPEALHYDY 
GAQVKRHASS 
TIRTELIALR 
ERLQQSLAGG 



21 
I 

ERETGRSEAG 
LRVPESPSVA 
QPLPAKEVSY 
PRLRNAADLP 
SYPATRVNGE 
AFLLRKKRFG 
RFTQGATEVL 
EKQTSDSDSV 
TLADDLQTSS 
GCEVAPGFGP 
VDVETLTSIV 
CSEKSHRVDP 
QEKRELKEAI 
PALGLSVSSK 



31 • 
I 

EEEGERRAVG 
EKVKLGHRCL 
LYVNTADLHS 
PPLPNKPPPE 
LKSSYNDSDA 
QWAKQLTVIR 
VLALQSREQA 
GVGDNCSTLG 
TEEEVPCCGY 
RHPFAFRILR 
SAGRNSFLYA 
QVKVKRHASS 
RSSPGAKLKA 
PKSGQLSEED 



41 
I 

RGLRNARRGL 
ELLEQLLPEL 
GPSFVESLFE 
DYYEEALPLG 
MSSSYESYDE 
EDQLLCYKSS 
EEWLKVIREV 
RRETCDHGKG 
LNVLVNQGWK 
NRQEVAILEA 
RSCQNQWPEP 
ANQYKYGKNR 
LEEAVATLEA 
TLTSNGALSE 



51 
I 

GDAALMQRCL 60 

TGLLSLLDHE 12 0 

EFDCDLSDLR 180 

PGKSPEYISS 24 0 

EEEEGKS PQP 300 

KDRQPHLRIA 360 

SKPVGGAEGV 420 

KKSSLAELKG 480 

ERWCRLKCNT 540 

SCSEDMGRWL 600 

RVYDDVPYEK 660 

AEEDARRYLV 720 

QCRAKEERRI 780 

RTSLTSSTPG 84 0 



Seq ID NO: 64 Nucleotide sequence: 
Nucleic Acid Accession #: NM_004126.1 

Coding sequence: 10 8-129 (underlined sequences correspond to start and stop codons) 



I 

GGCACGAGCT 
AGCGGCTCCG 
ACATCGAAGA 
AAGTGAAGTT 
AAGAACGTTC 
AAGAAAAAGG 
AGAACTAGTT 



11 
I 

CGTGCCGGCC 
CTGCCAGAGC 
TTTGCCAGAG 
GCAGAGACAA 
TGGAGAGGAT 
CAGCTGTGTT 
TGTTTTAGTT 



21 
I 

TTCAGTTGTT 
TAGCCCGAGC 
AAGGAAAAAC 
CAAGTGTCTA 
CCTCTAGTAA 
ATTTCATAAA 
TTCCCAGATA 



31 
I 

TCGGGACGCG 
CCGGTTCTGG 
TGAAAATGGA 
AATGTTCTGA 
AGGGAATTCC 
TAACTTGGGA 
AAACCAACAT 



41 
I 

CCGAGCTTCG 
GGCGAAAATG 
AGTTGAGCAG 
AGAAATAAAG 
AGAAGACAAG 
GAAACTGCAT 
GCTTTTTAAG 



51 
I 

CCGCTCTTCC 60 

CCTGCCCTTC 120 

CTTCGCAAAG 180 

AACTATATTG 240 

AACCCCTTTA 300 

CCTAAGTGGA 360 

GAAGGAAGAA 42 0 



210 



WO 02/079492 



PCT/US02/049I5 



TGAAATTAAA AGGAGACTTT CTTAAGCACC ATATAGATAG GGTTATGTAT AAAAGCATAT 4 80 
GTGCTACTCA TCTTTGCTCA CTATGCAGTC TTTTTTAAGA GAGCAGAGAG TATCAGATGT 540 
ACAATTATGG AAATAAGAAC ATTACTTGAG CATGACACTT CTTTCAGTAT ATTGCTTGAT 600 
GCTTCAAATA AAGTTTTGTC TT 

5 

Seq ID NO: 65 Protein sequence: 
Protein Accession #: NP_004117 

10 1 11 ' 21 31 41 51 

I I I I I I 

MPAIiHIEDLP BKEKLKMEVB QLRKEVKLQR QQVSKCSEBI KNYIBERSGE DPLVKGIPED 60 
KNPFKEKGSC VIS 



15 



20 



Seq ID NO: 66 Nucleotide sequence: 
Nucleic Acid Accession #: NMJ)03842.I 

Coding sequence: 1-1236 (underlined sequences correspond to start and stop codons) 



11 21 31 41 51 

I I ! I I 

ATGGAACAAC GGGGACAGAA CGCCCCGGCC GCTTCGGGGG CCCGGAAAAG GCACGGCCCA 60 

GGACCCAGGG AGGCGCGGGG AGCCAGGCCT GGGCCCCGGG TCCCCAAGAC CCTTGTGCTC 120 

25 GTTGTCGCCG CGGTCCTGCT GTTGGTCTCA GCTGAGTCTG CTCTGATCAC CCAACAAGAC 180 

CTAGCTCCCC AGCAGAGAGC GGCCCCACAA CAAAAGAGGT CCAGCCCCTC AGAGGGATTG 240 

TGTCCACCTG GACACCATAT CTCAGAAGAC GGTAGAGATT GCATCTCCTG CAAATATGGA 300 

CAGGACTATA GCACTCACTG GAATGACCTC CTTTTCTGCT TGCGCTGCAC CAGGTGTGAT 360 

TCAGGTGAAG TGGAGCTAAG TCCCTGCACC ACGACCAGAA ACACAGTGTG TCAGTGCGAA 420 

30 GAAGGCACCT TCCGGGAAGA AGATTCTCCT GAGATGTGCC GGAAGTGCCG CACAGGGTGT 480 

CCCAGAGGGA TGGTCAAGGT CGGTGATTGT ACACCCTGGA GTGACATCGA ATGTGTCCAC 540 

AAAGAATCAG GCATCATCAT AGGAGTCACA GTTGCAGCCG TAGTCTTGAT TGTGGCTGTG 600 

TTTGTTTGCA AGTCTTTACT GTGGAAGAAA GTCCTTCCTT ACCTGAAAGG CATCTGCTCA '660 

GGTGGTGGTG GGGACCCTGA GCGTGTGGAC AGAAGCTCAC AACGACCTGG GGCTGAGGAC 720 

35f AATGTCCTCA ATGAGATCGT GAGTATCTTG CAGCCCACCC AGGTCCCTGA GCAGGAAATG 780 

GAAGTCCAGG AGCCAGCAGA GCCAACAGGT GTCAACATGT TGTCCCCCGG GGAGTCAGAG 840 

CATCTGCTGG AACCGGCAGA AGCTGAAAGG TCTCAGAGGA GGAGGCTGCT GGTTCCAGCA 900 

AATGAAGGTG ATCCCACTGA GACTCTGAGA CAGTGCTTCG ATGACTTTGC AGACTTGGTG 96 0 

CCCTTTGACT CCTGGGAGCC GCTCATGAGG AAGTTGGGCC TCATGGACAA TGAGATAAAG 1020 

40 GTGGCTAAAG CTGAGGCAGC GGGCCACAGG GACACCTTGT ACACGATGCT GATAAAGTGG 1080 

GTCAACAAAA CCGGGCGAGA TGCCTCTGTC CACACCCTGC TGGATGCCTT GGAGACGCTG 1140 

GGAGAGAGAC TTGCCAAGCA GAAGATTGAG GACCACTTGT TGAGCTCTGG AAAGTTCATG 1200 
TATCTAGAAG GTAATGCAGA CTCTGCCATG TCCTAA 

45 Seq ID NO: 67 Protein sequence: 

Protein Accession #: NP_003833.1 

1 11 21 31 41 51 

I I I I I I 

50 MEQRGQNAPA ASGARKRHGP GPREARGARP GPRVPKTLVL WAAVLLLVS AESALITQQD 60 

LAPQQRAAPQ QKRSSPSEGL CPPGHHISED GRDCISCKYG QDYSTHWNDL LFCLRCTRCD 120 

SGEVELSPCT TTRNTVCQCE EGTFREEDSP EMCRKCRTGC PRGMVKVGDC TPWSDIECVH 18 0 

KESGIIIGVT VAAWLIVAV PVCKSLLWKK VLPYLKGICS GGGGDPERVD RSSQRPGAED 24 0 

NVLNEIVSIL QPTQVPEQEM EVQEPAEPTG VNMLSPGESE HLLEPAEAER SQRRRLLVPA 300 

55 NEGDPTETLR QCFDDFADLV PFDSWEPLMR KLGLMDNEIK VAKAEAAGHR DTLYTMLIKW 360 

VNKTGRDASV HTLLDALETL GERLAKQKIE DHLLSSGKFM YLEGNADSAM S 

Seq ID NO: 68 Nucleotide sequence: 
Nucleic Acid Accession #: FGENESH predicted ORF 
60 Coding sequence: 361- 2220 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

GGCACCATCT GCTCCCTGCC CTGCCCAGAG GGCTTTCACG GACCCAACTG CTCCCAGGAA 6 0 

65 TGTCGCTGCC ACAACGGCGG CCTCTGTGAC CGATTCACTG GGCAGTGCCG CTGCGCTCCG 120 

GGTTACACTG GGGATCGGTG CCGGGAGGAG TGCCCGGTGG GCCGCTTTGG GCAGGACTGT 180 

GCTGAGACGT GCGACTGCGC CCCGGACGCC CGTTGCTTCC CGGCCAACGG CGCATGTCTG 24 0 

TGCGAACACG GCTTCACTGG GGACCGCTGC ACGGATCGCC TCTGCCCCGA CGGCTTCTAC 300 

GGTCTCAGCT GCCAGGCCCC CTGCACCTGC GACCGGGAGC ACAGCCTCAG CTGCCACCCG 360 

70 ATG AACGGGG AGTGCTCCTG CCTGCCGGGC TGGGCGGGCC TCCACTGCAA CGAGAGCTGC 420 

CCGCAGGACA CGCATGGGCC AGGGTGCCAG GAGCACTGTC TCTGCCTGCA CGGTGGCGTC 480 

TGCCAGGCTA CCAGCGGCCT CTGTCAGTGC GCGCCGGGTT ACACGGGCCC TCACTGTGCT 540 

AGTCTTTGTC CTCCTGACAC CTACGGTGTC AACTGTTCTG CACGCTGCTC ATGTGAAAAT 600 

GCCATCGCCT GCTCACCCAT CGACGGCGAG TGCGTCTGCA AGGAAGGTTG GCAGCGTGGT 660 

75 AACTGCTCTG TGCCCTGCCC ACCCGGAACC TGGGGCTTCA GTTGCAATGC CAGCTGCCAG 720 

TGTGCCCATG AGGCAGTCTG CAGCCCCCAA ACTGGAGCCT GTACCTGCAC CCCTGGGTGG 78 0 



211 



WO 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



CATGGGGCCC 
CGCTGTGACT 
GCTGGCTGGA 
TGTAGCAACA 
GTGTGTGCAC 
GGCAAACGCT 
ACCTGCTACT 
ACATTTGGTG 
GAGACTGGGG 
CAGGAGCCCT 
ATTGGCATTG 
CGGCACTGGC 
CTGGACGGCT 
AACCCCAGCT 
CCAGGCCCGC 
GATAACCACA 
CTGGACAGGG 
GGCCCATTCT 
CTGAGCAGTG 
CGGGAGAGCA 
CCTCAGTTCT 
TACGAGCAGC 
CCTCCGGGCC 
TATGACTTGC 



ACTGCCAGCT 
GTGACCACTC 
TGGGTGCCCG 
CCTGCACCTG 
CCGGATTCCG 
GTGTGCCCTG 
GCCTGGCTGG 
CTAACTGCTC 
CCTGTGTATG 
TTACTGTGAT 
CAGTGCTGGG 
AAAAAGGCAA 
CCGAGTATGT 
ACCACACCCT 
TCTTTGCCAG 
CCACCCTGCC 
GGAGCAGCCG 
ACAATAAAGG 
AGAAC CCATA 
GCTACATGGA 
GGGACAGCCA 
CCAGCCCCCT 
TACCCCCCGG 
CTCCAGTACG 



GCCCTGTCCG 
TGATGGCTGT 
CTGCCACCTG 
CAAGAATGGG 
GGGCCCCTCC 
CAAGTGCGCT 
CTGGACAGGC 
CCAGCCATGC 
TCCCCCAGGG 
GCCGACCACT 
GTCCCTTGTG 
GGAGCACCAC 
CATGCCAGAT 
GTCGCAGTGC 
CCTGCAGAAC 
TGCTGACTGG 
CCTGGACCGA 
GCTCATCTCT 
TGCCACCATC 
GATGAAAGGC 
GAGGCGGCGG 
GATCCATGAC 
CCACTATGAC 
GCATCCCCCA 



AAGGGGCAGT 
GACCCTGTTC 
TCCTGCCCTG 
GGCACCTGTC 
TGCCAGAGAT 
AACCACTCCT 
CCCGACTGCT 
CAGTGTGGTC 
CACAGTGGTG 
CCAGTAGCGT 
GTAGCCCTGG 
CACCTGGCTG 
GTCCCTCCGA 
TCCCCAAACC 
CCTGAGCGGC 
AAGCACCGCC 
AGCTACAGCT 
GAAGAGGAGC 
CGGGACCTGC 
CCTCCCTCAG 
CAACCCCAGC 
CGAGACTCTG 
TCACCCAAGA 
TCACCTCCAC 



TTGGAGAAGG 
ATGGACGCTG 
AGGGCTTATG 
TCCCTGAGAA 
CCTGTCAGCC 
TCTGCCACCC 
CCCAGCGCTG 
CTGGAGAAAA 
CACCTTGCAG 
ATAACTCGCT 
TGGCACTGTT 
TGGCTTACAG 
GCTACAGTCA 
CCCCACCCCC 
CAGGTGGGGC 
GGGAGCCCCC 
ATAGCTACAG 
TCGGGGCCAG 
CCAGCTTGCC 
GATCTCCCCC 
CACAGAGAGA 
TGGGCTCCCA 
ACAGCCACAT 
TTCGACGCCA 



TTGTGCCAGT 
TCAGTGCCAG 
GGGAGTCAAC 
TGGCAACTGC 
TGGCCGCTAT 
CTCGAACGGG 
CCCTCTGGGG 
GTGCCACCCA 
GATTGGAATC 
GGGTGCAGTG 
CATTGGCTAT 
CAGCGGGCGC 
CTACTACTCC 
TAACAAGGTT 
CCAAGGGCAT 
TCCAGGGCCT 
CAATGGCCCA 
TGTGGCTTCC 
AGGGGGCCCC 
CAGGCAGCCT 
CAGTGGCACC 
GCCCCCTCTG 
CCCTGGACAT 
GGACCGTTGA 



840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 



Seq ID NO: 69 Protein sequence: 

Protein Accession #: FGENESH prediction 



11 



GTICSLPCPE 
AETCDCAPDA 
MNGECSCLPG 
SLCPPDTYGV 
CAHEAVCSPQ 
AGWMGARCHL 
GKRCVPCKCA 
ETGACVCPPG 
RHWQKGKEHH 
PGPLFASLQN 
GPFYNKGLIS 
PQFWDSQRRR 
YDLPPVRHPP 



GFHGPNCSQE 
RCFPANGACL 
WAGLHCNESC 
NCSARCSCEN 
TGACTCTPGW 
SCPEGLWGVN 
NHSFCHPSNG 
HSGAPCRIGI 
HLAVAYSSGR 
PERPGGAQGH 
EEELGASVAS 
QPQPQRDSGT 
SPPLRRQDR 



21 
I 

CRCHNGGLCD 
CEHGFTGDRC 
PQDTHGPGCQ 
AIACSPIDGE 
HGAHCQLPCP 
CSNTCTCKNG 
TCYCLAGWTG 
QEPFTVMPTT 
LDGSEYVMPD 
DNHTTLPADW 
LSSENPYATI 
YEQPSPLIHD 



31 



RFTGQCRCAP 
TDRLCPDGFY 
EHCLCIiHGGV 
CVCKEGWQRG 
KGQFGEGCAS 
GTCLPENGNC 
PDCSQRCPLG 
PVAYNSLGAV 
VPPSYSHYYS 
KHRREPPPGP 
RDLPSLPGGP 
RDSVGSQPPL 



41 
I 

GYTGDRCREE 
GLSCQAPCTC 
CQATSGLCQC 
NCSVPCPPGT 
RCDCDHSDGC 
VCAPGFRGPS 
TFGANCSQPC 
IGIAVLGSLV 
NPSYHTLSQC 
LDRGSSRLDR 
RESSYMEMKG 
PPGLPPGHYD 



51 
I 

CPVGRFGQDC 
DREHSLSCHP 
APGYTGPHCA 
WGFSCNASCQ 
DPVHGRCQCQ 
CQRSCQPGRY 
QCGPGEKCHP 
VALVALFIGY 
SPNPPPPNKV 
SYSYSYSNGP 
PPSGSPPRQP 
SPKNSHIPGH 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 



Seq ID NO: 70 Nucleotide sequence: 
Nucleic Acid Accession #: NM_005458 

Coding sequence: 1..2826 (underlined sequences correspond to start and stop codons) 



11 



21 



75 



I 

ATG GCTTCCC 
GCGCGCCTGC 
GGCTGGGCGC 
CTCATGCCGC 
GTGGAACTGG 
CTGCGGCTCT 
ATAAAATACG 
ATCATTGCAG 
CCTGTTCTAG 
GCGGTGAATC 
CTGACGCAAG 
GGCGAGGACA 
AAAAAGCTGA 
GCAAAAGTGT 
ATTCCGGGCT 
CGCTGCCTCC 
CCCCTGAGCT 
GAGTACAACA 
GGCATCTGGG 
CGGCACCAGC 
AATGCCATGA 
GAGAGAATGG 
GAGTACAACG 
TCCGAACCAC 



CGCGGAGGTC 
TACTGCTACT 
GGGGCGCCCC 
TCACCAAGGA 
CCATCGAGCA 
ATGACACGGA 
GGCCGAACCA 
AGTCCCTCCA 
CCGATAAGAA 
C AG CCATTCT 
ACGTTCAGAG 
TTGAGATTTC 
AGGGGAATGA 
TCTGTTGTGC 
GGTACGAGCC 
GGAAGAATCT 
CCAAGCAGAT 
ACAAG CGGTC 
TCATCGCCAA 
GGATCCAGGA 
ACGAGACCAA 
GGACCATTAA 
CTGTGGCCGA 
CAAAAGACAA 



CGGGCAGCCA 
GCTGCTGCCG 
CCGGCCGCCG 
GGTGGCCAAG 
GATCCGCAAC 
GTGCGACAAC 
CTTGATGGTG 
AGGCTGGAAT 
AAAATACCCT 
GAAGTTGCTC 
GTTCTCTGAG 
AGACACCGAG 
TGTGCGGATC 
ATACGAGGAG 
TTCTTGGTGG 
GCTTGCTGCC 
CAAGACCATC 
AGGCGTGGGG 
GACACTGCAG 
CTTCAACTAC 
CTTCTTCGGG 
ATTTACTCAA 
CACACTGGAG 
GACCATCATC 



31 

I 

GGGCGGCCGC 
CTGCTGCTGC 
CCCAGCAGCC 
GGCAGCATCG 
GAGTCACTCC 
GCAAAAGGGT 
TTTGGAGGCG 
CTGGTGCAGC 
TATTTCTTTC 
AAGCACTACC 
GTGCGGAATG 
AGCTTCTCCA 
ATCCTTGGCC 
AACATGTATG 
GAGCAGGTGC 
ATGGAGGGCT 
TCAGGAAAGA 
CCCAGCAAGT 
AGGGCCATGG 
ACGGACCACA 
GTCACGGGTC 
TTTCAAGACA 
ATCATCAATG 
CTGGAGCAGC 



41 
I 

CGCCGCCGCC 
CTCTGGCGCC 
CGCCGCTCTC 
GGCGCGGTGT 
TGCGCCCCTA 
TGAAAGCCTT 
TCTGTCCATC 
TTTCTTTTGC 
GGACCGTCCC 
AGTGGAAGCG 
ACCTGACTGG 
ACGATCCCTG 
AGTTTGACCA 
GTAGTAAATA 
ACACGGAAGC 
ACATTGGCGT 
CTCCACAGCA 
TCCACGGGTA 
AGACACTGCA 
CGCTGGGCAG 
AAGTTGTATT 
GCAGGGAGGT 
ACACCATCAG 
TGCGGAAGAT 



51 
I 

ACCGCCGCCC 60 

CGGGGCCTGG 120 

CATCATGGGC 180 

GCTCCCCGCC 240 

CTTCCTCGAC 300 

CTACGATGCA 360 

CGTCACATCC 420 

TGCAACCACG 480 

ATCAGACAAT 540 

CGTGGGCACG 600 

AGTTCTGTAT 660 

TACCAGTGTC 720 

GAATATGGCA 780 

TCAGTGGATC 840 

CAACTCATCC 900 

GGATTTCGAG 960 

GTATGAGAGA 1020 

CGCCTACGAT 1080 

TGCCAGCAGC 1140 

GATCATCCTC 1200 

CCGGAATGGG 1260 

GAAGGTGGGA 1320 

GTTCCAAGGA 1380 

CTCCCTACCT 1440 



212 



WO 02/079492 



PCT/US02/04915 



10 



15 



20 



25 



CTCTACAGCA 
TTCTTCAACA 
AACCTTATCA 
GGATCCTTTG 
ACCGTGGGCT 
ATCTTCAAAA 
GTGGGGGGCA 
CTGCGAAGGA 
ATCCGCCCTC 
TATGCCTACA 
GTCAGCATCC 
ATCATGTGCA 
TTCTGCATCG 
GTGCCGAAGC 
CAGTTCACTC 
AACCAAGCCA 
AAGATCACAG 
GAAAAGACCA 
GGAAACTTCA 
AATCCCCAGC 
GATATAAACT 
CACGCCTACC 
CCCACCGCCA 
CTGTAA 



TCCTCTCTGC 
TCAAGAACCG 
TCCTTGGAGG 
TCTCTGAAAA 
ACACGACCGC 
ATGTGAAAAT 
TGCTGCTGAT 
CAGTGGAGAA 
TCCTGGAGCA 
AGGGACTTCT 
CCGCACTCAA 
TCATCGGGGC 
TGGCTCTGGT 
TCATCACCCT 
AGAATCAGAA 
GCACATCCCG 
AGCTGGATAA 
CCTACATTAA 
CTGAGAGCAC 
TACAGTGGAA 
CTCCAGAACA 
TCCCATCCAT 
GCCCCCGCCA 



CCTCACCATC 
GAATCAGAAG 
GATGCTCTCC 
GACCTTTGAA 
TTTTGGGGCC 
GAAGAAGAAG 
CGACCTGTGT 
GTACAGCATG 
CTGTGAGAAC 
CATGTTGTTC 
CGACAGCAAG 
CGCTGTCTCC 
CATCATCTTC 
GAGAACAAAC 
GAAAGAAGAT 
CCTGGAGGGC 
AGACTTGGAA 
ACAGAACCAC 
AGATGGAGGA 
CACAACAGAG 
CATCCAGCGT 
CGGAGGCGTG 
CAGACATGTG 



CTCGGGATGA 
CTCATAAAGA 
TATGCTTCCA 
ACACTTTGCA 
ATGTTTGCAA 
ATCATCAAGG 
ATCCTGATCT 
GAGCCGGACC 
ACCCATATGA 
GGTTGTTTCT 
TACATCGGGA 
TTCCTGACCC 
TGCAGCACCA 
CCAGATGCAG 
TCTAAAACGT 
CTACAGTCAG 
GAGGTCACCA 
TACCAAGAGC 
AAGGCCATTT 
CCCTCTCGAA 
CGGCTGTCCC 
GACGCCAGCT 
CCACCCTCCT 



TCATGGCCAG 
TGTCGAGTCC 
TATTTCTCTT 
CCGTCAGGAC 
AGACCTGGAG 
ACCAGAAACT 
GCTGGCAGGC 
CAGCAGGACG 
CCATCTGGCT 
TAGCTTGGGA 
TGAGTGTCTA 
GGGACCAGCC 
TCACCCTCTG 
CAACGCAGAA 
CCACCTCGGT 
AAAACCATCG 
TGCAGCTGCA 
TCAATGACAT 
TAAAAAATCA 
CATGCAAAGA 
TCCAGCTCCC 
GTGTCAGCCC 
TCCGAGTCAT 



TGCTTTTCTC 1500 

ATACATGAAC 1560 

TGGCCTTGAT 1620 

CTGGATTCTC 1680 

AGTCCACGCC 1740 

GCTTGTGATC 1800 

TGTGGACCCC 1860 

GGATATCTCC 1920 

TGGCATCGTC 1980 

GACCCGCAAC 2040 

CAACGTGGGG 2100 

CAATGTGCAG 2160 

CCTGGTATTC 2220 

CAGGCGATTC 2280 

CACCAGTGTG 2340 

CCTGCGAATG 24 00 

GGACACACCA 24 60 

CCTCAACCTG 2520 

CCTCGATCAA 2580 

TCCTATAGAA 264 0 

CATCCTCCAC 2700 

CTGCGTCAGC 2760 

GGTCTCGGGC 282 0 



Seq ID NO: 71 Protein sequence: 
Protein Accession #: NP 005449 



30 



35 



40 



45 



50 



55 



60 



65 



70 



ii 



MASPRRSGQP 
LMPLTKEVAK 
IKYGPNHLMV 
AVNPAILKLL 
KKLKGNDVRI 
RCLRKNLLAA 
GIWVIAKTLQ 
ERMGTIKFTQ 
LYSILSALTI 
GSFVSEKTFE 
VGGMLLIDLC 
YAYKGLLMLF 
FCIVALVIIF 
NQASTSRLEG 
GNFTESTDGG 
HAYLPSIGGV 



GRPPPPPPPP 
GSIGRGVLPA 
FGGVCPSVTS 
KHYQWKRVGT 
ILGQFDQNMA 
MEGYIGVDFE 
RAMETLHASS 
FQDSREVKVG 
LGMIMASAFL 
TLCTVRTWIL 
ILICWQAVDP 
GCFLAWETRN 
CSTITLCLVF 
LQSENHRLRM 
KAILKNHLDQ 
DASCVSPCVS 



21 
I 

ARLLLLLLLP 
VELAIEQIRN 
IIAESLQGWN 
LTQDVQRFSE 
AKVFCCAYEE 
PhSSKQIKTX 
RHQRIQDFNY 
EYNAVADTLE 
FFNIKNRNQK 
TVGYTTAFGA 
LRRTVEKYSM 
VSIPALNDSK 
VPKLITLRTN 
KITELDKDLE 
NPQLQWNTTE 
PTASPRHRHV 



31 
I 

LLLPLAPGAW 
ESLLRPYFLD 
LVQLSFAATT 
VRNDLTGVLY 
NMYGSKYQWI 
SGKTPQQYER 
TDHTLGRIIL 
IINDTIRFQG 
LIKMSSPYMN 
MFAKTWRVHA 
EPDPAGRDIS 
YIGMSVYNVG 
PDAATQNRRF 
EVTMQLQDTP 
PSRTCKDPIE 
PPSFRVMVSG 



41 
I 

GWARGAPRPP 
LRLYDTECDN 
PVLADKKKYP 
GEDIEISDTE 
IPGWYEPSWW 
EYNNKRSGVG 
NAMNETNFFG 
SEPPKDKTII 
NLI ILGGMLS 
I FKNVKMKKK 
IRPLLEHCEN 
IMCIIGAAVS 
QFTQNQKKED 
EKTTYIKQNH 
DINSPEHIQR 
L 



51 
I 

PSSPPLSIMG 60 

AKGLKAFYDA 12 0 

YFFRTVPSDN 180 

SFSNDPCTSV 24 0 

EQVHTEANSS 300 

PSKFHGYAYD 360 

VTGQWFRNG 42 0 

LEQLRKISLP 48 0 

YASIFLFGLD 54 0 

IIKDQK&LVI 600 

THMTIWLGIV 660 

FLTRDQPNVQ 72 0 

SKTSTSVTSV 780 

YQELNDILNL 84 0 

RLSLQLPILH 900 



Seq ID NO: 72 Nucleotide sequence; 
Nucleic Acid Accession #: NM_005795 

Coding sequence: 522-1940 {underlined sequences correspond to start and stop codons) 



75 



I 

GCACGAGGGA 
CAAGCTCTGC 
TTCCCACCTT 
TGAGAATATT 
AAGAAATTCT 
GACAATTGTG 
GAATAATAAA 
AAAGAAAACT 
ACAAGGTTGC 
ATTTGGGCTT 
TTATGATTCT 
TTACTAGAAA 
CCATTCAACA 
ACGATGTTGC 
ATCCATCAGA 
CAAGCAACAG 
AGACTGCACT 
TGCTTATCTC 
TACACAAAAA 
CTGCAGTGGC 
AGTTCATTCA 
ACCTACACAC 



11 
I 

ACAACCTCTC 
TAACTGAATC 
GCTTGTGGGT 
TCACAAAGAA 
TAAAGACAAT 
CATATCGTCT 
AACCCATACT 
ACTACAACTT 
TATAAAACAA 
AATGATGGAG 
TGTTACAGCA 
TAAAATCATG 
AGCAGAAGGC 
AGCAGGAACT 
AAAAGTTACA 
AACATGGACA 
AAATTTGTTT 
GCTTGGCATA 
TCTGTTCTTC 
CAACAACCAG 
TCTTTACCTG 
ACTCATTGTG 



21 



31 



41 



51 



I 



I 



I 



TCTCTSCAGC 
TCATCCTAAT 
AAATCTCTTC 
TTTCCTTAAG 
GTCAAATATG 
AATAATAAAA 
AGCCTATAGA 
GACAAGACTG 
GATTGCTACA 
AAAAAGTGTA 
GAATTAGAAG 
ACAGCTCAAT 
GTTTACTGCA 
GAATCAATGC 
AAGATCTGTG 
AATTATACCC 
TACCTGACCA 
TTCTTTTATT 
TCATTTGTTT 
GCCTTAGTAG 
ATGGGCTGTA 
GTGGCCGTGT 



AGAGAGTGTC 
TGCAGGATCA 
TGCGGAATCT 
AGCTGGACTG 
ATCCAAGAGA 
ACCCATACTA 
AAACAATATT 
CTGCAAACTT 
ACTTCTAGTT 
CCCTGTATTT 
AGAGTCCTGA 
ATGAATGTTA 
ACAGAACCTG 
AGCTCTGCCC 
ACCAAGATGG 
AGTGTAATGT 
TAATTGGACA 
TCAAGAGCCT 
GTAACTCTGT 
CCACAAATCC 
ATTACTTTTG 
TTGCAGAGAA 



ACCTCCTGCT 
CATTGCAAAG 
CAGAAAGTAA 
GGTCTTGACC 
AAATGTGATT 
GCCTATAGAA 
TGAAAGATTG 
CAATTGGTCA 
T ATG TTATAC 
TCTGGTTCTC 
GGACTCAATT 
CCAAAAGATT 
GGATGGATGG 
TGATTACTTT 
AAACTGGTTT 
TAACACCCAC 
CGGATTGTCT 
AAGTTGCCAA 
TGTAACAATC 
TGTTAGTTGC 
GATGCTCTGT 
GCAACATTTA 



TTAGGACCAT 
CTTTCACTCT 
AGTTCCATCC 
CCTGGAATTT 
TGAGTCTGGA 
AACAATATTT 
CTACCACTAA 
CCACAACTTG 
AGCATATTTC 
TTGCCTTTTT 
CAGTTGGGAG 
ATGCAAGACC 
CTCTGCTGGA 
CAGGACTTTG 
AGACATCCAG 
GAGAAAGTGA 
ATTGCATCAC 
AGGATTACCT 
ATTCACCTCA 
AAAGTGTCCC 
GAAGGCATTT 
ATGTGGTATT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
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ATTTTCTTGG CTGGGGATTT CCACTGATTC CTGCTTGTAT ACATGCCATT GCTAGAAGCT 13 80 

TATATTACAA TGACAATTGC TGGATCAGTT CTGATACCCA TCTCCTCTAC ATTATCCATG 1440 

GCCCAATTTG TGCTGCTTTA CTGGTGAATC TTTTTTTCTT GTTAAATATT GTACGCGTTC 15 00 

TCATCACCAA GTTAAAAGTT ACACACCAAG CGGAATCCAA TCTGTACATG AAAGCTGTGA 1560 

GAGCTACTCT TATCTTGGTG CCATTGCTTG GCATTGAATT TGTGCTGATT CCATGGCGAC 1620 

CTGAAGGAAA GATTGCAGAG GAGGTATATG ACTACATCAT GCACATCCTT ATGCACTTCC 1680 

AGGGTCTTTT GGTCTCTACC ATTTTCTGCT TCTTTAATGG AGAGGTTCAA GCAATTCTGA 1740 

GAAGAAACTG GAATCAATAC AAAATCCAAT TTGGAAACAG CTTTTCCAAC TCAGAAGCTC 1800 

TTCGTAGTGC GTCTTACACA GTGTCAACAA TCAGTGATGG TCCAGGTTAT AGTCATGACT 1860 

GTCCTAGTGA ACACTTAAAT GGAAAAAGCA TCCATGATAT TGAAAATGTT CTCTTAAAAC 1920 

CAGAAAATTT ATATAATTGA AAATAGAAGG ATGGTTGTCT CACTGTTTGG TGCTTCTCCT 1980 

AACTCAAGGA CTTGGACCCA TGACTCTGTA GCCAGAAGAC TTCAATATTA AATGACTTTG 2040 

GGGAATGTCA TAAAGAAGAG CCTTCACATG AAATTAGTAG TGTGTTGATA AGAGTGTAAC 2100 

ATCCAGCTCT ATGTGGGAAA AAAGAAATCC TGGTTTGTAA TGTTTGTCAG TAAATACTCC 2160 

CACTATGCCT GATGTGACGC TACTAACCTG ACATCACCAA GTGTGGAATT GGAGAAAAGC 2220 

ACAATCAACT TTTCTGAGCT GGTGTAAGCC AGTTCCAGCA CACCATTGAT GAATTCAAAC 2280 

AAATGGCTGT AAAACTAAAC ATACATGTTG GGCATGATTC TACCCTTATT CSCCCCAAGA 2340 

GACCTAGCTA AGGTCTATAA ACATGAAGGG AAAATTAGCT TTTAGTTTTA AAACTCTTTA 2400 

TCCCATCTTG ATTGGGGCAG TTGACTTTTT TTTTTTCCCA GAGTGCCGTA GTCCTTTTTG 2460 

TAACTACCCT CTCAAATGGA CAATACCAGA AGTGAATTAT CCCTGCTGGC TTTCTTTTCT 2520 

CTATGAAAAG CAACTGAGTA CAATTGTTAT GATCTACTCA TTTGCTGACA CATCAGTTAT 2580 

ATCTTGTGGC ATATCCATTG TGGAAACTGG ATGAACAGGA TGTATAATAT GCAATCTTAC 2640 

TTCTATATCA TTAGGAAAAC ATCTTAGTTG ATG CT AC AAA ACACCTTGTC AACCTCTTCC 2700 

TGTCTTACCA AACAGTGGGA GGGAATTCCT AGCTGTAAAT ATAAATTTTG TCCCTTCCAT 2760 

TTCTACTGTA TAAACAAATT AGCAATCATT TTATATAAAG AAAATCAATG AAGGATTTCT 2820 

TATTTTCTTG GAATTTTGTA AAAAGAAATT GTGAAAAATG AGCTTGTAAA TACTCCATTA 2880 

TTTTATTTTA TAGTCTCAAA TCAAATACAT ACAACCTATG TAATTTTTAA AGCAAATATA 2940 

TAATGCAACA ATGTGTGTAT GTTAATATCT GATACTGTAT CTGGGCTGAT TTTTTAAATA 3000 

AAATAGAGTC TGGAATGCTA TATTTGGTAA ATATTTTAAA GACAACCAGA TGCCAGCATC 3060 

AGAAGTCTGT TTGAGAACTA AGAGAACAGA AACATCTATC ATAAGATATA TTTATTTTAA 3120 

AAACACAAGG TCACTATTTT ACTGAATATA TTTGTTTTGA TAACTCATAC CTTAATAATA 3180 

GGTGTGTTTG ACATATTTCT TTTTTCATTT TGACAATGAA CTCACATTCT AATCCAGAAA 3240 

TTTTAAACAA CTACTGTGAT AAATACCAAT CTGCTACTTT TATAGATTTT ACCCCATTAA 3300 

AATATTACTT TACTGACTTT TACTATGTGA AGATATATAG CTTTGGAAAT GTCCCAGGCT 3360 

ATTCAAGAAA TATAAAAAAC TAGAAGGATA CTATATATAC CATATACAAT GCTTTAATAT 3420 

TTTAATAGAG CTACTGTATA TAATACAAAT TAGGGAAATA CTTGAATATA TCATTGAGAA 3480 

AAAATTATTG TCAGATCTTA CTGAATTATT GTCAGACTTT ATTAAATAAA GATAGAAGAA 3540 

AACCTTGCTA ATGAATTAAA GTGAAATTTG CATGGGATTC AGTTTCTCTA ATGTTATTTT 3 600 

CCGCTGAAAT CTCTAAAGAA CAAGAATGAC TTCAATTAGT AAAAGTCAAT TTTGGGAAAA 3660 

GTCATGGGTA TCTGTTTTTT AAGTGTGTCA ATCTGATTAA AATGGATGAA ACAAATTACT 3720 

CATCATAAGT TGTTTCTTAA GCTGTCAATA TGTCAATAGA TGGTGAGTTC AGAACTTATT 378 0 

TCAAATTGCT AAGACAAATT ATCTAAATTC GTAAGAATTA ACATATAGAA TGGTCTGGTC 3840 

AGTACATTTA TAATTTATCT ATGCATGAAA AAGTATTGTT TTGTTTGAAA CATGAATTTC 3900 
ATAGCAAGCT GCCATAGAAA GGA 

Seq ID NO: 73 Protein sequence : 
Protein Accession #: NM 005795 



1 11 21 31 41 51 

I I I I I I 

MLYSIFHLGL MMEKKCTLYF LVLLPFFMIL VTAELEESPE DSIQLGVTRN KIMTAQYECY 60 

QKIMQDPIQQ AEGVYCNRTW DGWLCWNDVA AGTESMQLCP DYFQDFDPSE KVTKICDQDG 120 

NWFRHPASNR TWTNYTQCNV NTHEKVKTAL NLFYLTIIGH GLSIASLLIS LGIFFYFKSL 180 

SCQRITLHKN LFFSFVCNSV VTIIHLTAVA NNQALVATNP VSCKVSQFIH LYLMGCNYFW 240 

MLCEGIYLHT LIWAVFAEK QHLMWYYFLG WGFPLIPACI HAIARSLYYN DNCWISSDTH 300 

LLYIIHGPIC AALLVNLFFL LNIVRVLITK LKVTHQAESN LYMKAVRATL ILVPLLGIEF 3 60 

VLIPWRPEGK IAEEVYDYIM HILMHFQGL.L VSTIFCFFNG EVQAI LRRNW NQYKIQFGNS 420 
FSNSEALRSA SYTVSTISDG PGYSHDCPSE HLNGKSIHDI ENVLLKPENL YN 

Seq ID NO: 74 Nucleotide sequence: 
Nucleic Acid Accession #: NM_000450.1 

Coding sequence: 117.. 1949 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I I I I I I 

CCTGAGACAG AGGCAGCAGT GATACCCACC TGAGAGATCC TGTGTTTGAA CAACTGCTTC 60 

CCAAAACGGA AAGTATTTCA AGCCTAAACC TTTGGGTGAA AAGAACTCTT GAAGTC ATG A 120 

TTGCTTCACA GTTTCTCTCA GCTCTCACTT TGGTGCTTCT CATTAAAGAG AGTGGAGCCT 180 

GGTCTTACAA CACCTCCACG GAAGCTATGA CTTATGATGA GGCCAGTGCT TATTGTCAGC 240 

AAAGGTACAC ACACCTGGTT GCAATTCAAA ACAAAGAAGA GATTGAGTAC CTAAACTCCA 300 

TATTGAGCTA TTCACCAAGT TATTACTGGA TTGGAATCAG AAAAGTCAAC AATGTGTGGG 360 

TCTGGGTAGG AACCCAGAAA CCTCTGACAG AAGAAGCCAA GAACTGGGCT CCAGGTGAAC 420 

CCAACAATAG GCAAAAAGAT GAGGACTGCG TGGAGATCTA CATCAAGAGA GAAAAAGATG 480 

TGGGCATGTG GAATGATGAG AGGTGCAGCA AGAAGAAGCT TGCCCTATGC TACACAGCTG 540 

CCTGTACCAA TACATCCTGC AGTGGCCACG GTGAATGTGT AGAGACCATC AATAATTACA 600 
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CTTGCAAGTG TGACCCTGGC TTCAGTGGAC TCAAGTGTGA GCAAATTGTG AACTGTACAG 660 

CCCTGGAATC CCCTGAGCAT GGAAGCCTGG TTTGCAGTCA CCCACTGGGA AACTTCAGCT 720 

ACAATTCTTC CTGCTCTATC AGCTGTGATA GGGGTTACCT GCCAAGCAGC ATGGAGACCA 780 

TGCAGTGTAT GTCCTCTGGA GAATGGAGTG CTCCTATTCC AGCCTGCAAT GTGGTTGAGT 840 

GTGATGCTGT GACAAATCCA GCCAATGGGT TCGTGGAATG TTTCCAAAAC CCTGGAAGCT 900 

TCCCATGGAA CACAACCTGT ACATTTGACT GTGAAGAAGG ATTTGAACTA ATGGGAGCCC 960 

AGAGCCTTCA GTGTACCTCA TCTGGGAATT GGGACAACGA GAAGCCAACG TGTAAAGCTG 1020 

TGACATGCAG GGCCGTCCGC CAGCCTCAGA ATGGCTCTGT GAGGTGCAGC CATTCCCCTG 1080 

CTGGAGAGTT CACCTTCAAA TCATCCTGCA ACTTCACCTG TGAGGAAGGC TTCATGTTGC 1140 

AGGGACCAGC CCAGGTTGAA TGCACCACTC AAGGGCAGTG GACACAGCAA ATCCCAGTTT 1200 

GTGAAGCTTT CCAGTGCACA GCCTTGTCCA ACCCCGAGCG AGGCTACATG AATTGTCTTC 1260 

CTAGTGCTTC TGGCAGTTTC CGTTATGGGT CCAGCTGTGA GTTCTCCTGT GAGCAGGGTT 1320 

TTGTGTTGAA GGGATCCAAA AGGCTCCAAT GTGGCCCCAC AGGGGAGTGG GACAACGAGA 1380 

AGCCCACATG TGAAGCTGTG AGATGCGATG CTGTCCACCA GCCCCCGAAG GGTTTGGTGA 1440 

GGTGTGCTCA TTCCCCTATT GGAGAATTCA CCTACAAGTC CTCTTGTGCC TTCAGCTGTG 1500 

AGGAGGGATT TGAATTATAT GGATCAACTC AACTTGAGTG CACATCTCAG GGACAATGGA 1560 

CAGAAGAGGT TCCTTCCTGC CAAGTGGTAA AATGTTCAAG CCTGGCAGTT CCGGGAAAGA 1620 

TCAACATGAG CTGCAGTGGG GAGCCCGTGT TTGGCACTGT GTGCAAGTTC GCCTGTCCTG 1680 

AAGGATGGAC GCTCAATGGC TCTGCAGCTC GGACATGTGG AGCCACAGGA CACTGGTCTG 1740 

GCCTGCTACC TACCTGTGAA GCTCCCACTG AGTCCAACAT TCCCTTGGTA GCTGGACTTT 1800 

CTGCTGCTGG ACTCTCCCTC CTGACATTAG CACCATTTCT CCTCTGGCTT CGGAAATGCT 1860 

TACGGAAAG C AAAGAAATTT GTTCCTGCCA GCAGCTGCCA AAGCCTTGAA TCAGACGGAA 1920 

GCTACCAAAA GCCTTCTTAC ATCCTTTAAG TTCAAAAGAA TCAGAAACAG GTGCATCTGG 1980 

GGAACTAGAG GGATACACTG AAGTTAACAG AGACAGATAA CTCTCCTCGG GTCTCTGGCC 2040 

CTTCTTGCCT ACTATGCCAG ATGCCTTTAT GGCTGAAACC GCAACACCCA TCACCACTTC 2100 

AATAGATCAA AGTCCAGCAG GCAAGGACGG CCTTCAACTG AAAAGACTCA GTGTTCCCTT 2160 

TCCTACTCTC AGGATCAAGA AAGTGTTGGC TAATGAAGGG AAAGGATATT TTCTTCCAAG 2220 

CAAAGGTGAA GAGACCAAGA CTCTGAAATC TCAGAATTCC TTTTCTAACT CTCCCTTGCT 2280 

CGCTGTAAAA TCTTGGCACA GAAACACAAT ATTTTGTGGC TTTCTTTCTT TTGCCCTTCA 2340 

CAGTGTTTCG ACAGCTGATT ACACAGTTGC TGTCATAAGA ATGAATAATA ATTATCCAGA 2400 

GTTTAGAGGA AAAAAATGAC TAAAAATATT ATAACTTAAA AAAATGACAG ATGTTGAATG 2460 

CCCACAGGCA AATGCATGGA GGGTTGTTAA TGGTGCAAAT CCTACTGAAT GCTCTGTGCG 2520 

AGGGTTACTA TGCACAATTT AATCACTTTC ATCCCTATGG GATTCAGTGC TTCTTAAAGA 2580 

GTTCTTAAGG ATTGTGATAT TTTTACTTGC ATTGAATATA TTATAATCTT CCATACTTCT 2640 

TCATTCAATA CAAGTGTGGT AGGGACTTAA AAAACTTGTA AATGCTGTCA ACTATGATAT 2700 

GGTAAAAGTT ACTTATTCTA GATTACCCCC TCATTGTTTA TTAACAAATT ATGTTACATC 2760 

TGTTTTAAAT TTATTTCAAA AAGGGAAACT ATTGTCCCCT AGCAAGGCAT GATGTTAACC 2820 

AGAATAAAGT TCTGAGTGTT TTTACTACAG TTGTTTTTTG AAAACATGGT AGAATTGGAG 2880 

AGTAAAAACT GAATGGAAGG TTTGTATATT GTCAGATATT TTTTCAGAAA TATGTGGTTT 294 0 

CCACGATGAA AAACTTCCAT GAGGCCAAAC GTTTTGAACT AATAAAAGCA TAAATGCAAA 3000 

CACACAAAGG TATAATTTTA TGAATGTCTT TGTTGGAAAA GAATACAGAA AGATGGATGT 3060 

GCTTTGCATT CCTACAAAGA TGTTTGTCAG ATGTGATATG TAAACATAAT TCTTGTATAT 3120 

TATGGAAGAT TTTAAATTCA CAATAGAAAC TCACCATGTA AAAGAGTCAT CTGGTAGATT 3180 

TTTAACGAAT GAAGATGTCT AATAGTTATT CCCTATTTGT TTTCTTCTGT ATGTTAGGGT 3240 

GCTCTGGAAG AGAGGAATGC CTGTGTGAGC AAGCATTTAT GTTTATTTAT AAGCAGATTT 3300 

AACAATTCCA AAGGAATCTC CAGTTTTCAG TTGATCACTG GCAATGAAAA ATTCTCAGTC 3360 

AGTAATTGCC AAAGCTGCTC TAGCCTTGAG GAGTGTGAGA ATCAAAACTC TCCTACACTT 3420 

CCATTAACTT AGCATGTGTT GAAAAAAAAA GTTTCAGAGA AGTTCTGGCT GAACACTGGC 34 80 

AACGACAAAG CCAACAGTCA AAACAGAGAT GTGATAAGGA TCAGAACAGC AGAGGTTCTT 354 0 

TTAAAGGGGC AGAAAAACTC TGGGAAATAA GAGAGAACAA CTACTGTGAT CAGGCTATGT 3600 

ATGGAATACA GTGTTATTTT CTTTGAAATT GTTTAAGTGT TGTAAATATT TATGTAAACT 3660 

GCATTAGAAA TTAGCTGTGT GAAATACCAG TGTGGTTTGT GTTTGAGTTT TATTGAGAAT 3 720 

TTTAAATTAT AACTTAAAAT ATTTTATAAT TTTTAAAGTA TATATTTATT TAAGCTTATG 3780 
TCAGACCTAT TTGACATAAC ACTATAAAGG TTGACAATAA ATGTGCTTAT GTTT 

Seq ID NO: 75 Protein sequence: 
Protein Accession #: NP_000441 

1 11 21 31 41 51 

I I I I I I 

MIASQFLSAL TLVLLIKESG AWSYNTSTEA MTYDEASAYC QQRYTHLVAI QNKEEIEYLN 60 

SILSYSPSYY WIGIRKVNNV WVWVGTQKPL TEEAKNWAPG EPNNRQKDED CVEIYIKREK 120 

DVGMWNDERC SKKKLALCYT AACTNTSCSG HGECVETINN YTCKCDPGFS GLKCEQIVNC 180 

TALES PEHGS LVCSHPLGNF SYNSSCSISC DRGYLPSSME TMQCMSSGEW SAPIPACNW 240 

ECDAVTNPAN GFVECFQNPG SFPWNTTCTF DCEEGFELMG AQSLQCTSSG NWDNEKPTCK 300 

AVTCRAVRQP QNGSVRCSHS PAGEFTFKSS CNFTCEEGFM LQGPAQVECT TQGQWTQQIP 360 

VCEAFQCTAL SNPERGYMNC LPSASGSFRY GSSCEFSCEQ GFVLKGSKRL QCGPTGEWDN 420 

EKPTCEAVRC DAVHQPPKGL VRCAHSPIGE FTYKSSCAFS CEEGFELYGS TQLECTSQGQ 4 80 

WTEEVPSCQV VKCSSLAVPG KINMSCSGEP VFGTVCKFAC PEGWTLNGSA ARTCGATGHW 54 0 

SGLLPTCEAP TESNI PLVAG LSAAGLSLLT LAPFLLWLRK CLRKAKKFVP ASSCQSLESD 600 
GSYQKPSYIL 



Seq ID NO: 76 Nucleotide sequence: 
Nucleic Acid Accession #: NM_031439 

Coding sequence: 69.. 1235 (underlined sequences correspond to start and stop codons) 
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CCCGACCCGT GCGAGGGCCA GGTCCGCGCC TGCCCCGCCA GGCGAAGCGA GGCGACCCGC 60 

GTGCGGCCAT GGCTTCGCTG CTGGGAGCCT ACCCTTGGCC CGAGGGTCTC GAGTGCCCGG 120 

CCCTGGACGC CGAGCTGTCG GATGGACAAT CGCCGCCGGC CGTCCCCCGG CCCCCGGGGG 180 

ACAAGGGCTC CGAGAGCCGT ATCCGGCGGC CCATGAACGC CTTCATGGTT TGGGCCAAGG 240 

ACGAGAGGAA ACGGCTGGCA GTGCAGAACC CGGACCTGCA CAACGCCGAG CTCAGCAAGA 300 

TGCTGGGAAA GTCGTGGAAG GCGCTGACGC TGTCCCAGAA GAGGCCGTAC GTGGACGAGG 360 

CGGAGCGGCT GCGCCTGCAG CACATGCAGG ACTACCCCAA CTACAAGTAC CGGCCGCGCA 420 

GGAAGAAGCA GGCCAAGCGG CTGTGCAAGC GCGTGGACCC GGGCTTCCTT CTGAGCTCCC 480 

TCTCCCGGGA CCAGAACGCC CTGCCGGAGA AGAGAAGCGG CAGCCGGGGG GCGCTGGGGG 540 

AGAAGGAGGA CAGGGGTGAG TACTCCCCCG GCACTGCCCT GCCCAGCCTC CGGGGCTGCT 600 

ACCACGAGGG GCCGGCTGGT GGTGGCGGCG GCGGCACCCC GAGCAGTGTG GACACGTACC 660 

CGTACGGGCT GCCCACACCT CCTGAAATGT CTCCCCTGGA CGTGCTGGAG CCGGAGCAGA 720 

CCTTCTTCTC CTCCCCCTGC CAGGAGGAGC ATGGCCATCC CCGCCGCATC CCCCACCTGC 780 

CAGGGCACCC GTACTCACCG GAGTACGCCC CAAGCCCTCT CCACTGTAGC CACCCCCTGG 840 

GCTCCCTGGC CCTTGGCCAG TCCCCCGGCG TCTCCATGAT GTCCCCTGTA CCCGGCTGTC 900 

CCCCATCTCC TGCCTATTAC TCCCCGGCCA CCTACCACCC ACTCCACTCC AACCTCCAAG 960 

CCCACCTGGG CCAGCTTTCC CCGCCTCCTG AGCACCCTGG CTTCGACGCC CTGGATCAAC 1020 

TGAGCCAGGT GGAACTCCTG GGGGACATGG ATCG CAATGA ATTCGACCAG TATTTGAACA 1080 

CTCCTGGCCA CCCAGACTCC GCCACAGGGG CCATGGCCCT CAGTGGGCAT GTTCCGGTCT 1140 

CCCAGGTGAC ACCAACGGGT CCCACAGAGA CCAGCCTCAT CTCCGTCCTG GCTGATGCCA 1200 

CGGCCACGTA CTACAACAGC TACAGTGTGT CATAGAGCTG GAGGCGCCCC GTCCGGTCAG 1260 

CCCTCGCGCC CTCTCCTTCT TGTGCCTTGA GTGGCAGAGG AGCCGTCCAG CCACACCAGC 1320 

TTTCCTCCCA CCGCTCAGGG CAGGGAGGTC TGAACTGCGG CCCCAGAGCC TTTGGCCTAA 1380 

GCTGGACTCT CCTTATCCGA GTGCCGCCTC TATCCCCTTC CCCACGTTCC AGCCCCTGCA 1440 

GCCCACATTT TAAGTATATT CCTTCAAGTG AGTTTTCCTC CAGCCCCTGA GAGTTGCTGT 1500 

CTCCCAGTGG AATGTTCACT GACGTCTTTT CTTGGTAGCC ATCATCGAAA CTAATGGGGG 1560 

GACAGACTTG ATAGCCAAGG TCCCTTCTGG TCCAGTTTTC TGATTTAGGG TTCTCTCAAG 1620 

ATTAATAAAG GAAGATGGGG AAATTTGACT, CATTAATGAG CTCGCTAACC TACGATCTGG 1680 

TGATAATTTT GTGTGCACAG CCCAAGGACC ACGAGGCTTT CTGCACTTTC TGCACCCCCT 174 0 

TCCAAAGTGA CCACAAAATT TCAAAGGGAC TCATACAATT TGAGAAAAAA CAGTCAACCT 1800 

GATTTGAGAA ATTAACCAGT ATGGCTAACT ATATCACAGA AAATGGGATT GAGTTAAAAC 1860 

TATTTTATTT TA aaTATACA TTTTAAAGCA GTTCTTTTTT TTTGTTAATT TGTTTATTAT 1920 

ACACACACTT CAAGAGCCAC CGCGCCCAGC CTAGATTTAT AATTTTCATT CTCTTTTACC 1980 

TATAAAATTC AGTGTATTAG TTTCATTACA TAGGAGAAAT TATATTTCTA AACATTTTAT 2 040 
GATGTTTAAA AACAAAACAG GCTGTTGTAA AAAAAAAAAA AAAAAAAAA 



Seq ID NO: 77 Protein sequence : 
Protein Accession #: NP_113627 

1 11 21 31 41 51 
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MASLLGAYPW PEGLECPALD AELSDGQSPP AVPRPPGDKG SESRIRRPMN AFMVWAKDER 60 
KRLAVQNPDL HNAELSKMLG KSWKALTLSQ KRPYVDEAER LRLQHMQDYP NYKYRPRRKK 120 
QAKRLCKRVD PGFLLSSLSR DQNALPEKRS GSRGALGEKE DRGEYSPGTA LPSLRGCYHE 180 
GPAGGGGGGT PSSVDTYPYG LPTPPEMSPL DVLEPEQTFF SS PCQEEHGH PRRIPHLPGH 240 
PYSPEYAPSP LHCSHPLGSL ALGQSPGVSM MSPVPGCPPS PAYYSPATYH PLHSNLQAHL 300 
GQLSPPPEHP GFDALDQLSQ VELLGDMDRN EFDQYLNTPG HPDSATGAMA LSGHVPVSQV 360 
TPTGPTETSL ISVLADATAT YYNSYSVS 

Seq ID NO: 78 Nucleotide sequence: 
Nucleic Acid Accession #: XM_035787 

Coding sequence: 329.. 949 {underlined sequences correspond to start and 
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TGCCCCGCCC CGCTCCCCAG CGCCCCGGAA GTGATCTGTG GCGGCTGCTG CAGAGCCGCC 60 

AGGAGGAGGG TGGATCTCCC CAGAGCAAAG CGTCGGAGTC CTCCTCCTCC TTCTCCTCCT 120 

CCTCCTCCTC CTCCTCCAGC CGCCCAGGCT CCCCCGCCAC CCGTCAGACT CCTCCTTCGA 18 0 

CCGCTCCCGG CGCGGGGCCT TCCAGGCGAC AAGGACCGAG TACCCTCCGG CCGGAGCCAC 240 

GCAGCCGCGG CTTCCGGAGC CCTCGGGGCG GCGGACTGGC TCGCGGTGCA GATTCTTCTT 300 

AATCCTTTGG TGAAAACTGA GACACAAAAT GGCTGCAAAT AAGCCCAAGG GTCAGAATTC 360 

TTTGGCTTTA CACAAAGTCA TCATGGTGGG CAGTGGTGGC GTGGGCAAGT CAGCTCTGAC 420 

TCTACAGTTC ATGTACGATG AGTTTGTGGA GGACTATGAG CCTACCAAAG CAGACAGCTA 480 

TCGGAAGAAG GTAGTGCTAG ATGGGGAGGA AGTCCAGATC GATATCTTAG ATACAGCTGG 540 

GCAGGAGGAC TACGCTGCAA TTAGAGACAA CTACTTCCGA AGTGGGGAGG GGTTCCTCTG 600 

TGTTTTCTCT ATTACAGAAA TGGAATCCTT TGCAGCTACA GCTGACTTCA GGGAGCAGAT 660 

TTTAAGAGTA AAAGAAGATG AGAATGTTCC ATTTCTACTG GTTGGTAACA AATCAGATTT 720 

AGAAGATAAA AGACAGGTTT CTGTAGAAGA GGCAAAAAAC AGAGCTGAGC AGTGGAATGT 780 

TAACTACGTG GAAACATCTG CTAAAACACG AGCTAATGTT GACAAGGTAT TTTTTGATTT 840 
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10 



15 



20 



25 



30 



AATGAGAGAA 
GAGGAAAAGT 
ACTCCTTTCT 
TTAATTGACT 
TTGGAACTGC 
AGCAAAGTTC 
AAGCTTGTGT 
TGGGAGGAAA 
AAATGTCTTG 
CCACTTTTTT 
CATATAGTTA 
TTAGAAAAGT 
GAAAAATATC 
CCATGTGTCC 
AAGTAACTGT 
AAAATAAGAT 
AGGGCCAGCA 
TTTGAATGAA 
GTGAGCATGT 
AGTGCTAATG 
ATTTCTAAAT 
TTTCTGTTTC 
CCATCAAACC 
TCACCATTGT 
CAGTGGGAAA 
AGTTGATTCA 
TGTGTTCAGG 
ACTCACTCTT 
TGAGATGTAC 
TACCAATGTA 
TTTAACCAGG 
GGTACTGAGG 
ACTACCCGAT 



ATTCGAGCGA 
TTAGCCAAGA 
TATCTTGACC 
GAAATTACTT 
AATGAAAGTC 
AACTTATTTC 
TTCTTGGGCA 
GGTGACTTCC 
GTCTTCTACT 
TAACCATTAT 
AACTGAGAGT 
GGTGTAAACT 
TGGTTATATC 
TGGTGAGAAA 
CCGCTAGAAG 
TCCAGAGCTC 
TATATACTTG 
CCCTCCTTTT 
AAGTGTTAAG 
CATTTTGCAC 
TTATATTCAT 
TGTTTATAAT 
TGGGTATAGT 
GTGGTGTACC 
TATGCCACTG 
ACAAAGTATT 
CATTCCAGGT 
TAGATATTTA 
ATCTTTCATT 
ACACTGGCCA 
GGTCCTAACC 
TGCTATGAAG 
TTGTTTATTT 



GAAAGATGGA 
GAATCAGAGA 
ATACTAATAA 
TAACATTTTG 
AAATTTACTT 
ATAATTGCCT 
GTCTTTCTTG 
TCTGGTGTTT 
GCCTTGAAAA 
TATGCAAAAT 
AATTCATCTG 
TGTACATGGA 
ATTCTGGGTG 
ATATATGCCT 
TCTGTCCAAA 
TTTGATCGCT 
CAAGATAATT 
CTCTGAGATT 
TTTTTAATCT 
TAGAACGCTT 
AAAGTTACAG 
GAAGAACACT 
GCAGAAAACG 
TGCTGGAAGA 
ACCGATTTTT 
TTTTTCTTTT 
AACAGGTGTG 
CATCCAGCTT 
TCGTATTTCT 
GCGGGCCCAG 
ACTAACATTG 
CCAACTGACA 
GCAATTTGAG 



AGACAGCAAA 
AAGATGCTGC 
ATATAATTTA 
GAAATTGTTG 
TAAAAAGAAA 
ACATTTATCA 
AAATTGAAGA 
ATTATAAAGC 
ATGACAATTG 
TTAGAAGAAA 
TGAATCTGCT 
ATTTTTTGAA 
TGTTCTTACT 
GGCACAGCTT 
TTTAAAATGT 
TTTAATAAAC 
TTCAGCTGCA 
CTGGTCCCTG 
GGGAGCAGGG 
CGGGAAAATA 
TTTGATACAG 
GTAGCTACAT 
TGGCACACAC 
ATTCTAGCAT 
TTTTTTTCCT 
TTCTCAGTCC 
TATGTAAAGT 
CTCATGTTAA 
CATAGGCTAT 
CAATCTCCAT 
TGACTTTGCT 
AAG ATG CATC 
CCATTTAAAG 



GAAAAGAATG 
ATTTT ATAA T 
TAAGCATTGC 
TATATCACTA 
TTAATATGGC 
TGGTCCTGAA 
GGTGAAATGG 
TTAAATTTTA 
TGAACATGAT 
AGTTATTGGC 
TTAATTACCT 
TATGCCTTAA 
GACACCAGGG 
TTGTATAGAA 
GTGCCATATT 
TGCAAGTTCA 
AGGATTCAGC 
GAAATCCCTT 
CATAGGAAGA 
TTCATGCTTG 
GAATTATTAG 
TTTCAGAAGT 
TGACCACACA 
GCTACTTGGG 
CTTTGCAGTG 
TAATTTGAAC 
TAAAAATAGG 
ATATTTGTCC 
GCCATGTGCG 
GTGTACTTAT 
TTGAGACCTT 
ACGTGTCTTA 
ACCAATAAAC 



GAAAAAAGAA 900 

CAAAGCCCAA 960 

CATTGAAGGC 1020 

AAAGCATGAA 1080 

TTCACCAAGA 1140 

TGTAGCGTGT 1200 

GGGTGGGGAG 1260 

TATCATTTTA 1320 

AGTTAAACTA 1380 

ATGGTTGTTG 1440 

GGTGAGTAAC 1500 

TTTAGAAACT 1560 

GTCCGCTGCC 1620 

AATTCTTGAG 1680 

CTGGTTCTTG 1740 

TTTTAAATG A 1800 

ACCAGTTATG 1860 

TCTGCTAGTG 1920 

AAATGTCAGT 1980 

CCATCTGTTC 2040 

GAGTAATTCT 2100 

TAACATCAAG 2160 

TTAGGCTGTG 2220 

GACATAATTT 2280 

GGGCTAGGAC 2340 

AGGTCAAAGA 2400 

CTTTTTAGGA 2460 

TTAAAGGGTT 2520 

GAATTCAAGT 2580 

TACAGTCTTA 2640 

TCCTCTCCTG 2700 

GGCTGATGCC 2760 
TTCCTTTTTT 



35 



Seq ID NO: 79 Protein sequence: 
Protein Accession #: XP 035787 



40 



45 



50 



55 



60 



65 



70 



11 21 31 41 51 

I I I I I 

MAANKPKGQN SLALHKVTMV GSGGVGKSAL TLQFMYDEFV EDYEPTKADS YRKKWLDGE 

EVQIDILDTA GQEDYAAIRD NYFRSGEGFL CVFSITEMES FAATADFREQ I LRVKEDENV 

PFLLVGNKSD LEDKRQVSVE EAKNRAEQWN VNYVETSAKT RANVDKVFFD LMREIRARKM 

EDSKEKNGKK KRKSLAKRIR ERCCIL 



60 
120 
180 



Seq ID NO: 80 Nucleotide sequence: 
Nucleic Acid Accession #: NM_003467 

Coding sequence: 89.. 1147 (underlined sequences' correspond to start and stop codons) 



75 



I 

GTTTGTTGGC 
CACCGCATCT 
CTACACCGAG 
AGAAAATGCT 
TGGCATTGTG 
CATGACGGAC 
TCCCTTCTGG 
AGTCCATGTC 
TCTGGACCGC 
GGCTGAAAAG 
CTTCATCTTT 
CAATGACTTG 
TGGTATTGTC 
CCACCAGAAG 
TTGGCTGCCT 
GCAAGGGTGT 
TTTCTTCCAC 
CTCTGCCCAG 
AGGAAAGCGA 
CAGCTAACAC 
ACATTTTTCA 
TTGTCTTGTG 
TGTTTCATAT 
CTCGTGGTAG 
AAGCTAGAAA 



11 
I 

TGCGGCAGCA 
GGAGAACCAG 
GAAATGGGCT 
AATTTCAATA 
GGCAATGGAT 
AAGTACAGGC 
GCAGTTGATG 
ATCTACACAG 
TACCTGGCCA 
GTGGTCTATG 
GCCAACGTCA 
TGGGTGGTTG 
ATCCTGTCCT 
CGCAAGGCCC 
TACTACATTG 
GAGTTTGAGA 
TGTTGTCTGA 
CACGCACTCA 
GGTGGACATT 
AGATGTAAAA 
GATATAAAAG 
TTTCTTTAGT 
TGATGTGTGT 
GACTGTAGAA 
TGATCCCCAG 



21 



31 



41 



51 



GGTAGCAAAG 
CGGTTACCAT 
CAGGGGACTA 
AAATCTTCCT 
TGGTCATCCT 
TGCACCTGTC 
CCGTGGCAAA 
TCAACCTCTA 
TCGTCCACGC 
TTGGCGTCTG 
GTGAGGCAGA 
TGTTCCAGTT 
GCTATTGCAT 
TCAAGACCAC 
GGATCAGCAT 
ACACTGTGCA 
ACCCCATCCT 
CCTCTGTGAG 
CATCTGTTTC 
GACTTTTTTT 
ACTGACCAAT 
TTTTGTGAAG 
CTAGGCAGGA 
AAGGGAACTG 
CTGTTTATGC 



TGACGCCGAG 
GGAGGGGATC 
TGACTCCATG 
GCCCACCATC 
GGTCATGGGT 
AGTGGCCGAC 
CTGGTACTTT 
CAGCAGTGTC 
CACCAACAGT 
GATCCCTGCC 
TGACAGATAT 
TCAGCACATC 
TATCATCTCC 
AGTCATCCTC 
CGACTCCTTC 
CAAGTGGATT 
CTATGCTTTC 
CAGAGGGTCC 
CACTGAGTCT 
TATACGATAA 
ATTGTACAGT 
TTTAATTGAC 
CCTGTGGCCA 
AACATTCCAG 
ATAGATAATC 



GGCCTGAGTG 
AGTATATACA 
AAGGAACCCT 
TACTCCATCA 
TACCAGAAGA 
CTCCTCTTTG 
GGGAACTTCC 
CTCATCCTGG 
CAGAGGCCAA 
CTCCTGCTGA 
ATCTGTGACC 
ATGGTTGGCC 
AAGCTGTCAC 
ATCCTGGCTT 
ATCCTCCTGG 
TCCATCACCG 
CTTGGAGCCA 
AGCCTCAAGA 
GAGTCTTCAA 
ATAACTTTTT 
TTTTATTGCT 
TTATTTATAT 
AGTTCTTAGT 
AGCGTGTAGT 
TCTCCATTCC 



CTCCAGTAGC 
CTTCAGATAA 
GTTTCCGTGA 
TCTTCTTAAC 
AACTGAGAAG 
TCATCACGCT 
TATGCAAGGC 
CCTTCATCAG 
GGAAGCTGTT 
CTATTCCCGA 
GCTTCTACCC 
TTATCCTGCC 
ACTCCAAGGG 
TCTTCGCCTG 
AAATCATCAA 
AGGCCCTAGC 
AATTTAAAAC 
TCCTCTCCAA 
GTTTTCACTC 
TTTAAGTTAC 
TGTTGGATTT 
AAATTTTTTT 
TGCTGTATGT 
GAATCACGTA 
CGTGGAACGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
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TTTTCCTGTT CTTAAGACGT GATTTTGCTG TAGAAGATGG CACTTATAAC CAAAGCCCAA 1560 
AGTGGTATAG AAATGCTGGT TTTTCAGTTT TCAGGAGTGG GTTGATTTCA GCACCTACAG 1620 
TGTACAGTCT TGTATTAAGT TGTTAATAAA AGTACATGTT AAACTTACTT AGTGTTATG 

Seq ID NO: 81 Protein sequence: 
Protein Accession #: NP 003458 



1 11 21 31 41 51 

I I I I I I 

MEGISIYTSD NYTEEMGSGD YDSMKEPCFR EENANFNKIF LPTIYSIIPL TGIVGNGLVI 60 
LVMGYQKKLR SMTDKYRLHL SVADLLFVIT LPFWAVDAVA NWYFGNFLCK AVHVIYTVNIr 120 
YSSVLILAFI SLDRY1AIVH ATNSQRPRKL LAEKWYVGV WIPALLLTIP DPIFANVSEA 180 
DDRYICDRPY PNDLWVWFQ FQHIMVGLIL PGIVILSCYC IIISKLSHSK GHQKRKALKT 240 
TVILILAFFA CWIiPYYIGIS IDSFILIiEII KQGCEFENTV HKWISITEAL AFFHCCLNPI 300 
LYAFLGAKFK TSAQHALTSV SRGSSLK1LS KGKRGGHSSV STESESSSFH SS 



Seq ID NO: 82 Nucleotide sequence: 
Nucleic Acid Accession #: NM_014959 

Coding sequence: 314.. 1609 (underlined sequences correspond to start and 

1 11 21 31 41 51 

I I I I I I 

CTGGTTCTCA ACTTCTTTTG AAATAATGTT CATAGAGAAG GAGGGCTGTC TGAGATTCGA 6 0 

GGGAAACAAG CTCTCAGGAC TTCCGGTCGC CATGATGGCT GTGGGCGGTA AACGCGGTTA 120 

GTGCAAGCAT CTGGGCCATC TTCAATGGTA AAAAAGATAC AGTAAAGACA TAAATACCAC 180 

ATTTGACAAA TGGAAAAAAA GGAGTGTCCA GAAAAGAGTA GCAGCAGTGA GGAAGAGCTG 240 

CCGAGACGGG TATACAGGGA GCTACCCTGT GTTTCTGAGA CCCTTTGTGA CATCTCACAT 300 

TTTTTCCAAG AAGATGATGA GACAGAGGCA GAGCCATTAT TGTTCCGTGC TGTTCCTGAG 360 

TGTCAACTAT CTGGGGGGGA CATTCCCAGG AGACATTTGC TCAGAAGAGA ATCAAATAGT 420 

TTCCTCTTAT GCTTCTAAAG TCTGTTTTGA GATCGAAGAA GATTATAAAA ATCGTCAGTT 480 

TCTGGGGCCT GAAGGAAATG TGGATGTTGA GTTGATTGAT AAGAGCACAA ACAGATACAG 540 

CGTTTGGTTC CCCACTGCTG GCTGGTATCT GTGGTCAGCC ACAGGCCTCG GCTTCCTGGT 600 

AAGGGATGAG GTCACAGTGA CGATTGCGTT TGGTTCCTGG AGTCAGCACC TGGCCCTGGA 660 

CCTGCAGCAC CATGAACAGT GGCTGGTGGG CGGCCCCTTG TTTGATGTCA CTGCAGAGCC 720 

AGAGGAGGCT GTCGCCGAAA TCCACCTCCC CCACTTCATC TCCCTCCAAG GTGAGGTGGA 780 

CGTCTCCTGG TTTCTCGTTG CCCATTTTAA GAATGAAGGG ATGGTCCTGG AGCATCCAGC 840 

CCGGGTGGAG CCTTTCTATG CTGTCCTGGA AAGCCCCAGC TTCTCTCTGA TGGGCATCCT 900 

GCTGCGGATC GCCAGTGGGA CTCGCCTCTC CATCCCCATC ACTTCCAACA CATTGATCTA 960 

TTATCACCCC CACCCCGAAG ATATTAAGTT CCACTTGTAC CTTGTCCCCA GCGACGCCTT 1020 

GCTAACAAAG GCGATAGATG ATGAGGAAGA TCGCTTCCAT GGTGTGCGCC TGCAGACTTC 1080 

GCCCCCAATG GAACCCCTGA ACTTTGGTTC CAGTTATATT GTGTCTAATT CTGCTAACCT 1140 

GAAAGTAATG CCCAAGGAGT TGAAATTGTC CTACAGGAGC CCTGGAGAAA TTCAGCACTT 1200 

CTCAAAATTC TATGCTGGGC AGATGAAGGA ACCCATTCAA CTTGAGATTA CTGAAAAAAG 1260 

ACATGGGACT TTGGTGTGGG ATACTGAGGT GAAGCCAGTG GATCTCCAGC TTGTAGCTGC 1320 

ATCAGCCCCT CCTCCTTTCT CAGGTGCAGC CTTTGTGAAG GAGAACCACC GGCAACTCCA 13 8 0 

AGCCAGGATG GGGGACCTGA AAGGGGTGCT CGATGATCTC CAGGACAATG AGGTTCTTAC 1440 

TGAGAATGAG AAGGAGCTGG TGGAGCAGGA AAAGACACGG CAGAGCAAGA ATGAGGCCTT 1500 

GCTGAGCATG GTGGAGAAGA AAGGGGACCT GGCCCTGGAC GTGCTCTTCA GAAGCATTAG 1560 

TGAAAGGGAC CCTTACCTCG TGTCCTATCT TAGACAGCAG AATTTG TAAA ATGAGTCAGT 1620 

TAGGTAGTCT GGAAGAGAGA ATCCAGCGTT CTCATTGGAA ATGGATAAAC AGAAATGTGA 1680 

TCATTGATTT CAGTGTTCAA GACAGAAGAA GACTGGGTAA CATCTATCAC ACAGGCTTTC 174 0 

AGGACAGACT TGTAACCTGG CATGTACCTA TTGACTGTAT CCTCATGCAT TTTCCTCAAG 1800 

AATGTCTGAA GAAGGTAGTA ATATTCCTTT TAAATTTTTT CCAACCATTG CTTGATATAT 1860 

CACTATTTTA TCCATTGACA TGATTCTTGA AGACCCAGGA TAAAGGACAT CCGGATAGGT 1920 

GTGTTTATGA AGGATGGGGC CTGGAAAGGC AACTTTTCCT GATTAATGTG AAAAATAATT 1980 

CCTATGGACA CTCCGTTTGA AGTATCACCT TCTCATAACT AAAAGCAGAA AAGCTAACAA 2040 

AAGCTTCTCA GCTGAGGACA CTCAAGGCAT ACATGATGAC AGTCTTTTTT TTTTTTGTAT 2100 

GTTAGGACTT TAACACTTTA TCTATGGCTA CTGTTATTAG AACAATGTAA ATGTATTTGC 2160 

TGAAAGAGAG CACAAAAATG GGAGAAAATG CAAACATGAG CAGAAAATAT TTTCCCACTG 2220 

GTGTGTAGCC TGCTACAAGG AGTTGTTGGG TTAAATGTTC ATGGTCAACT CCAAGGAATA 2280 

CTGAGATGAA ATGTGGTAAA TCAACTCCAC AGAACCACCA AAAAGAAAAT GAGGGTAATT 234 0 

CAGCTTATTC TGAGACAGAC ATTCCTGGCA ATGTACCATA CAAAAAATAA GCCAACTCTG 24 00 

ACATTTGGAT TCTACCATAG ACTCTGTCAT TTTGTAGCCA TTTCAGCTGT CTTTTGATTA 2460 

ATGTTTTCGT GGCACACATA TTTCCATCCT TTTATGTTTA ATCTGTTTAA AACAAGTTCC 2520 

TAGTAGACAC CATCTGGTTG AGTCAGTTTT TTTTATGGTG TATTTTGAAC CCATTCTGAT 2580 

AGTCTCTTTT AACTGGAAGA TTTCAATTAC TTACGTTAAT GTAATTATTA ATATGTTAGG 264 0 

ATTTATCCTC AGTCAGCCAG TTTGTTATGT CTTTTCTATT CTACTGTTAT CACATTTGTA 2700 

CCACTTAAAG TGGAATCTAG GCACTTTATC ACCATTTAGA TCCTATTACC TTTTCTCATC 2760 

TAGGATATAG TTATCTTCTA CATAATCTTT CTGTATCTTA AAACCCATCA ATAAATTATT 2820 

ATATATTTTC TACTTTTAAT CACTCAGAAG ATTTAAAAAA CTCATGAGAA GAGTAATCTG 28 80 

TTATGTTTTT CCAGATATTT ACCATTTCTG TTGCTCTTCC TTCATTATTT TCCAAATTTC 2 940 

GTTCTGCAAA TTTCCACTTC TTCTGATAGA CGTTTTTTAG TTCTTTTAGA GTGGTTCTGA 3000 

TAGGTACAGA TTCTCTTATT TTTTGCTTCC TCTGAGGACA TCTTTTTCTC ACCTTCATTC 3060 

TCAGTGATGT TTTTTGCTTG TAGTATTTTT AGTTGACATT GTTTTCTGTT CAGCAGTTTC 3120 
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CTTTTAGCTT CCGTATTTCC TGATGAGAAA TCTGCAGTCA TTCAAATTGT TGTTTCCCTG 3180 

TATGTAGTGT GTCATTTTTC TGTCAGATTT CAAGGTATTT ATCTTTAGTT TTTAGCCATT 3240 

TCATTATGTT GGGGATGAGT TTCCTTGTTT TATTCCCTTT GGAATTTGCT CCAATTCATA 3300 

AATTTGCAGT TTTATGTCTT TTACCAAACT TAGAGGTTTT CAGCCTAATT TCTAAAAATA 3360 

CTTTTTATTA GCCTGATTTT CATCTTTATA GGAAATAGTT TAAGTGATGA CAAGTTCCAA 3420 

TAGCTTATAT GCCCAGAAGG CCTTCAAAAT AAGAATTTTG AAAGAATACA GAAAACAAAC 3480 

TTTTATATCC TTCTCATGTC TTCTACTGTA AAATTCATAT GCTTTGCTAC TCTAAACCTA 3540 

GTTTGAAATC AACAGTCTTG AGAATAGATG AAAATTTTGA TGAATAGTGG AATTCTTTTA 3600 

AATGGAAACC TCTTACATGT GATTTTCCTT GCCATCTAGA AATAAACCAT AGTATTTATG 3660 

TTGAATCAAT CAATATTATA TTTTGTTTTT TTCCTCCTCT TCTGAGACTC TTATTGTGGA 3720 

AATGTTAGAC TTTTATGTTT TCCTAAATGT CCCTGATATT CTACTTATTT AGAACATCTT 3780 

TTCATTTTTT CCATTATTCT GATTGGGTAA TTTTAATTTG TCTATTTTCA AATTTGCTGG 3 840 

AGTGTTCACC TGTTGTTGTC TGTGTCGTCC CACTGAGTGC ATTCACCACC TTTTAAATTT 3900 

TGGTCACTGT ATGTATCAGT TCTAAAATTT CCATTTTGTT CTCTATATTT TAAATTTCTT 3960 

GGCTTATATT CTATTTTCCT GCAAATGTGT CAGCATTTGC TTGTTTGAGC TTTTTTTTTT 4020 

TCAAGACAGG GTCTCAACTC TGTTACCCAG GCTGGAGTGC AGTGGTGCGA TCTCAGCTCA 4080 

CTGCAACCTC TGCCTCCTGG TTCAAGCGAT TATTGTGCCT CAGCCTCCTG AGTAGCTGGG 414 0 

ATTACAGGCA TGCACCACCA CAGCCCAGCT AATTTTTTGT ATTTTTAGTA GAGACAGAGT 4200 

TTTGCTATGT TGGCCAGGCT GGTTTTGAAC TCCTGGCCTC AAGTGATCCA CCCACCTCAG 4260 

CCTCCCAAAG TGCTGGGATT ACAGGCCACT ACACCTGGCA CATTTGAGTA TTTTTTTTTT 4320 

TTTTTTTTTT TTGAGATGGA GTCTCGCTCT GTCATCTAGG CTGGAGTGCA GTGGTGTGAT 4380 

CTCAGCTCAC TGCAGCCTCT GTCTCCCGGG CTCAAGCGAT TCTCTTGCCT CAGCCTCCTG 444 0 

AGTAGCTAGG ACTACAGGTG CATGCCAACA CGCCCGGCTA ATTTTTTTAA AAAATATTTT 4500 

TAGTAGAGAC AGGGTTTCAC CATTTTGGCC AGGATGGTCT CGATCTCCTG ACCTCATGAT 4560 

CCACCCGCCT CGGCCTTCCA AAGTGCTGGG ATTACAGGCA TGAGCCACCG TGCCTGGCCT 4620 

CATTTGAGTA TTTTTATAAT GTCTCTTTTA AAGTCTTTGT CAGATAATTC CACTGTACAT 4680 

GTTATTCAGT GTTTGGTGTC CACTGAGTTG TCATTTGCCA GACAAGTGGA GATTTTTGCA 4740 

GCTCATCCTT GTATTCTCAG TAGTTCCGAT ATGTACCCTC GACATGTGAA TGTTATCTTA 4800 

TGAGACTCTG TTTTATTTGT ATCCAACAGA AGATGTTTAT TATTTATTTG GCTTTCTGTG 4860 

AACTGAGGTC TTAATATCAG CTCATTTTAA AAGTCTTTGC AGTGGTATTC GGATCTATCC 4920 

TGTGTGTGCC TATGAGATTG GGTGCAGTGT ATCCTGTTAG CTCCATTCTC AGGGCGTTTG 4 980 

AATGTGAATT AGGACCAGCG CAATGAATGC TCAAGTTGGG GTTGGGCGTT AGAATTCATA 5040 
AAAGTCTTTA TATGCTCAG 



Seq ID NO: 83 Protein sequence: 
Protein Accession #: NP 055774 



1 
I 

MMRQRQSHYC 
GNVDVELIDK 
EQWLVGGPLF 
FYAVLESPSF 
IDDEEDRFHG 
AGQMKEPIQL 
DLKGVLDDLQ 
YLVSYLRQQN 



11 



21 



SVLFLSVNYL 
STNRYSVWFP 
DVTAEPEEAV 
SLMGILLRIA 
VRLQTSPPME 
EITEKRHGTL 
DNEVLTENEK 
L 



GGTFPGDICS 
TAGWYLWSAT 
AEIHLPHFIS 
SGTRLSIPIT 
PLNFGSSYIV 
VWDTEVKPVD 
EbVEQEKTRQ 



31 

I 

EENQIVSSYA 
GLGFLVRDEV 
LQGEVDVSWF 
SNTLIYYHPH 
SNSANLKVMP 
LQLVAASAPP 
SKNEALLSMV 



41 



SKVCFEIEED 
TVTIAFGSWS 
LVAHFKNEGM 
PEDIKFHLYL 
KELKLSYRSP 
PFSGAAFVKE 
EKKGDLALDV 



51 
I 

YKNRQFLGPE 60 

QHLALDLQHH 120 

VLEHPARVEP 180 

VPSDALLTKA 240 

GEIQHFSKFY 300 

NHRQLQARMG 360 

LFRSISERDP 420 



Seq ID NO: 84 Nucleotide sequence: 
Nucleic Acid Accession #: NM_007036 

Coding sequence: 56-610 (underlined sequences correspond to start and stop 

1 11 21 31 41 51 

I I I I I I 

CTTCCCACCA GCAAAGACCA CGACTGGAGA GCCGAGCCGG AGGCAGCTGG GAAACATGAA 60 

GAGCGTCTTG CTGCTGACCA CGCTCCTCGT GCCTGCACAC CTGGTGGCCG CCTGGAGCAA 120 

TAATTATGCG GTGGACTGCC CTCAACACTG TGACAGCAGT GAGTGCAAAA GCAGCCCGCG 180 

CTGCAAGAGG ACAGTGCTCG ACGACTGTGG CTGCTGCCGA GTGTGCGCTG CAGGGCGGGG 24 0 

AGAAACTTGC TACCGCACAG TCTCAGGCAT GGATGGCATG AAGTGTGGCC CGGGGCTGAG 300 

GTGTCAGCCT TCTAATGGGG AGGATCCTTT TGGTGAAGAG TTTGGTATCT GCAAAGACTG 360 

TCCCTACGGC ACCTTCGGGA TGGATTGCAG AG AGAC CTGC AACTGCCAGT CAGGCATCTG 420 

TGACAGGGGG ACGGGAAAAT GCCTGAAATT CCCCTTCTTC CAATATTCAG TAACCAAGTC 480 

TTCCAACAGA TTTGTTTCTC TCACGGAGCA TGACATGGCA TCTGGAGATG GCAATATTGT 54 0 

GAGAGAAGAA GTTGTGAAAG AGAATGCTGC CGGGTCTCCC GTAATGAGGA AATGGTTAAA 600 

TCCACGC TGA TCCCGGCTGT GATTTCTGAG AGAAGGCTCT ATTTTCGTGA TTGTTCAACA 660 

CACAGCCAAC ATTTTAGGAA CTTTCTAGAT ATAGCATAAG TACATGTAAT TTTTGAAGAT 720 

CCAAATTGTG ATGCATGGTG GATCCAGAAA ACAAAAAGTA GGATACTTAC AATCCATAAC 780 

ATCCATATGA CTGAACACTT GTATGTGTTT GTTAAATATT CGAATGCATG TAGATTTGTT 840 

AAATGTGTGT GTATAGTAAC ACTGAAGAAC TAAAAATGCA ATTTAGGTAA TCTTACATGG 900 

AGACAGGTCA ACCAAAGAGG GAGCTAGGCA AAGCTGAAGA CCGCAGTGAG TCAAATTAGT 960 

TCTTTGACTT TGATGTACAT TAATGTTGGG ATATGGAATG AAGACTTAAG AGCAGGAGAA 1020 

GATGGGGAGG GGGTGGGAGT GGGAAATAAA ATATTTAGCC CTTCCTTGGT AGGTAGCTTC 1080 

TCTAGAATTT AATTGTGCTT TTTTTTTTTT TTTGGCTTTG GGAAAAGTCA AAATAAAACA 1140 
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ACCAGAAAAC CCCTGAAGGA AGTAAGATGT TTGAAGCTTA TGGAAATTTG AGTAACAAAC 1200 

AGCTTTGAAC TGAGAGCAAT TTCAAAAGGC TGCTGATGTA GTTCCCGGGT TACCTGTATC 1260 

TGAAGGACGG TTCTGGGGCA TAGGAAACAC ATACACTTCC ATAAATAGCT TTAACGTATG 1320 

CCACCTCAGA GATAAATCTA AGAAGTATTT TACCCACTGG TGGTTTGTGT GTGTATGAAG 1380 

GTAAATATTT ATATATTTTT ATAAATAAAT GTGTTAGTGC AAGTCATCTT CCCTACCCAT 144 0 

ATTTATCATC CTCTTGAGGA AAGAAATCTA GTATTATTTG TTGAAAATGG TTAGAATAAA 1500 

AACCTATGAC TCTATAAGGT TTTCAAACAT CTGAGGCATG ATAAATTTAT TATCCATAAT 1560 

TATAGGAGTC ACTCTGGATT TCAAAAAATG TCAAAAAATG AGCAACAGAG GGACCTTATT 1620 

TAAACATAAG TGCTGTGACT TCGGTGAATT TTCAATTTAA GGTATGAAAA TAAGTTTTTA 1680 

GGAGGTTTGT AAAAGAAGAA TCAATTTTCA GCAGAAAACA TGTCAACTTT AAAATATAGG 1740 

TGGAATTAGG AGTATATTTG AAAGAATCTT AGCACAAACA GGACTGTTGT ACTAGATGTT 1800 

CTTAGGAAAT ATCTCAGAAG TATTTTATTT GAAGTGAAGA ACTTATTTAA GAATTATTTC 1860 

AGTATTTACC TGTATTTTAT TCTTGAAGTT GGCCAACAGA GTTGTGAATG TGTGTGGAAG 1920 

GCCTTTGAAT GTAAAGCTGC ATAAGCTGTT AGGTTTTGTT TTAAAAGGAC ATGTTTATTA 1980 
TTGTTCAATA AAAAAGAACA AGATAC 



Seq ID NO: 85 Protein sequence: 
Protein Accession #: NP_008 967.1 

1 11 21 31 

I I I I 

MKSVLLLTTL LVPAHLVAAW SNNYAVDCPQ HCDSSECKSS 

RGETCYRTVS GMDGMKCGPG LRCQPSNGED PFGEEFGICK 
ICDRGTGKCL KFPFFQYSVT KSSNRFVSLT EHDMASGDGN 
LNPR 



Seq ID NO: 86 Nucleotide sequence: 
Nucleic Acid Accession #: D86983 

Coding sequence: 52-4491 (underlined sequences correspond to start and stop 

1 11 21 31 41 51 

I I I I I I 

AGCCGGCCGT GGTGGCTCCG TGCGTCCGAG CGTCCGTCCG CGCCGTCGGC CATGGCCAAG 60 

CGCTCCAGGG GCCCCGGGCG CCGCTGCCTG TTGGCGCTCG TGCTGTTCTG CGCCTGGGGG 120 

ACGCTGGCCG TGGTGGCCCA GAAGCCGGGC GCAGGGTGTC CGAGCCGCTG CCTGTGCTTC 180 

CGCACCACCG TGCGCTGCAT GCATCTGCTG CTGGAGGCCG TGCCCGCCGT GGCGCCGCAG 240 

ACCTCCATCC TAGATCTTCG CTTTAACAGA ATCAGAGAGA TCCAACCTGG GGCATTCAGG 300 

CGG CTGAGG A ACTTGAACAC ATTGCTTCTC AATAATAATC AGATCAAGAG GATACCTAGT 360 

GGAGCATTTG AAGACTTGGA AAATTTAAAA TATCTCTATC TGTACAAGAA TGAGATCCAG 420 

TCAATTGACA GGCAAGCATT TAAGGGACTT GCCTCTCTAG AGCAACTATA CCTGCACTTT 480 

AATCAGATAG AAACTTTGGA CCCAGATTCG TTCCAGCATC TCCCGAAGCT CGAGAGGCTA 540 

TTTTTGCATA ACAACCGGAT TACACATTTA GTTCCAGGGA CATTTAATCA CTTGGAATCT 600 

ATGAAGAGAT TGCGACTGGA CTCAAACACA CTTCACTGCG ACTGTGAAAT CCTGTGGTTG 660 

GCGGATTTGC TGAAAACCTA CGCGGAGTCG GGGAACGCGC AGGCAGCGGC CATCTGTGAA 720 

TATCCCAGAC GCATCCAGGG ACGCTCAGTG GCAACCATCA CCCCGGAAGA GCTGAACTGT 780 

GAAAGGCCCC GGATCACCTC CGAGCCCCAG GACGCAGATG TGACCTCGGG GAACACCGTG 840 

TACTTCACCT GCAGAGCCGA AGGCAACCCC AAGCCTGAGA TCATCTGGCT GCGAAACAAT 900 

AATGAGCTGA GCATGAAGAC AGATTCCCGC CTAAACTTGC TGGACGATGG GACCCTGATG 960 

ATCCAGAACA CACAGGAGAC AGACCAGGGT ATCTACCAGT GCATGGCAAA GAACGTGGCC 1020 

GGAGAGGTGA AGACGCAAGA GGTGACCCTC AGGTACTTCG GGTCTCCAGC TCGACCCACT 1080 

TTTGTAATCC AGCCACAGAA TACAGAGGTG CTGGTTGGGG AGAGCGTCAC GCTGGAGTGC 1140 

AGCGCCACAG GCCACCCCCC GCCGCGGATC TCCTGGACGA GAGGTGACCG CACACCCTTG 1200 

CCAGTTG AC C CGCGGGTGAA CATCACGCCT TCTGGCGGGC TTTACATACA GAACGTCGTA 1260 

CAGGGGGACA GCGGAGAGTA TGCGTGCTCT GCGACCAACA ACATTGACAG CGTCCATGCC 1320 

ACCGCTTTCA TCATCGTCCA GGCTCTTCCT CAGTTCACTG TGACGCCTCA GGACAGAGTC 1380 

GTTATTGAGG GCCAGACCGT GGATTTCCAG TGTGAAGCCA AGGGCAACCC GCCGCCCGTC 1440 

ATCGCCTGGA CCAAGGGAGG GAGCCAGCTC TCCGTGGACC GGCGGCACCT GGTCCTGTCA 1500 

TCGGGAACAC TTAGAATCTC TGGTGTTGCC CTCCACGACC AGGGCCAGTA CGAATGCCAG 1560 

GCTGTCAACA TCATCGGCTC CCAGAAGGTC GTGGCCCACC TGACTGTGCA GCCCAGAGTC 1620 

ACCCCAGTGT TTGCCAGCAT TCCCAGCGAC ACAACAGTGG AGGTGGGCGC CAATGTGCAG 1680 

CTCCCGTGCA GCTCCCAGGG CGAGCCCGAG CCAGCCATCA CCTGGAACAA GGATGGGGTT 1740 

CAGGTGACAG AAAGTGGAAA ATTTCACATC AGCCCTGAAG GATTCTTGAC CATCAATGAC 1800 

GTTGGCCCTG CAGACGCAGG TCGCTATGAG TGTGTGGCCC GGAACACCAT TGGGTCGGCC 1860 

TCGGTGAGCA TGGTGCTCAG TGTGAACGTT CCTGACGTCA GTCGAAATGG AGATCCGTTT 1920 

GTAGCTACCT CCATCGTGGA AGCGATTGCG ACTGTTGACA GAGCTATAAA CTCAACCCGA 1980 

ACACATTTGT TTGACAGCCG TCCTCGTTCT CCAAATGATT TGCTGGCCTT GTTCCGGTAT 2040 

CCGAGGGATC CTTACACAGT TGAACAGGCA CGGGCGGGAG AAATCTTTGA ACGGACATTG 2100 

CAGCTCATTC AGGAGCATGT ACAGCATGGC TTGATGGTCG ACCTCAACGG AACAAGTTAC 2160 

CACTACAACG ACCTGGTGTC TCCACAGTAC CTGAACCTCA TCGCAAACCT GTCGGGCTGT 2220 

ACCGCCCACC GGCGCGTGAA CAACTGCTCG GACATGTGCT TCCACCAGAA GTACCGGACG 2280 

CACGACGGCA CCTGTAACAA CCTGCAGCAC CCCATGTGGG GCGCCTCGCT GACCGCCTTC 2340 

GAGCGCCTGC TGAAATCCGT GTACGAGAAT GGCTTCAACA CCCCTCGGGG CATCAACCCC 2400 

CACCGACTGT ACAACGGGCA CGCCCTTCCC ATGCCGCGCC TGGTGTCCAC CACCCTGATC 2460 

GGGACGGAGA CCGTCACACC CGACGAGCAG TTCACCCACA TGCTGATGCA GTGGGGCCAG 2520 



41 51 
I I 

PRC KRTVLDD CGCCRVCAAG 60 
DCPYGTFGMD CRETCNCQSG 120 
IVREEWKEN AAGSPVMRKW 180 
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TTCCTGGACC ACGACCTCGA CTCCACGGTG GTGGCCCTGA GCCAGGCACG CTTCTCCGAC 2580 

GGACAGCACT GCAGCAACGT GTGCAGCAAC GACCCCCCCT GCTTCTCTGT CATGATCCCC 2640 

CCCAATGACT CCCGGGCCAG GAGCGGGGCC CGCTGCATGT TCTTCGTGCG CTCCAGCCCT 2700 

GTGTGCGGCA GCGGCATGAC TTCGCTGCTC ATGAACTCCG TGTACCCGCG GGAGCAGATC 2760 

5 AACCAGCTCA CCTCCTACAT CGACGCATCC AACGTGTACG GGAGCACGGA GCATGAGGCC 2820 

CGCAGCATCC GCGACCTGGC CAGCCACCGC GGCCTGCTGC GGCAGGGCAT CGTGCAGCGG 2880 

TCCGGGAAGC CGCTGCTCCC CTTCGCCACC GGGCCGCCCA CGGAGTGCAT GCGGGACGAG 2940 

AACGAGAGCC CCATCCCCTG CTTCCTGGCC GGGGACCACC GCGCCAACGA GCAGCTGGGC 3000 

CTGACCAGCA TGCACACGCT GTGGTTCCGC GAGCACAACC GCATTGCCAC GGAGCTGCTC 3060 

10 AAGCTGAACC CGCACTGGGA CGGCGACACC ATCTACTATG AGACCAGGAA GATCGTGGGT 3120 

GCGGAGATCC AGCACATCAC CTACCAGCAC TGGCTCCCGA AGATCCTGGG GGAGGTGGGC 3180 

ATGAGGACGC TGGGAGAGTA CCACGGCTAC GACCCCGGCA TCAATGCTGG CATCTTCAAC 3240 

GCCTTCGCCA CCGCGGCCTT CAGGTTTGGC CACACGCTTG TCAACCCACT GCTTTACCGG 3300 

CTGGACGAGA ACTTCCAGCC CATTGCACAA GATCACCTCC CCCTTCACAA AGCTTTCTTC 3360 

15 TCTCCCTTCC GGATTGTGAA TGAGGGCGGC ATCGATCCGC TTCTCAGGGG GCTGTTCGGG 3420 

GTGGCGGGGA AAATGCGTGT GCCCTCGCAG CTGCTGAACA CGGAGCTCAC GGAGCGGCTG 3480 

TTCTCCATGG CACACACGGT GGCTCTGGAC CTGGCGGCCA TCAACATCCA GCGGGGCCGG 3 540 

GACCACGGGA TCCCACCCTA CCACGACTAC AGGGTCTACT GCAATCTATC GGCGGCACAC 3 600 

ACGTTCGAGG ACCTGAAAAA TGAGATTAAA AACCCTGAGA TCCGGGAGAA ACTGAAAAGG 3660 

20 TTGTATGGCT CGACACTCAA CATCGACCTG TTTCCGGCGC TCGTGGTGGA GGACCTGGTG 372 0 

CCTGGCAGCC GGCTGGGCCC CACCCTGATG TGTCTTCTCA GCACACAGTT CAAGCGCCTG 3 780 

CGAGATGGGG ACAGGTTGTG GTATGAGAAC CCTGGGGTGT TCTCCCCGGC CCAGCTGACT 3840 

CAGATCAAGC AGACGTCGCT GGCCAGGATC CTATGCGACA ACGCGGACAA CATCACCCGG 3 900 

GTGCAGAGCG ACGTGTTCAG GGTGGCGGAG TTCCCTCACG GCTACGGCAG CTGTGACGAG 3960 

25 ATCCCCAGGG TGGACCTCCG GGTGTGGCAG GACTGCTGTG AAGACTGTAG GACCAGGGGG 4 020 

CAGTTCAATG CCTTTTCCTA TCATTTCCGA GGCAGACGGT CTCTTGAGTT CAGCTACCAG 4 080 

GAGGACAAGC CGACCAAGAA AACAAGACCA CGGAAAATAC CCAGTGTTGG GAGACAGGGG 4140 

GAACATCTCA GCAACAGCAC CTCAGCCTTC AGCACACGCT CAGATGCATC TGGGACAAAT 4200 

GACTTCAGAG AGTTTGTTCT GGAAATGCAG AAGACCATCA CAGACCTCAG AACACAGATA 4260 

30 AAGAAACTTG AATCACGGCT CAGTACCACA GAGTGCGTGG ATGCCGGGGG CGAATCTCAC 4320 

GCCAACAACA CCAAGTGGAA AAAAGATGCA TGCACCATTT GTGAATGCAA AGACGGGCAG 4380 

GTCACCTGCT TCGTGGAAGC TTGCCCCCCT GCCACCTGTG CTGTCCCCGT GAACATCCCA 4440 

GGGGCCTGCT GTCCAGTCTG CTTACAGAAG AGGGCGGAGG AAAAGCCC TA G GCTCCTGGG 4500 

AGGCTCCTCA GAGTTTGTCT GCTGTGCCAT CGTGAGATCG GGTGGCCGAT GGCAGGGAGC 4560 

35 TGCGGACTGC AGACCAGGAA ACACCCAGAA CTCGTGACAT TTCATGACAA CGTCCAGCTG 4620 

GTG CTGTTAC AGAAGGCAGT GCAGGAGGCT TCCAACCAGA GCATCTGCGG AGAAGGAGGC 4680 

ACAGCAGGTG CCTGAAGGGA AGCAGGCAGG AGTCCTAGCT TCACGTTAGA CTTCTCAGGT 4 740 

TTTTATTTAA TTCTTTTAAA ATGAAAAATT GGTGCTACTA TTAAATTGCA CAGTTGAATC 4800 

ATTTAGGCGC CTAAATTGGT TTTGCCTCCC AACACCATTT CTTTTTAAAT AAAGCAGGAT 4860 

40 ACCTCTATAT GTCAGCCTTG CCTTGTTCAG ATGCCAGGAG CCGGCAGACC TGTCACCCGC 4 920 

AGGTGGGGTG AGTCTCGGAG CTGCCAGAGG GGCTCACCGA AATCGGGGTT CCATCACAAG 4 980 

CTATGTTTAA AAAGAAAATT GGTGTTTGGC AAACGGAACA GAACCTTTGA TGAGAGCGTT 5040 

CACAGGGACA CTGTCTGGGG GTGCAGTGCA AGCCCCCGGC CTCTTCCCTG GGAACCTCTG 5100 

AACTCCTCCT TCCTCTGGGC TCTCTGTAAC ATTTCACCAC ACGTCAGCAT CTAATCCCAA 5160 

45 GACAAACATT CCCGCTGCTC GAAGCAGCTG TATAGCCTGT GACTCTCCGT GTGTCAGCTC 5220 

CTTCCACACC TGATTAGAAC ATTCATAAGC CACATTTAGA AACAGATTTG CTTTCAGCTG 5280 

TCACTTGCAC ACATACTGCC TAGTTGTGAA C C AAATGTG A AAAAACCTCC TTCATCCCAT 534 0 

TGTGTATCTG ATACCTGCCG AGGGCCAAGG GTGTGTGTTG ACAACGCCGC TCCCAGCCGG 5400 

CCCTGGTTGC GTCCACGTCC TGAACAAGAG CCGCTTCCGG ATGGCTCTTC CCAAGGGAGG 54 60 

50 AGGAGCTCAA GTGTCGGGAA CTGTCTAACT TCAGGTTGTG TGAGTGCGTT 

Seq ID NO: 87 Protein sequence: 
Protein Accession #: BAA13219 
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SRPWWLRASE RPSAPSAMAK RSRGPGRRCL LALVLFCAWG TLAWAQKPG AGCPSRCLCF 60 

RTTVRCMHLL LEAVPAVAPQ TSILDLRFNR IREIQPGAFR RLRNLNTLLL NNNQIKRIPS 120 

GAFEDLENLK YLYLYKNEIQ SIDRQAFKGL ASLEQLYLHF NQIETLDPDS FQHLPKLERL 180 

FLHNNRITHL VPGTFNHLES MKRLRLDSNT LHCDCEILWL ADLLKTYAES GNAQAAAICE 240 

60 YPRRIQGRSV ATITPEELNC ERPRITSEPQ DADVTSGNTV YFTCRAEGNP KPEIIWLRNN 300 

NELSMKTDSR LNLLDDGTLM IQNTQETDQG IYQCMAKNVA GEVKTQEVTL RYFGSPARPT 360 

FVIQPQNTEV LVGESVTLEC SATGHPPPRI SWTRGDRTPIj PVDPRVNITP SGGLYIQNW 420 

QGDSGEYACS ATNNIDSVHA TAFIIVQALP QFTVTPQDRV VIEGQTVDFQ CEAKGNPPPV 4 80 

IAWTKGGSQL SVDRRHLVLS SGTLRISGVA LHDQGQYECQ AVNIIGSQKV VAHLTVQPRV 54 0 

65 TPVFASIPSD TTVEVGANVQ LPCSSQGEPE PAITWNKDGV QVTESGKFHI SPEGFLTIND 600 

VGPADAGRYE CVARNTIGSA SVSMVLSVNV PDVSRNGDPF VATSIVEAIA TVDRAINSTR 660 

THLFDSRPRS PNDLLALFRY PRDPYTVEQA RAGEIFERTL QLIQEHVQHG LMVDLNGTSY 720 

HYNDLVSPQY LNLIANLSGC TAHRRVNNCS DMCFHQKYRT HDGTCNNLQH PMWGASLTAF 780 

ERLLKSVYEN GFNTPRGINP HRLYNGHALP MPRLVSTTLI GTETVTPDEQ FTHMLMQWGQ 840 

70 FLDHDLDSTV VALSQARFSD GQHCSNVCSN DPPCFSVMIP PNDSRARSGA RCMFFVRSSP 900 

VCGSGMTSLL MNSVYPREQI NQLTSYIDAS NVYGSTEHEA RSIRDLASHR GLLRQGIVQR 960 

SGKPLLPFAT GPPTECMRDE NESPIPCFLA GDHRANEQLG LTSMHTLWFR EHNRIATELL 1020 

KLNPHWDGDT IYYETRKIVG AEIQHITYQH WLPKILGEVG MRTLGEYHGY DPGINAGIFN 1080 

AFATAAFRFG HTLVNPIjLYR LDENFQPIAQ DHLPLHKAFF SPFRIVNEGG IDPLLRGLFG 1140 

75 VAGKMRVPSQ LLNTELTERL FSMAHTVALD LAAINIQRGR DHGIPPYHDY RVYCNLSAAH 1200 

TFEDLKNEIK NPEIREKLKR LYGSTLNIDL FPALWEDLV PGSRLGPTLM CLLSTQFKRL 1260 
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RDGDRLWYEN PGVFSPAQLT QIKQTSLARI LCDNADNITR VQSDVFRVAE FPHGYGSCDE 1320 

IPRVDLRVWQ DCCEDCRTRG QFNAFSYHFR GRRSLEFSYQ EDKPTKKTRP RKIPSVGRQG 1380 

EHLSNSTSAF STRSDASGTN DFREFVLEMQ KTITDLRTQI KKLESRLSTT ECVDAGGESH 1440 
ANNTKWKKDA CTICECKDGQ VTCFVEACPP ATCAVPVNIP GACCPVCLQK RAEEKP 



Seq ID NO: 88 DNA sequence 

Nucleic Acid Accession #: NM_.004834.1 

Coding sequence: 80-3577 (underlined sequences correspond to start and stop codons) 
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AATTCGAGGA TCCGGGTACC ATGGCACAGA GCGACAGAGA CATTTATTGT TATTTGTTTT 60 

15 TTGGTGGCAA AAAGGGAA AA TG GCGAACGA CTCCCCTGCA AAAAGTCTGG TGGACATCGA 120 

CCTCTCCTCC CTGCGGGATC CTGCTGGGAT TTTTGAGCTG GTGGAAGTGG TTGGAAATGG 180 

CACCTATGGA CAAGTCTATA AGGGTCGACA TGTTAAAACG GGTCAGTTGG CAGCCATCAA 240 

AGTTATGGAT GTCACTGAGG ATGAAGAGGA AGAAATCAAA CTGGAGATAA ATATGCTAAA 300 

GAAATACTCT CATCACAGAA ACATTGCAAC ATATTATGGT GCTTTCATCA AAAAGAGCCC 360 

20 TCCAGGACAT GATGACCAAC TCTGGCTTGT TATGGAGTTC TGTGGGGCTG GGTCCATTAC 420 

AGACCTTGTG AAGAACACCA AAGGGAACAC ACTCAAAGAA GACTGGATCG CTTACATCTC 480 

CAGAGAAATC CTGAGGGGAC TGGCACATCT TCACATTCAT CATGTGATTC ACCGGGATAT 540 

CAAGGGCCAG AATGTGTTGC TGACTGAGAA TGCAGAGGTG AAACTTGTTG ACTTTGGTGT 600 

GAGTGCTCAG CTGGACAGGA CTGTGGGGCG GAGAAATACG TTCATAGGCA CTCCCTACTG 660 

25 GATGGCTCCT GAGGTCATCG CCTGTGATGA GAACCCAGAT GCCACCTATG ATTACAGAAG 720 

TGATCTTTGG TCTTGTGGCA TTACAGCCAT TGAGATGGCA GAAGGTGCTC CCCCTCTCTG 780 

TGACATGCAT CCAATGAGAG CACTGTTTCT CATTCCCAGA AACCCTCCTC CCCGGCTGAA 84 0 

GTCAAAAAAA TGGTCGAAGA AGTTTTTTAG TTTTATAGAA GGGTGCCTGG TGAAGAATTA 900 

CATGCAGCGG CCCTCTACAG AGCAGCTTTT GAAACATCCT TTTATAAGGG ATCAGCCAAA 960 

30 TGAAAGGCAA GTTAGAATCC AGCTTAAGGA TCATATAGAT CGTACCAGGA AGAAGAGAGG 1020 

CGAGAAAGAT GAAACTGAGT ATGAGTACAG TGGGAGTGAG GAAGAAGAGG AGGAAGTGCC 1080 

TGAACAGGAA GGAGAGCCAA GTTCCATTGT GAACGTGCCT GGTGAGTCTA CTCTTCGCCG 114 0 

AGATTTCCTG AGACTGCAGC AGGAGAACAA GGAACGTTCC GAGGCTCTTC GGAGACAACA 1200 

GTTACTACAG GAGCAACAGC TCCGGGAGCA GGAAGAATAT AAAAGGCAAC TGCTGGCAGA 1260 

35 GAGACAGAAG CGGATTGAGC AGCAGAAAGA ACAGAGGCGA CGGCTAGAAG AGCAACAAAG 1320 

GAGAGAGCGG GAGGCTAGAA GGCAGCAGGA ACGTGAACAG CGAAGGAGAG AACAAGAAGA 1380 

AAAGAGGCGT CTAGAGGAGT TGGAGAGAAG GCGCAAAGAA GAAGAGGAGA GGAGACGGGC 1440 

AGAAGAAGAA AAGAGGAGAG TTGAAAGAGA ACAGGAGTAT ATCAGGCGAC AG CTAGAAG A 1500 

GGAGCAGCGG CACTTGGAAG TCCTTCAGCA GCAGCTGCTC CAGGAGCAGG CCATGTTACT 1560 

40 GCATGACCAT AGGAGGCCGC ACCCGCAGCA CTCGCAGCAG CCGCCACCAC CGCAGCAGGA 1620 

AAGGAGCAAG CCAAGCTTCC ATGCTCCCGA GCCCAAAGCC CACTACGAGC CTGCTGACCG 16 80 

AGCGCGAGAG GTTCCTGTGA GAACAACATC TCGCTCCCCT GTTCTGTCCC GTCGAGATTC 1740 

CCCACTGCAG GGCAGTGGGC AGCAGAATAG CCAGGCAGGA CAGAGAAACT CCACCAGTAT 1800 

TGAGCCCAGG CTTCTGTGGG AGAGAGTGGA GAAGCTGGTG CCCAGACCTG GCAGTGGCAG 1860 

45 CTCCTCAGGG TCCAGCAACT CAGGATCCCA GCCCGGGTCT CACCCTGGGT CTCAGAGTGG 1920 

CTCCGGGGAA CGCTTCAGAG TGAGATCATC ATCCAAGTCT GAAGGCTCTC CATCTCAGCG 1980 

CCTGGAAAAT GCAGTGAAAA AACCTGAAGA TAAAAAGGAA GTTTTCAGAC CCCTCAAGCC 204 0 

TGCTGGCGAA GTGGATCTGA CCGCACTGGC CAAAGAGCTT CGAGCAGTGG AAGATGTACG 2100 

GCCACCTCAC AAAGTAACGG ACTACTCCTC ATCCAGTGAG GAGTCGGGGA CGACGGATGA 2160 

50 GGAGGACGAC GATGTGGAGC AGGAAGGGGC TGACGAGTCC ACCTCAGGAC CAGAGGACAC 2220 

CAGAGCAGCG TCATCTCTGA ATTTGAGCAA TGGTGAAACG GAATCTGTGA AAACCATGAT 2280 

TGTCCATGAT GATGTAGAAA GTGAGCCGGC CATGACCCCA TCCAAGGAGG GCACTCTAAT 234 0 

CGTCCGCCAG ACTCAGTCCG CTAGTAGCAC ACTCCAGAAA CACAAATCTT CCTCCTCCTT 24 00 

TACACCTTTT ATAGACCCCA GATTACTACA GATTTCTCCA TCTAGCGGAA CAACAGTGAC 246 0 

55 ATCTGTGGTG GGATTTTCCT GTGATGGGAT GAGACCAGAA GCCATAAGGC AAGATCCTAC 252 0 

CCGGAAAGGC TCAGTGGTCA ATGTGAATCC TACCAACACT AGGCCACAGA GTGACACCCC 25 80 

GGAGATTCGT AAATACAAGA AGAGGTTTAA CTCTGAGATT CTGTGTGCTG CCTTATGGGG 2640 

AGTGAATTTG CTAGTGGGTA CAGAGAGTGG CCTGATGCTG CTGGACAGAA GTGGCCAAGG 2700 

GAAGGTCTAT CCTCTTATCA ACCGAAGACG ATTTCAACAA ATGGACGTAC TTGAGGGCTT 2760 

60 • GAATGTCTTG GTGACAATAT CTGGCAAAAA GGATAAGTTA CGTGTCTACT ATTTGTCCTG 2820 

GTTAAGAAAT AAAATACTTC ACAATGATCC AGAAGTTGAG AAGAAGCAGG GATGGACAAC 28 80 

CGTAGGGGAT TTGGAAGGAT GTGTACATTA TAAAGTTGTA AAATATGAAA GAATCAAATT 2940 

TCTGGTGATT GCTTTGAAGA GTTCTGTGGA AGTCTATGCG TGGGCACCAA AGCCATATCA 3000 

CAAATTTATG GCCTTTAAGT CATTTGGAGA ATTGGTACAT AAGCCATTAC TGGTGGATCT 3060 

65 CACTGTTGAG GAAGGCCAGA GGTTGAAAGT GATCTATGGA TCCTGTGCTG GATTCCATGC 3120 

TGTTGATGTG GATTCAGGAT CAGTCTATGA CATTTATCTA CCAACACATG TAAGAAAGAA 3180 

CCCACACTCT ATGATCCAGT GTAGCATCAA ACCCCATGCA ATCATCATCC TCCCCAATAC 3240 

AGATGGAATG GAGCTTCTGG TGTGCTATGA AGATGAGGGG GTTTATGTAA ACACATATGG 33 00 

AAGGATCACC AAGGATGTAG TTCTACAGTG GGGAGAGATG CCTACATCAG TAGCATATAT 3360 

70 TCGATCCAAT CAGACAATGG GCTGGGGAGA GAAGGCCATA GAGATCCGAT CTGTGGAAAC 3420 

TGGTCACTTG GATGGTGTGT TCATGCACAA AAGGGCTCAA AGACTAAAAT TCTTGTGTGA 34 80 

ACG CAATG AC AAGGTGTTCT TTGCCTCTGT TCGGTCTGGT GGCAGCAGTC AGGTTTATTT 354 0 

CATGACCTTA GGCAGGACTT CTCTTC TGA G CTGGTAGAAG CAGTGTGATC CAGGGATTAC 3600 

TGGCCTCCAG AGTCTTCAAG ATCCTGAGAA CTTGGAATTC CTTGTAACTG GAGCTCGGAG 3660 

75 CTGCACCGAG GGCAACCAGG ACAGCTGTGT GTGCAGACCT CATGTGTTGG GTTCTCTCCC 3720 

CTCCTTCCTG TTCCTCTTAT ATACCAGTTT ATCCCCATTC TTTTTTTTTT TCTTACTCCA 3 780 
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AAATAAATCA AGGCTGCAAT GCAGCTGGTG CTGTTCAGAT TCCAAAAAAA AAAAAAAACC 3840 
ATGGTACCCG GATCCTCGAA TTCC 



Seq ID No: 89 Protein sequence: 
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MANDSPAKSL VDIDLSSLRD PAGIFELVEV VGNGTYGQVY KGRHVKTGQL AAIKVMDVTE 60 

DEEEEIKLEI NMLKKYSHHR NIATYYGAFI KKSPPGHDDQ LWLVMEFCGA GSITDLVKNT 120 

KGNTLKEDWI AYISREILRG LAHLHIHHVI HRDIKGQNVL LTENAEVKLV DFGVSAQLDR 180 

TVGRRNTFIG TPYWMAPEVI ACDENPDATY DYRSDLWSCG ITAIEMAEGA PPLCDMHPMR 240 

ALFLIPRNPP PRLKSKKWSK KFFSFIEGCL VKNYMQRPST EQLLKHPFIR DQPNERQVRI 300 

QLKDHIDRTR KKRGEKDETE YEYSGSEEEE EEVPEQEGEP SSIVNVPGES TLRRDFLRLQ 360 

QENKERSEAL RRQQLLQEQQ. LREQEEYKRQ LLAERQKRIE QQKEQRRRLE EQQRREREAR 420 

RQQEREQRRR EQEEKRRLEE LERRRKEEEE RRRAEEEKRR VEREQEYIRR QLEEEQRHLE 480 

VLQQQLLQEQ AMLLHDHRRP HPQHSQQPPP PQQERSKPSF HAPEPKAHYE PADRAREVPV 54 0 

RTTSRSPVLS RRDSPLQGSG QQNSQAGQRN STSIEPRLLW ERVEKLVPRP GSGSSSGSSN 600 

SGSQPGSHPG SQSGSGERFR VRSSSKSEGS PSQRLENAVK KPEDKKEVFR PLKPAGEVDL 660 

TALAKELRAV EDVRPPHKVT DYSSSSEESG TTDEEDDDVE QEGADESTSG PEDTRAASSL 720 

NLSNGETESV KTMIVHDDVE SEPAMTPSKE GTLIVRQTQS ASSTLQKHKS SSSFTPFIDP 780 

RLLQISPSSG TTVTSWGFS CDGMRPEAIR QDPTRKGSW NVNPTNTRPQ SDTPEIRKYK 840 

KRFNSEILCA ALWGVNLLVG TESGLMLLDR SGQGKVYPLI NRRRFQQMDV LEGLNVLVTI 900 

SGKKDKLRVY YLSWLRNKIL HNDPEVEKKQ GWTTVGDLEG CVHYKWKYE RIKFLVIALK 960 

SSVEVYAWAP KPYHKFMAFK SFGELVHKPL ItVDLTVEEGQ RLKVIYGSCA GFHAVDVDSG 1020 

SVYDIYLPTH VRKNPHSMIQ CSIKPHAIII LPNTDGMELL VCYEDEGVYV NTYGRITKDV 1080 

VLQWGEMPTS VAYIRSNQTM GWGEKAIEIR SVETGHLDGV FMHKRAQRLK FLCERNDKVF 1140 
FASVRSGGSS QVYFMTLGRT SLLSW 
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Nucleic Acid Accession # : none found 
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TTACACTTCA ATTCCTTACA CGGTATTTCA AACAAACAGT TTTGCTGAGA GGAGCTTTTG 60 

TCTCTCCT TA AG AAAATGTT TATAAAGCTG AAAGGAAATC AAACAGTAAT CTTAAAAATG 120 

AAAACAAAAC AACCCAACAA CCTAGATAAC TACAGTGATC AGGGAGCACA GTTCAACTCC 180 

TTGTTATGTT TTAGTCATAT GGCCTACTCA AACAGCTAAA TAACAACACC AGTGGCAGAT 240 

AAAAATCACC ATTTATCTTT CAGCTATTAA TCTTTTGAAT GAATAAACTG TGACAAACAA 300 

ATTAACATTT TTGAACATGA AAGGCAACTT CTGCACAATC CTGTATCCAA GCAAACTTTA 360 

AATTATCCAC TTAATTATTA CTTAATCTTA AAAAAAATTA GAACCCAGAA CTTTTCAATG 420 

AAGCATTTGA AAGTTGAAGT GGAATTTAGG AAAGCCATAA AAATATAAAT ACTGTTATCA 480 

CAGCACCAGC AAGCCATAAT CTTTATACCT ATCAGTTCTA TTTCTATTAA CAGTAAAAAC 54 0 

ATTAAGCAAG ATATAAGACT ACCTGCCCAA GAATTCAGTC TTTTTTCATT TTTGTTTTTC 600 

TCAGTTCTGA GGATGTTAAT CGTCAAATTT TCTTTGGACT GCATTCCTCA CTACTTTTTG 660 

CACAATGGTC TCACGTTCTC ACATTTGTTC TCGCGAATAA ATTGATAAAA GGTGTTAAGT 720 

TCTGTGAATG TCTTTTTAAT TATGGGCATA ATTGTGCTTG ACTGGATAAA AACTTAAGTC 780 

CACCCTTATG TTTATAATAA TTTCTTGAGA ACAGCAAACT GCATTTACCA TCGTAAAACA 840 

ACATCTGACT TACGGGAGCT GCAGGGAAGT GGTGAGACAG TTCGAACGGC TCCTCAGAAA 900 

TCCAGTGACC CAATT CT AAA GACCATAGCA CCTGCAAGTG ACACAACAAG CAGATTTATT 960 

ATACATTTAT TAGCCTTAG C AGGCAATAAA CCAAGAATCA CTTTGAAGAC ACAGCAAAAA 1020 

GTGATACACT CCGCAGATCT GAAATAGATG TGTTCTCAGA CAACAAAGTC CCTTCAGAAT 1080 
CTTCATGTTG CATAAATGTT ATGAATATTA ATAAAAAGTT GATTGAGA 



Seq ID No: 91 Protein sequence: 
Protein Accession #: none found 
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YTSIPYTVFQ TNSFAERSFC LSL 



Seq ID NO: 92 DNA sequence 

Nucleic Acid Accession #: NM_003 706.1 

Coding sequence: 310-1935 (underlined sequences correspond to start and stop codons) 
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CACGAGGCAG GGGCCATTTT ACCTCCAGGT TGGCCCTGCT CAGGACCAGG AGGAAACACC 
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TCCAGCCCGC GACCTCCTCC CACAGGGGGA AAAGGAAAGC AGGAGGACCA CAGAAGCTTT 120 

GGCACCGAGG ATCCCCGCAG TCTTCACCCG CGGAGATTCC GGCTGAAGGA GCTGTCCAGC 180 

GACTACACCG CTAAGCGCAG GGAGCCCAAG CCTCCGCACC GGATTCCGGA GCACAAGCTC 240 

CACCGCGCAT GCGCACACGC CCCAGACCCA GGCTCAGGAG GACTGAGAAT TTTCTGACCG 300 

CAGTGCACC A TG GGAAGCTC TGAAGTTTCC ATAATTCCTG GGCTCCAGAA AGAAGAAAAG 360 

GCGGCCGTGG AGAGACGAAG ACTTCATGTG CTGAAAGCTC TGAAGAAGCT AAGGATTGAG 420 

GCTGATGAGG CCCCAGTTGT TGCTGTGCTG GGCTCAGGCG GAGGACTGCG GGCTCACATT 480 

GCCTGCCTTG GGGTCCTGAG TGAGATGAAA GAACAGGGCC TGTTGGATGC CGTCACGTAC 540 

CTCGCAGGGG TCTCTGGATC CACTTGGGCA ATATCTTCTC TCTACACCAA TGATGGTGAC 600 

ATGGAAGCTC TCGAGGCTGA CCTGAAACAT CGATTTACCC GACAGGAGTG GGACTTGGCT 660 

AAGAGCCTAC AGAAAACCAT CCAAGCAGCG AGGTCTGAGA ATTACTCTCT GACCGACTTC 720 

TGGGCCTACA TGGTTATCTC TAAGCAAACC AGAGAACTGC CGGAGTCTCA TTTGTCCAAT 780 

ATGAAGAAGC CCGTGGAAGA AGGGACACTA CCCTACCCAA TATTTGCAGC CATTGACAAT 840 

GACCTGCAAC CTTCCTGGCA GGAGGCAAGA GCACCAGAGA CCTGGTTCGA GTTCACCCCT 900 

CACCACGCTG GCTTCTCTGC ACTGGGGGCC TTTGTTTCCA TAACCCACTT CGGAAGCAAA 960 

TTCAAGAAGG GAAGACTGGT CAGAACTCAC CCTGAGAGAG ACCTGACTTT CCTGAGAGGT 1020 

TTATGGGGAA GTGCTCTTGG TAACACTGAA GTCATTAGGG AATACATTTT TGACCAGTTA 1080 

AGGAATCTGA CCCTGAAAGG TTTATGGAGA AGGGCTGTTG CTAATGCTAA AAGCATTGGA 1140 

CACCTTATTT TTGCCCGATT ACTGAGGCTG CAAGAAAGTT CACAAGGGGA ACATCCTCCC 1200 

CCAGAAGATG AAGGCGGTGA GCCTGAACAC ACCTGGCTGA CTGAGATGCT CGAGAATTGG 1260 

ACCAGGACCT CCCTGGAAAA GCAGGAGCAG CCCCATGAGG ACCCCGAAAG GAAAGGCTCA 1320 

CTCAGTAACT TGATGGATTT TGTGAAGAAA ACAGGCATTT GCGCTTCAAA GTGGGAATGG 1380 

GGGACCACTC ACAACTTCCT GTACAAACAC GGTGGCATCC GGGACAAGAT AATGAGCAGC 144 0 

CGGAAGCACC TCCACCTGGT GGATGCTGGT TTAGCCATCA ACACTCCCTT CCCACTCGTG 1500 

CTGCCCCCGA CGCGGGAGGT TCACCTCATC CTCTCCTTCG ACTTCAGTGC CGGAGATCCT 1560 

TTCGAGACCA TCCGGGCTAC CACTGACTAC TGCCGCCGCC AC AAGATC CC CTTTCCCCAA 1620 

GTAGAAGAGG CTGAGCTGGA TTTGTGGTCC AAGGCCCCCG CCAGCTGCTA CATCCTGAAA 1680 

GGAGAAACTG GACCAGTGGT GATACATTTT CCCCTGTTCA ACATAGATGC CTGTGGAGGT 174 0 

GATATTGAGG CATGGAGTGA CACATACGAC ACATTCAAGC TTGCTGACAC CTACACTCTA 1800 

GATGTGGTGG TGCTACTCTT GGCATTAGCC AAGAAGAATG TCAGGGAAAA CAAGAAGAAG 1860 

ATCCTTAGAG AGTTGATGAA CGTGGCCGGG CTCTACTACC CGAAGGATAG TGCCCGAAGT 1920 

TGCTGCTTGG CATAGATGAG CCTCAGCTTC CAGGGCACTG TGGGCCTGTT GGTCTACTAG 1980 

GGCCCTGAAG TCCACCTGGC CTTCCTGTTC TTCACTCCCT TCAGCCACAC GCTTCATGGC 2040 

CTTGAGTTCA CCTTGGCTGT CCTAACAGGG CCAATCACCA GTGACCAGCT AGACTGTGAT 2100 

TTTGATAGCG TCATTCAGAA GAAGGTGTCC AAGGAGCTGA AGGTGGTGAA ATTTGTCCTG 2160 

CAGGTCCCTC GGGAGATCCT GGAGCTGGAG CATGAGTGTC TGACAATCAG AAGCATCATG 2220 

TCCAATGTCC AGATGGCCAG AATGAATGTG ATAGTTCAGA CCAATGCCTT CCACTGCTCC 22 80 

TTTATGACTG CACTTCTAGC CAGTAGCTCT GCACAAGTTA GCTCTGTAGA AGTAAGAACT 234 0 

TGGGCTTAAA TCATGGGCTA TCTCTCCACA GCCAAGTGGA GCTCTGAGAA TACAACAAGT 2400 

GCTCAATAAA TGCTTGCTGA TTGACTGATG AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA 2460 
AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAA 



Seq ID No: 93 Protein secruence: 
Protein Accession #: NP 003697.1 



1 11 21 31 41 51 

I I I I I I 

MGSSEVSIIP GLQKEEKAAV ERRRLHVLKA LKKLRIEADE APWAVLGSG GGLRAHIACL 6 0 

GVLSEMKEQG LLDAVTYLAG VSGSTWAISS LYTNDGDMEA LEADLKHRFT RQEWDLAKSL 12 0 

QKTIQAARSE NYSLTDFWAY MVTSKQTREL PESHLSNMKK PVEEGTLPYP IFAAIDNDLQ 18 0 

PSWQEARAPE TWFEFTPHHA GFSALGAFVS ITHFGSKFKK GRLVRTHPER DLTFLRGLWG 240 

SALGNTEVIR EYIFDQLRNL TLKGLWRRAV ANAKSIGHLI FARLLRLQES SQGEHPPPED 300 

EGGEPEHTWL TEMLENWTRT SLEKQEQPHE DPERKGSLSN LMDFVKKTGI CASKWEWGTT 360 

HNFLYKHGGI RDKIMSSRKH LHLVDAGLAI NTPFPLVLPP TREVHLILSF DFSAGDPFET 42 0 

IRATTDYCRR HKIPFPQVEE AELDLWSKAP ASCYILKGET GPWIHFPLF NIDACGGDIE 4 80 

AWSDTYDTFK LADTYTLDW VLLLALAKKN VRENKKKILR ELMNVAGLYY PKDSARSCCL ,54 0 
A 

Seq ID NO: 94 DNA sequence 
Nucleic Acid Accession #: AK027351 

Coding sequence: 1-642 {underlined sequences correspond to start and stop codons) 



l 11 21 31 41 . 51 

I I I I I I 

AGGGAAAAAA ACTCCATTAA AAAGCCCAGC TTTCCTCCAT GTTAGATGTG ACTTGGAAAA 60 

TGAGAAAGAT TTAGCAAAAT TCCACCGTAT CTTTTGCCAG GCTAGAGACA GGGAGAGCAG 120 

AGTAAAACCC TCAGGCTGCT GAAATTTCTA GGCTGTTAGG AAGCCCCTCG AATTCTGTGA ,180 

AAATGAGGGT TTCTTAACTC ACACTGAGAG CGGAAAGGGG CAGACCCTTT TCATAACTCC 240 

CTCAAGTGTG TGTTACCTTT CTTTACCAGC ATGGTAAGCA ACAGGACATA TCCCAGCCTC 300 

GGACATGTCT GTATGATCCA AGGTACCCAA AGTCAGACAG AGTAAACTCA AGCCTGGCAC 360 

TGGCTTTCTG CCGCTTCATG TGCTTTGGAA AAAGCAGGAG AAGCAATAGC AGCAGGAGTC 420 

CCCAGCAGCT GGAGCCGCAA GAATGAACTG CAAAGAGGGA ACTGACAGCA GCTGCGGCTG 480 

CAGGGGCAAC GACGAGAAGA AGATGTTGAA GTGTGTGGTG GTGGGGGACG GTGCCGTGGG 540 



224 



WO 02/079492 



PCT/US02/04915 



GAAAACCTGC CTGCTGATGA GCTACGCCAA 
TGTGTTTGAC CACTATGCAG TTACTGTGAC 
GTATGACACC GCGGGACAGG AGGACTACAA 
GGATGTGTTT TTGATCTGCT TCTCTGTCGT 
5 GGAATGGGTC CCCGAGCTCA AGGACTGCAT 
CCAGATTGAT CTCCGTGATG ACCCAAAAAC 
ACCTCTCACT TACGAGCATG GTGTGAAGCT 
GGAATGTTCA GCTCTGACTC AGAAAGGTCT 
CATTTTCCAC CCCAAGAAAA AGAAGAAACG 

10 TATCTGAGGT TGTCTGGGAC CTGCCTCCAC 
TCTGTGGCCA AGCTCCAGCC AAAAAGGAGG 
GAGGCTTGCC CCATCACCCT CTGAGCCCTC 
ACGACCTCCC TGCCAGCCAG AAGCATCCGT 
GATTGCAGAC AGTGCCGCTG CTGATCGCAT 

15 TTACAAATCC CCAGCTCATG AACGTGAAGC 
AAGAAACTTG ATTCCTCTAT TGCTGGCCTT 
CAATACTAAC CTTTTTTTCT GAATCTGCTG 
TTATTTCAAG AAATGTACTA ATTTCCAGTT 
AGCCTAAAGA CAAGGCATTT TATATTCATT 

20 TTAGAACCAA CACGTACCTC TGAATGCCCG 
GTTTTGGAAA TTTACAGAGC CATGATTTTT 
GTTGCAGGTC CACATTTTTG CCAAAGATAC 
TTTTTCCATG TATTGCCACG ACAAACTAAA 
TGTTTTTCAT CCAAGTTGAT TGGGGGAAGA 

25 TGTATTCAGT AAAGTGGACA TTCCTGCTCC 
• * CCTTGATTTC ACTTTCTCTC ATGCCCGGAT 
ATAAAAGAAA GATCTGAGCA TAAAGATACG 
TGCCTTAATT TTTCTATTTG CTTCAACTGA 
CTCATTCCAT GTAAATGACA TTTTCCAGTT 

30 TTCCTTACTC ATCCTCCCAA ATGTCTTTGT 
TAT CCATTTG TCCCCTGCCC TCCACAATGT 
AAAGCAAAAG CTCCCAGTAA GGAATCCTGT 
CAGGAATTTT TGTATCATAG AGCGAATTAC 
CTTCTAATTA GTCTTAGTTG CTTATAAGTG 

35 TTTTTTTCAG TTGCATGAGC AAAGTGCTTC 
ACTTTCCAGA TTCAAGCTCC CACTGTTGGA 
CTGGAATAAG CACTTAAGAA TTGCGTGATA 
CAACACTTAG GGAGGCTGAG GTGGGTGGGC 
GGGTAATATA GTGAGATCCT GTGTCTCTAT 

40 ACACATACCT GTAGTCCCAG CTACTCAGGA 
AAGGTAAGGC TGCAGTGAGC TGTGACTGTG 
AGAACCTGTC AAAAAAAAAA AAAAAACAAC 
CCATCTAATT GCTAAAGATT TTCTTTCATA 
AGTTTTCAGT CAAAG CTTG A CATTTAGAGA 

45 AAT CAACTGT GTATGAACTA TAACTCTGCA 
AATGAACTTT TAGTCCTGTA ATAAATGAAA 
ATAGTTATAT CTTGCTAACG GGCCACTCAT 
CAGTATGTTT CCAGGTGTGT GCACTCAACA 
TAATGGGCCA CAGGAAGTAA GTTGATCTTG 

50 ACTGGATAGA GACTGTTGTT AGTGTCTGGG 
CTAATTACTG AATAAAACAA TCTAGAACAA 



CGACGCCTTC CCAGAGGAAT ACGTGCCCAC 600 

TGTGGGAGGC AAGCAACACT TGCTCGGACT 660 

CCAGCTGAGG CCACTCTCCT ACCCCAACAC 720 

AAACCCTGCC TCTTACCACA ATGTCCAGGA 780 

GCCTCACGTG CCTTATGTCC TCATAGGGAC 840 

CTTGGCCCGT TTGCTGTATA TGAAAGAGAA 900 

CGCAAAAGCG ATCGGAGCAC AGTGCTACTT 960 

CAAAGCGGTT TTTGATGAAG CAATCCTCAC 1020 

CTGTTCTGAG GGTCACAGCT GCTGTTCAAT 1080 

CCCATCCAGG GATGAGAATG GCAGCCAATC 114 0 

GCACGACCAG AAAGGAACTC CCTTTGCACG 1200 

CCAACACAGC ACACTAGTCA GCCCACTGCC 1260 

ACTGCACGCT GTCTGAGAAT GCTGGGCCTG 1320 

CAAAAACAAA GTCAAAGGCC ATCTCACATT 1380 

TGATAGGAAA TCACCCCAGG GAACCCGAAA 1440 

ACTTGATGTC TTTTATAAAA CTTGGGACTA 1500 

TTCTACCCAT GTGTCTCACA TTCATTTGTA 1560 

CACTCAGGCC TTACTAATCC ATACCAAATT 1620 

TCTATTTTCA GCA TGTTTCT ACCAAAGCTA 1680 

ATTATAAGAA GACATGAGAA GACTTTAAAA 1740 

GAACCTAATT GAAAGAAAAC CATCTGAATT 1800 

ACTCTATAGA TGCTTAGTAG TGGCCTGATT 1860 

AATGAACTGT GTTTAAGAAT GTAGTATTTC 192 0 

ATATGGCAGG ATCCATCTTT TACAGTATTT 1980 

TCCCTTCCCC CATTGCATGC CCTCTTCCTC 2040 

CCTTTTATTC TCCCCAGTTA TAACCCAGTT 2100 

TGTTTAAAAA TAACTAAAAG TAAAGGAAAG 2160 

AAGTGCTTCT CAGCTCGCCC CATGTAAGTT 2220 

ACAACTGGTA CTGAGATTTT GCCTCTCTCT 2280 

GGGAGCCATA TCAGTGGATA CCAAGCTCTG 2340 

GTGACATAGA ACAGGGACTT TGGCCCTGGG 2400 

GCCCAATGAT GTAAAACAAT TCCAAACATC 2460 

TTCCTATCTT TTCATTAGAG GCTATGAGGA 2520 

CCCTGGAATC ACCCAGGTAG GCACTTAATT 2580 

TTAGTAGTGT GAAATTACAA CAACTTTAAG 2640 

AAAAGCCAGC CTTTCTAATC TCTTCTGCTA 2700 

GCCAGGCACC GTGGCTCATG CCTGTAATCC 2 760 

CGCTTGAGCT CAGGAGTTCA AGACCAGCCT 2 820 

AAAAAAATTA AAAATTAGTC AGTTGTAGTG 2880 

GGCTGAGGTG GAAGGATCAC TTGAGCCCAG 2940 

CCACTACACT CCAGCCTGAG TGACAGAGAA 3000 

CTACATTTCA AGTACTATTT CCCTTCTCTC 306 0 

CGCACACACT CCAGTGACTG GAAAAACGGG 312 0 

AAACAAGGAC TTTCTGCCTT TATAAATGGA 318 0 

GAGGTTATGA ATTCATCCTT TACAAACAAT 324 0 

TGTTATTAGG CAGCTTTGTT GCATGATTGC 33 00 

TTCTCACTGA TGTGGATGAA AAAATGAGAG 3360 

GGCAAATAGC TCCCGAGGTC ACCACTTCCC 34 20 

ATGGGGAGAT CACGTCACCC AGAAC C AG C A 34 80 

TAGAGCACAG GCTCCCAGGG GTCTTAAGAG 354 0 
AGCAA 



Seq ID No: 95 Protein sequence: 
Protein Accession #: CAC06611.1 

55 



1 11 21 31 41 51 

I I I I I I 

MNCKEGTDSS CGCRGNDEKK MLKCVWGDG AVGKTCLLMS YANDAFPEEY VPTVFDHYAV 60 
60 TVTVGGKQHL LGLYDTAGQE DYNQLRPLSY PNTDVFLICF SWNPASYHN VQEEWVPELK 120 
DCMPHVPYVL IGTQIDLRDD PKTLiARLLYM KEKPLTYEHG VKLAKAIGAQ CYLECSALTQ 180 
KGLKAVFDEA ILTIFHPKKK KKRCSEGHSC CSII 



65 Seq ID NO: 96 DNA sequence 

Nucleic Acid Accession #: NMJ)03654.1 

Coding sequence: 367-1602 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

70 | I | | | | 

GGGGAGGGCG CGGGAGGCGG AGGATGCCGC CGCGGCTGCT GCCGCCGCCG CCACCCGCGG 60 

GTCCCCGGCG ACCCTACTCC AGACCCGAGG ATGGAGCCGG CGCTGGGCGC TGCAGCTGCT 120 

CCCGGCGCGT CCCCGACCAG GTAGCTGGTG TCACTTCGGT GTGGTTGGAA GAAGACTTTC 180 

TCCCCAGCTG CATTCCCGGA GGCGCCCTTT CGACCTGGAG GCCGGGTCTG CTGGCCACAG 24 0 

75 GGCTGCCGCA CTGGCTGGGA CTGCCAGCTG GGCCTGGAGA CGCTGGTGGC TGTGGACTCC 300 

CCAGCTTGGA GCAGTCCCTC TTTGACCTCA CCCCTTGGAG AAGCAGCCCC ATGAAGGTGC 360 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



CCAGCCATGC 
CAGTACACGG 
GAGGCCGGGC 
CGCAAGACCC 
CTCTTCAACC 
ACGCTCATCC 
GCCAGCCGCG 
ATCAAGCCGC 
GTCCTCTGCT 
GGGGACTGTG 
GAGCGCAGCC 
CTGGTGGAAG 
ATTCTGGCTT 
GGCACCGGGA 
TTCTCCAACT 
TTGGTGCGCT 
TTCCTGGGCA 
GACCCCACCC 
AAGTGGCGCT 
GTGCTGGCCC 
GTCAGCCTGG 
GGGCGGGAGG 
TAACCCCTCC 
CTTCTGCCCC 
AGAGGGCGCC 
TCTCTGTGCG 
CGGGCACTCG 
TTTGCACTGT 
TTTATGAATG 
CCCTTAGAGC 
TTACTGTGAG 
GTCTGAGTCT 
TTGATCTCGG 
CAGTTTGCTA 
AGTGCAATAA 



AATGTTCCTG 
CCATCCGCAC 
TGGCCGAGCG 
ACATCCTCAT 
AGCACCTGGA 
CCCGCTTCAC 
ACCTCCTGCG 
CGCCGGTCAA 
CCCGGCCTGT 
TGCGCAAGTG 
ACGTGGCCAT 
ACCCGCGATT 
CGCGCAGCGA 
GGAAACC CTA 
CCGTGTCCAC 
ACGAGGACCT 
TCCCGCTGGA 
TGGGCAAGCA 
TCCGCCTCTC 
AGCTGGGCTA 
TGGAGGAGCG 
CGCAAGGTGT 
CTCTCCCACC 
TTTTTTGTCT 
TGAAGTAGGG 
GACGGTGACA 
CGAGG CG ACT 
CTTACTATTC 
GTGTCCATCC 
AGCGAAACTG 
GTGAACGTGG 
CGTGGCCGCC 
GGTCCATCTG 
AATGAACATT 
TCACC 



GAAGGCCGTC 
CTTCACCGCC 
ACTGTGCGAG 
CCTGGCCACC 
CGTCTTCTAC 
CCAGGGCAAG 
GAGCCTCTAC 
CCACACCACC 
GTGCGACCCT 
CGGGCTACTC 
CAAGACGGTG 
AAACCTCAAG 
GACCTTCCGC 
CAACCTGGAC 
CGGCCTCATG 
GGCTCGGAAC 
CAGCCACGTG 
CAAATACGGC 
CTACGACATC 
CAAGATCGCC 
GGACTTCCGC 
CGGTTTTGAT 
TCATCTTCGT 
CTGAAATTTG 
TCCCGCCCCC 
ATGTTTACAA 
TCTCAAGCTT 
AAGGTAAGAG 
TTTCCCCATC 
CCCCCTCCTG 
ACCTGTTTCT 
CCTGGACCAG 
TGATATTTCT 
GAAATTGAAA 



CTCCTCCTTG 

AAGTCCTTTC 

GAGAGCCCCA 

ACGCGCAGCG 

CTGTTTGAGC 

AGCCCGGCCG 

GACTGCGACC 

GACAGGATCT 

CCGGGGCCAG 

AACCTGACCG 

CGCGTGCCCG 

GTCATCCAGC 

GACACGTACC 

GTGACGCAGC 

CGGCCCCCGT 

CCTATGAAGA 

GCCCGCTGGA 

ACCGTGCGAA 

GTGGCCTTTG 

GCCTCGGAGG 

CCCTTCTCGT. 

AAAATGGACC 

GTCCTTCCTG 

CACTACGTCT 

CCCACCCCAT 

GCACCACATT 

TTGAATGGGT 

GATACAAACA 

CCTGCCTCCT 

CCCGCCCTTG 

GTTTCCAGTC 

TGATGACTGA 

TTGTGCCAAA 

TGCTTTATCT 



CCCTGGCCTC 
ACACCTGCCC 
CCTTCGCCTA 
GCTCCTCCTT 
CCCTCTACCA 
ACCGGCGGGT 
TCTACTTCCT 
TCCGCCGCGG 
CCGACCTGGT 
TGGCGGCCGA 
AGGTGAACGA 
TGGTCCGAGA 
GGCTCTGGCG 
TGACCACGGT 
GGCTCAAGGG 
AGACCGAGGA 
TCCAGAACAA 
ACTCGGCGGC 
CCCAGAACGC 
AGGAGCTGAA 
GACCCGGGCG 
GTTTTTAACT 
CCCCCAGCTC 
TGGACGGGAA 
TCAGACACAT 
TACACATCCA 
GAGTGGTCGG 
AGAGGACCAC 
GCCCCTGACG 
CCTGTCGGTG 
TGTGGTGATG 
TGAATCTTAT 
AAGAAAAAAA 
GTGTTTTCTG 



CATTGCCATC 
CGGGCTGGCA 
CAACCTCTCC 
CGTGGGCCAG 
CGTCCAGAAC 
CATGCTAGGC 
GGAGAACTAC 
GGCCAGCCGG 
CCTGGAGGAG 
GGCGTGCCGC 
CCTGCGCGCC 
CCCCCGCGGC 
GCTCTGGTAC 
GTGCGAGGAC 
CAAGTACATG 
GATCTACGGG 
CACGCGGGGC 
CACGGCCGAG 
CTGCCAGCAG 
GAACCCCTCG 
GTGCGGGTGG 
GTTGCCTTAT 
ACCCCACTCC 
TCACTGGGGC 
GGATGTTGGG 
CACACGCACA 
GTATCTAGTT 
TTGTCTCTAA 
CCCATTTCCC 
AGGCAGGTTT 
CTGTCTGTCT 
GAGCTTCTGA 
AAGAGTGGAT 
TAAATAAAAG 



Seq ID No: 97 Protein sequence: 
Protein Accession #: NP 003645.1 



I 

MQCSWKAVLL 
THILILATTR 
RDLLRSLYDC 
CVRKCGLLNL 
ASRSETFRDT 
RYEDLARNPM 
RFRLSYDIVA 



11 

I 

LALASIAIQY 
SGSSFVGQLF 
DLYFLENYIK 
TVAAEACRER 
YRLWRLWYGT 
KKTEEIYGFL 
FAQNACQQVL 



21 
I 

TAIRTFTAKS 
NQHLDVFYLF 
PPPVNHTTDR 
SHVAIKTVRV 
GRKPYNLDVT 
GIPLDSHVAR 
AQDGYKIAAS 



31 
I 

FHTCPGLAEA 
EPLYHVQNTL 
IFRRGASRVL 
PEVNDLRALV 
QLTTVCEDFS 
WIQNNTRGDP 
EEELKNPSVS 



41 
I 

GLAERLCEES 
IPRFTQGKSP 
CSRPVCDPPG 
EDPRLNLKVI 
NSVSTGLMRP 
TLGKHKYGTV 
LVEERDFRPF 



51 



PTFAYNLSRK 
ADRRVMLGAS 
PADLVLEEGD 
QLVRDPRGIL 
PWLKGKYMLV 
RNSAATAEKW 
S 



420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



60 
120 
180 
240 
300 
360 



Seq ID NO: 98 DNA sequence 

Nucleic Acid Accession #: NM_002852.1 

Coding sequence: 68-1213 (underlined sequences correspond to start and stop codons) 



75 



I 

CTCAAACTCA 
TCCAGCAATG 
GAACTCGGAT 
CCATCCCACT 
GCTCTTCATC 
CGACGTCCTG 
CCTGGCGAGG 
CGAGCTGCTG 
GGCGCAGCGC 
GACGCGAGCC 
TTGTGAAACA 
AGTGAGACCA 
ATTAAACAAA 
GTATCTCAGC 
TGAAGCCATG 
AGGGCTCACA 
AGGTCACATT 
TGTGGGTGGT 
CTGGGATAGT 



11 
I 

GCTCACTTGA 
CATCTCCTTG 
GATTATGATC 
GAGGACCCCA 
ATGCTGGAGA 
CGGGGCGAGC 
CCGTGCGCGC 
CAGGCGACCC 
CCAGAGGAGG 
GACCTGCACG 
GCTATTTTAT 
ATGAGGCTTG 
ACCATCCTGT 
TACCAATCCA 
GTTTCCCTGG 
TCCTTGTGGG 
GTTCCTGAGG 
GGCTTTGATG 
GTTCTTAGCA 



21 



31 



I 



GAGTCTCCTC 
CGATTCTGTT 
TCATGTATGT 
CGCCGTGCGA 
ACTCGCAGAT 
TGCAGAGGCT 
CGGGGGCTCC 
GCGACGCGGG 
CGGGGCGCGC 
CGGTGCAGGG 
TCCCAATGCG 
AGTCTTTTAG 
TTTCCTATGG 
TAGTGTTTGT 
GAAGGTGGAC 
TAAATGGTGA 
GAGGAATCCT 
AAACATTAGC 
ATGAAGAGAT 



CCGCCAGCTG 
TTGTGCTCTC 
GAATTTGGAC 
CTGCGGTCAG 
GAGAGAGCGC 
GCGGGAGGAG 
CGCAGAGGCC 
CCGCAGGCTG 
CCTGGCCGCG 
CTGGGCTGCC 
TTCCAAGAAG 
TGCCTGCATT 
CACAAAGAGG 
GGTGGGTGGA 
CCACCTGTGC 
ACTGGCGGCT 
GCAGATTGGC 
CTTCTCTGGG 
AAGAGAGACC 



41 
I 

TGGAAAGAAC 

TGGTCTGCAG 

AACGAAATAG 

GAGCACTCGG 

ATGCTGCTGC 

CTGGGCCGGC 

AGGCTGACCA 

GCGCGTATGG 

GTGCTAGAGG 

CGGAGCTGGC 
ATTTTTGGAA 

TGGGTCAAAG 
AATCCATATG 
GAGGAGAACA 
GGCACCTGGA 
ACCACTGTTG 
CAAGAAAAGA 
AGACTCACAG 
GGAGGAGCAG 



51 



TTTGCGTCTC 
TGTTGGCCGA 
ACAATGGACT 
AATGGGACAA 
AAGCCACGGA 
TCGCGGAAAG 
GTGCTCTGGA 
AGGGCGCGGA 
AGCTGCGGCA 
TGCCGGCAGG 
GCGTGCATCC 
CCACAGATGT 
AAATCCAGCT 
AACTGGTTGC 
ATTCAGAGGA 
AGATGGCCAC 
ATGGCTGCTG 
GCTTCAATAT 
AGTCTTGTCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
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CATCCGGGGG AATATTGTTG GGTGGGGAGT 
GTATGTTTCA TAAATGTTGT GAAACTCCAC 
AACACATGCC AGTTGGGAAG GTCTGAAAAC 
TGAAAGAGAG AGTTGAGACC AATCTTTATT 
5 AAGGAAAGAC ATTGGAAAAA GCTTTTGAGG 
TTTCAGTTTA ATGCTGTGTC TCTGTCAGAT 
TGTTGAACAG AGGGACAATT GTTTTACTTT 
GAATTTTACA TTGGAAGAAT AACAAAATAA 
TATGTACCTT ATTACAAAAA AAATGATGAA 
10 ACTATAAATG TAGTTTATGT GTTATAATCG 
AAGTTATATT GCAAAAGGGA TTTGTATTAA 
AAATAAAATA TTTTATAAAA CTAAAAAAAA 



CACAGAGATC CAGCCACATG GAGGAGCTCA 1200 

TTGAAGCCAA AGAAAGAAAC TCACACTTAA 1260 

TCAGTGCATA ATAGGAACAC TTGAGACTAA 1320 

TGTACTGGCC AAATACTGAA TAAACAGTTG 13 8 0 

ATAATGTTAC TAGACTTTAT GCCATGGTGC 1440 

AAACTCTCAA ATAATTAAAA AGGACTGTAT 1500 

TCTTTGGTTA ATTTTGTTTT GGCCAGAGAT 1560 

GATTTGTTGT CCATTGTTCA TTGTTATTGG 1620 

AACATATTTA TACTACAAGG TGACTTAACA 1680 

AATGTCACGT TTTTGAGAAG ATAGTCATAT 1740 

TTTAAGACTA TTTTTGTAAA GCTCTACTGT 1800 
AAAAAAA 



15 Seg ID No: 99 Protein sequence: 

Protein Accession #: NP_002843.1 * 

1 11 21 31 41 51 

^ i I I I I I 

20 MHLLAILFCA LWSAVLAENS DDYDLMYVNL DNEIDNGLHP TEDPTPCDCG QEHSEWDKLF 60 
IMLENSQMRE RMLLQATDDV LRGELQRLRE ELGRLAESLA RPCAPGAPAE ARLTSALDEL 120 
LQATRDAGRR LARMEGAEAQ RPEEAGRALA AVLEELRQTR ADLHAVQGWA ARSWLPAGCE 180 
TAILFPMRSK KIFGSVHPVR PMRLESFSAC I WVKATD VLN KTILFSYGTK RNPYEIQLYL 240 
SYQSIVFWG GEENKLVAEA MVSLGRWTHL CGTWNSEEGL TSLWVNGELA ATTVEMATGH 300 

25 IVPEGGILQI GQEKNGCCVG GGFDETLAFS GRLTGFNIWD SVLSNEEIRE TGGAESCHIR 360 
GNIVGWGVTE IQPHGGAQYV S 

Seq ID NO: 100 DNA sequence 
Nucleic Acid Accession #: NM_007351.1 
30 Coding sequence; 72-3758 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

35 | | | | | | 

CTGCTATCAA AAAGGCCATA AGGATTTTGT CCCCAAATTT CACATGAGCT ACCTTGCTTC 60 

AAACTACTGA GATGAAGGGG GCAAGATTAT TTGTCCTTCT TTCTAGTTTA TGGAGTGGGG 120 

GCATTGGGCT TAACAACAGT AAGCATTCTT GGACTATACC TGAGGATGGG AACTCTCAGA 180 

AGACTATGCC TTCTGCTTCA GTTCCTCCAA ATAAAATACA AAGTTTGCAA ATACTGCCAA 24 0 

40 CCACTCGGGT CATGTCGGCG GAGATAGCTA CAACTCCAGA GGCAAGAACT TCTGAAGACA 300 

GTCTTCTTAA ATCAACACTG CCTCCCTCAG AAACAAGTGC ACCTGCTGAG GGTGTGAGAA 360 

ATCAAACTCT CACATCCACA GAGAAAGCAG AAGGAGTGGT CAAGTTACAG AATCTTACCC 42 0 

TCCCAACCAA CGCTAGCATC AAGTTCAATC CTGGAGCAGA ATCAGTGGTC CTTTCCAATT 4 80 

CTACACTGAA ATTTCTTCAG AGCTTTGCCA GAAAGTCAAA TGAACAAGCA ACTTCTCTAA 54 0 

45 ACACAGTTGG AGGCACTGGA GGCATTGGAG GCGTTGGAGG CACTGGAGGC GTGGGAAATC 600 

GAGCCCCACG GGAAACATAC CTCAGCCGGG GTGACAGCAG TTCCAGCCAA AGAACTGACT 660 

ACCAAAAATC AAATTTCGAA ACAACTAGAG GAAAGAATTG GTGTGCTTAT GTACATACCA 720 

GGTTATCTCC CACAGTGACA TTGGACAACC AGGTCACTTA TGTCCCAGGT GGGAAAGGAC 780 

CTTGTGGCTG GACCGGTGGA TCCTGTCCTC AGAGATCTCA GAAGATATCC AATCCTGTCT 84 0 

50 ATAGGATGCA ACATAAAATT GTCACCTCAT TGGATTGGAG GTGCTGTCCT GGATACAGTG 900 

GGCCGAAATG TCAACTAAGA GCCCAGGAAC AGCAAAGTTT GATACACACC AACCAGGCTG 960 

AAAGTCATAC AGCTGTTGGC AGAGGAGTAG CTGAGCAGCA GCAGCAGCAA GGCTGTGGTG 1020 

ACCCAGAAGT GATGCAAAAA ATGACTGATC AGGTGAACTA CCAGGCAATG AAACTGACTC 10 80 

TTCTGCAGAA GAAGATTGAC AATATTTCTT TGACTGTGAA TGATGTAAGG AACACTTACT 114 0 

55 CCTCCCTAGA AGGAAAAGTC AGCGAAGATA AAAGCAGAGA ATTTCAATCT CTTCTAAAAG 12 00 

GTCTAAAATC CAAAAGCATT AATGTACTGA TAAGAGACAT AGTAAGAGAA CAATTTAAAA 1260 

TTTTTCAAAA TGACATGCAA GAGACTGTAG CACAGCTCTT CAAGACTGTA TCAAGTCTAT 1320 

CAGAGGACCT CGAAAGCACC AGGCAAATAA TTCAAAAAGT TAATGAATCT GTGGTTTCAA 13 80 

TAGCAGCCCA GCAAAAGTTT GTTTTGGTGC AAGAGAATCG GCCCACTTTG ACTGATATAG 144 0 

60 TGGAACTAAG GAATCACATT GTGAATGTAA GGCAAGAAAT GACTCTTACA TGTGAGAAGC 1500 

CTATTAAAGA ACTAGAAGTA AAG CAGACTC ATTTAGAAGG TGCTCTAGAA CAGGAACACT 1560 

CAAGAAGCAT TCTGTATTAT GAATCCCTCA ATAAAACTCT TTCTAAATTG AAGGAAGTAC 1620 

ATGAGCAGCT TTTATCAACT GAACAGGTAT CAGACCAGAA GAATGCTCCA GCTGCTGAGT 1680 

CAGTTAGCAA TAATGTCACT GAGTACATGT CTACTTTACA TGAAAATATA AAG AAG C AG A 174 0 

65 GTTTGATGAT GCTGCAAATG TTTGAAGATT TGCACATTCA AGAAAGCAAG ATTAACAATC 1800 

TCACCGTCTC TTTGGAGATG GAGAAAGAGT CTCTCAGAGG TGAATGTGAA GACATGTTAT 1860 

CCAAATGCAG AAATGATTTT AAATTTCAAC TTAAGGACAC AGAAGAGAAT TTACATGTGT 1920 

TAAATCAAAC ATTGGCTGAA GTTCTCTTTC CAATGGACAA TAAGATGGAC AAAATGAGTG 1980 

AGCAACTAAA TGATTTGACT TATGATATGG AGATCCTTCA ACCCTTGCTT GAGCAGGGAG 2040 

70 CATCACTCAG ACAGACAATG ACATATGAAC AACCAAAGGA AGCAATAGTG ATAAGGAAAA 2100 

AGATAGAAAA TCTGACTAGT GCTGTCAATA GTCTAAATTT TATTATCAAA GAACTTACAA 2160 

AAAGACACAA CTTACTTAGA AATGAAGTAC AGGGTCGTGA TGATGCCTTA GAAAGACGTA 2220 

TCAATGAATA TGCCTTAGAA ATGGAAGATG GCCTCAATAA GACAATGACT ATTATAAATA 2280 

ATGCTATTGA TTTCATTCAA GATAACTATG CCCTAAAAGA GACTTTAAGT ACTATTAAGG 2340 

75 ATAATAGTGA GATCCATCAT AAATGTACCT CCGATATGGA AACTATTTTG ACATTTATTC 24 00 

CTCAGTTCCA CCGTCTGAAT GATTCTATTC AGACTTTGGT CAATGACAAT CAGAGATATA 2460 
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ACTTTGTTTT GCAAGTCGCC AAGACCCTTG CAGGTATTCC CAGAGATGAG AAACTAAATC 2520 

AGTCCAACTT CCAAAAGATG TATCAAATGT TCAATGAAAC CACTTCCCAA GTGAGAAAAT 2580 

ACCAGCAAAA TATGAGTCAT TTGGAAGAAA AACTACTCTT AACTACCAAG ATTTCCAAAA 2640 

ATTTTGAGAC TCGGTTGCAA GACATTGAGT CTAAAGTTAC CCAGACGCTC ATACCTTATT 2700 

ATATTTCAGT TAAAAAAGGC AGTGTAGTTA CAAATGAGAG AGATCAGGCT CTTCAACTGC 2760 

AAGTATTAAA TTCCAGATTT AAGGCGTTGG AAGCAAAATC TATC CATCTT TCAATTAACT 2820 

TCTTTTCGCT TAACAAAACT CTCCACGAAG TTTTAACAAT GTGTCACAAT GCTTCTACAA 28 80 

GTGTGTCAGA ACTGAATGCT ACCATCCCTA AGTGGATAAA ACATTCCCTG CCAGATATTC 2940 

AACTTCTTCA GAAAGGTCTA ACAGAATTTG TGGAACCAAT AATTCAAATA AAAACTCAAG 3000 

CTGCCCTATC TAATTCAACT TGTTGTATAG ATCGATCGTT GCCTGGTAGT CTGGCAAATG 3060 

TTGTCAAGTC TCAGAAGCAA GTAAAATCAT TGCCAAAGAA AATTAACGCA CTTAAGAAAC 3120 

CAACGGTAAA TCTTACCACA GTCCTGATAG GCCGGACTCA AAGAAACACG GACAACATAA 3180 

TATATCCTGA GGAGTATTCA AGCTGTAGTC GGCATCCGTG CCAAAATGGG GGCACGTGCA 3240 

TAAATGGAAG AACTAGCTTT ACCTGTGCCT GCAGACATCC TTTTACTGGT GACAACTGCA 3300 

CTATCAAGCT TGTGGAAGAA AATGCTTTAG CTCCAGATTT TTCCAAAGGA TCTTACAGAT 3360 

ATGCACCCAT GGTGGCATTT TTTGCATCTC ATACGTATGG AATGACTATA CCTGGTCCTA 3420 

TCCTGTTTAA TAACTTGGAT GTCAATTATG GAGCTTCATA TACCCCAAGA ACTGGAAAAT 3480 

TTAGAATTCC GTATCTTGGA GTATATGTTT TCAAGTACAC CATCGAGTCA TTTAGTGCTC 3540 

ATATTTCTGG ATTTTTAGTG GTTGATGGAA TAGACAAGCT TGCATTTGAG TCTGAAAATA 3 600 

TTAACAGTGA AATACACTGT GATAGGGTTT TAACTGGGGA TGCCTTATTA GAATTAAATT 3660 

ATGGGCAGGA AGTCTGGTTA CGACTTGCAA AAGGAACAAT TCCAGCCAAG TTTCCCCCTG 3720 

TTACTACATT TAGTGGCTAT TTATTATATC GTAC ATAAG T TAGTATGAAA AACAGACTAT 3780 

CACCTTTATT GAGAAACAGC CAGTGTTTTC ATTTATCTTT GCTTGCACAT CTGCTCTGTT 3840 

TTGGTTTTTC TACAGGAAAT GAAAATCAAC TTGTTTTTTT AATATGAGTA AACTTGTATG 3900 

TCTATTTTAT AAAATTATTT GAATATTGTT TAATGTCTGA ATATGAAAGA GTTCTTGATC 3960 

CTAAAGAAAT TTAGTGGCAC AGAAAACAAA GTGAATTTGT TAGCATAATT ATTCCTATTC 4020 

TTATTTCTTC ATTTTAAGTC ATTGCAATGG AAAGTAATAT TATAAAACGG TAATTACAAC 4080 

ATATTATCAG TCACAGTTTT CTTTCCAATT AAACACTTAA CTTTTGTTAT TCCCTGTATA 4140 
TAAATATATA ACACACATTT TCTAGATTCA CAAATTTAAA TAAATTACTC AAAAAATG 



Seq ID No: 101 Protein sequence: 
Protein Accession #: NP 031377.1 



1 11 21 31 41 51 

I I I I I I 

MKGARLFVIrl* SSLWSGGIGL NNSKHSWTIP EDGNSQKTMP SASVPPNKIQ SLQILPTTRV 60 

MSAEIATTPB ARTSEDSLLK STLPPSETSA PAEGVRNQTL TSTEKAEGW KLQNLTLPTN 120 

ASIKFNPGAE SWLSNSTLK FLQSFARKSN EQATSLNTVG GTGGIGGVGG TGGVGNRAPR 180 

ETYLSRGDSS SSQRTDYQKS NFETTRGKNW CAYVHTRLSP TVTLDNQVTY VPGGKGPCGW 24 0 

TGGSCPQRSQ KISNPVYRMQ HKIVTSLDWR CCPGYSGPKC QLRAQEQQSL IHTNQAESHT 300 

AVGRGVAEQQ QQQGCGDPEV MQKMTDQVNY QAMKLTLLQK KIDNISLTVN DVRNTYSSLE 360 

GKVSEDKSRE FQSLLKGLKS KSINVLIRDI VREQFKIFQN* DMQETVAQLF KTVSSLSEDL 420 

ESTRQIIQKV NESWSIAAQ QKFVLVQENR PTLTDIVELR NHIVNVRQEM TLTCEKPIKE 480 

LEVKQTHLEG ALEQEHSRSI LYYESLNKTL SKLKEVHEQL LSTEQVSDQK NAPAAESVSN 54 0 

NVTEYMSTLH ENIKKQSLMM LQMFEDLHIQ ESKINNLTVS LEMEKESLRG ECEDMLSKCR 600 

NDFKFQLKDT EENLHVLNQT LAEVLFPMDN KMDKMSEQLN DLTYDMEILQ PLLEQGASLR 660 

QTMTYEQPKE AIVIRKKIEN LTSAVNSLNF IIKELTKRHN LLRNEVQGRD DALERRINEY 72 0 

ALEMEDGLNK TMTIINNAID FIQDNYALKE TLSTIKDNSE IHHKCTSDME TILTFIPQFH 780 

RLNDSIQTLV NDNQRYNFVL QVAKTLAGIP RDEKLNQSNF QKMYQMFNET TSQVRKYQQN 840 

MSHLEEKLLL TTKISKNFET RLQDIESKVT QTLIPYYISV KKGSWTNER DQALQLQVLN 900 

SRFKALEAKS IHLSINFFSL NKTLHEVLTM CHNASTSVSE LNATI PKWIK HSLPDIQLLQ 960 

KGLTEFVEPI IQIKTQAALS NSTCCIDRSL PGSLANWKS QKQVKSLPKK INALKKPTVN 1020 

LTTVLIGRTQ RNTDNIIYPE EYSSCSRHPC QNGGTCINGR TSFTCACRHP FTGDNCTIKL 1080 

VEENALAPDF SKGSYRYAPM VAFFASHTYG MTIPGPIL.FN NLDVNYGASY TPRTGKFRIP 114 0 

YLGVYVFKYT IESFSAHISG FLWDGIDKL AFESENINSE IHCDRVLTGD ALLELNYGQE 1200 

VWLRLAKGTI PAKFPPVTTF SGYLLYRT 



Seq ID NO: 102 DNA sequence 

Nucleic Acid Accession #: NM_000873.2 

Coding sequence: 57-884 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I 1 I 

ATCTCCCTCC AGGCAGCCCT TGGCTGGTCC CTGCGAGCCC GTGGAGACTG CCAGAG ATG T 60 

CCTCTTTCGG TTACAGGACC CTGACTGTGG CCCTCTTCAC CCTGATCTGC TGTCCAGGAT 120 

CGGATGAGAA GGTATTCGAG GTACACGTGA GGCCAAAGAA GCTGGCGGTT GAGCCCAAAG 180 

GGTCCCTCGA GGTCAACTGC AGCACCACCT GTAACCAGCC TGAAGTGGGT GGTCTGGAGA 240 

CCTCTCTAAA TAAGATTCTG CTGGACGAAC AGGCTCAGTG GAAACATTAC TTGGTCTCAA 300 

ACATCTCCCA TGACACGGTC CTCCAATGCC ACTTCACCTG CTCCGGGAAG CAGGAGTCAA 360 

TGAATTCCAA CGTCAGCGTG TACCAGCCTC CAAGGCAGGT CATCCTGACA CTGCAACCCA 420 

CTTTGGTGGC TGTGGGCAAG TCCTTCACCA TTGAGTGCAG GGTGCCCACC GTGGAGCCCC 480 

TGGACAGCCT CACCCTCTTC CTGTTCCGTG GCAATGAGAC TCTGCACTAT GAGACCTTCG 540 

GGAAGGCAGC CCCTGCTCCG CAGGAGGCCA CAGCCACATT CAACAGCACG GCTGACAGAG 600 

AGGATGGCCA CCGCAACTTC TCCTGCCTGG CTGTGCTGGA CTTGATGTCT CGCGGTGGCA 660 

ACATCTTTCA CAAACACTCA GCCCCGAAGA TGTTGGAGAT CTATGAGCCT GTGTCGGACA 720 
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GCCAGATGGT CATCATAGTC ACGGTGGTGT CGGTGTTGCT GTCCCTGTTC GTGACATCTG 780 
TCCTGCTCTG CTTCATCTTC GGCCAGCACT TGCGCCAGCA GCGGATGGGC ACCTACGGGG 840 
TGCGAGCGGC TTGGAGGAGG CTGCCCCAGG CCTTCCGGCC ATAGCAACCA TGAGTGGCAT 900 
GGCCACCACC ACGGTGGTCA CTGGAACTCA GTGTGACTCC TCAGGGTTGA GGTCCAGCCC 960 
TGGCTGAAGG ACTGTGACAG GCAGCAGAGA CTTGGGACAT TGCCTTTTCT AGCCCGAATA 1020 
CAAACACCTG GACTT 

Seq ID No: 103 Protein sequence: 
Protein Accession #: NP 000864.1 



1 11 21 31 41 51 

I I I I I I 

MSSFGYRTLT VALFTLICCP GSDEKVFEVH VRPKKLAVEP KGSLEVNCST TCNQPEVGGL 60 
ETSLNKILLD EQAQWKHYLV SNISHDTVLQ CHFTCSGKQE SMNSNVSVYQ PPRQVILTLQ 120 
PTLVAVGKSF TIECRVPTVE PLDSLTLFLF RGNETLHYET FGKAAPAPQE ATATFNSTAD 180 
REDGHRNFSC LiAVLDLMSRG GNIFHKHSAP KMLEIYEPVS DSQMVIIVTV VSVLLSLFVT 240 
SVLLCFIFGQ HLRQQRMGTY GVRAAWRRLP QAFRP 

Seq ID NO: 104 DNA sequence 

Nucleic Acid Accession #: NM_001795.2 

Coding sequence: 121-2475 (underlined sequences correspond to start and 



1 11 21 31 41 51 

I I I I I I 

GACGGTCGGC TGACAGGCTC CACAGAGCTC CACTCACGCT CAGGCCCTGG ACGGACAGGC 60 

AGTCCAACGG AACAGAAACA TCCCTCAGCC CCACAGGCAC GATCTGTTCC TCCTGGGAAG 120 

ATG CAGAGGC TCATGATGCT CCTCGCCACA TCGGGCGCCT GCCTGGGCCT GCTGGCAGTG 180 

GCAGCAGTGG CAGCAGCAGG TGCTAACCCT GCCCAACGGG ACACCCACAG CCTGCTGCCC 240 

ACCCACCGGC G CC AAAAGAG AGATTGGATT TGGAACCAGA TGCACATTGA TGAAGAGAAA 300 

AACACCTCAC TTCCCCATCA TGTAGGCAAG ATCAAGTCAA GCGTGAGTCG CAAGAATGCC 360 

AAGTACCTGC TCAAAGGAGA ATATGTGGGC AAGGTCTTCC GGGTCGATGC AGAGACAGGA 420 

GACGTGTTCG CCATTGAGAG GCTGGACCGG GAGAATATCT CAGAGTACCA CCTCACTGCT -480 

GTCATTGTGG ACAAGGACAC TGGTGAAAAC CTGGAGACTC CTTCCAGCTT CACCATCAAA 54 0 

GTTCATGACG TGAACGACAA CTGGCCTGTG TTCACGCATC GGTTGTTCAA TGCGTCCGTG 600 

CCTGAGTCGT CGGCTGTGGG GACCTCAGTC ATCTCTGTGA CAGCAGTGGA TGCAGACGAC 660 

CCCACTGTGG GAGACCACGC CTCTGTCATG TACCAAATCC TGAAGGGGAA AGAGTATTTT 720 

GCCATCGATA ATTCTGGACG TATTATCACA ATAACGAAAA GCTTGGACCG AGAGAAGCAG 780 

GCCAGGTATG AGATCGTGGT GGAAGCGCGA GATGCCCAGG GCCTCCGGGG GGACTCGGGC 84 0 

ACGGCCACCG TGCTGGTCAC TCTGCAAGAC ATCAATGACA ACTTCCCCTT CTTCACCCAG 900 

ACCAAGTACA CATTTGTCGT GCCTGAAGAC ACCCGTGTGG GCACCTCTGT GGGCTCTCTG 960 

TTTGTTGAGG ACCCAGATGA GCCCCAGAAC CGGATGACCA AGTACAGCAT CTTGCGGGGC 1020 

GACTACCAGG ACGCTTTCAC CATTGAGACA AACCCCGCCC ACAACGAGGG CATCATCAAG 1080 

CCCATGAAGC CTCTGGATTA TGAATACATC CAGCAATACA GCTTCATCGT CGAGGCCACA 114 0 

GACCCCACCA TCGACCTCCG ATACATGAGC CCTCCCGCGG GAAACAGAGC CCAGGTCATT 1200 

ATCAACATCA CAGATGTGGA CGAGCCCCCC ATTTTCCAGC AGCCTTTCTA CCACTTCCAG 1260 

CTGAAGGAAA ACCAGAAGAA GCCTCTGATT GGCACAGTGC TGGCCATGGA CCCTGATGCG 1320 

GCTAGGCATA GCATTGGATA CTCCATCCGC AGGACCAGTG ACAAGGGCCA GTTCTTCCGA 1380 

GTCACAAAAA AGGGGGACAT TTACAATGAG AAAGAACTGG ACAGAGAAGT CTACCCCTGG 144 0 

TATAACCTGA CTGTGGAGGC CAAAGAACTG GATTCCACTG GAACCCCCAC AGGAAAAGAA 1500 

TCCATTGTGC AAGTCCACAT TGAAGTTTTG GATGAGAATG ACAATGCCCC GGAGTTTGCC 1560 

AAGCCCTACC AGCCCAAAGT GTGTGAGAAC GCTGTCCATG GCCAGCTGGT CCTGCAGATC 1620 

TCCGCAATAG ACAAGGACAT AACACCACGA AACGTGAAGT TCAAATTCAC CTTGAATACT 1680 

GAGAACAACT TTACCCTCAC GGATAATCAC GATAACACGG CCAACATCAC AGTCAAGTAT 174 0 

GGGCAGTTTG ACCGGGAGCA TACCAAGGTC CACTTCCTAC CCGTGGTCAT CTCAGACAAT 1800 

GGGATGCCAA GTCGCACGGG CACCAGCACG CTGACCGTGG CCGTGTGCAA GTGCAACGAG 1860 

CAGGGCGAGT TCACCTTCTG CGAGGATATG GCCGCCCAGG TGGGCGTGAG CATCCAGGCA 1920 

GTGGTAGCCA TCTTACTCTG CATCCTCACC ATCACAGTGA TCACCCTGCT CATCTTCCTG 1980 

CGGCGGCGGC TCCGGAAGCA GGCCCGCGCG CACGGCAAGA GCGTGCCGGA GATCCACGAG 2 040 

CAGCTGGTCA CCTACGACGA GGAGGGCGGC GGCGAGATGG ACACCACCAG CTACGATGTG 2100 

TCGGTGCTCA ACTCGGTGCG CCGCGGCGGG GCCAAGCCCC CGCGGCCCGC GCTGGACGCC 2160 

CGGCCTTCCC TCTATGCGCA GGTGCAGAAG CCACCGAGGC ACGCGCCTGG GGCACACGGA 2220 

GGGCCCGGGG AGATGGCAGC CATGATCGAG GTGAAGAAGG ACGAGGCGGA CCACGACGGC 2280 

GACGGCCCCC CCTACGACAC GCTGCACATC TACGGCTACG AGGGCTCCGA GTCCATAGCC 234 0 

GAGTCCCTCA GCTCCCTGGG CACCGACTCA TCCGACTCTG ACGTGGATTA CGACTTCCTT 2400 

AACGACTGGG GACCCAGGTT TAAGATGCTG GCTGAGCTGT ACGGCTCGGA CCCCCGGGAG 2460 

GAGCTGCTGT AT TAG GCGGC CGAGGTCACT CTGGGCCTGG. GGACCCAAAC CCCCTGCAGC 2520 

CCAGGCCAGT CAGACGCCAG GCACCACAGC CTCCAAAAAT GGCAGTGACT CCCCAGCCCA 258 0 

GCACCCCTTC CTCGTGGGTC CCAGAGACCT CATCAGCCTT GGGATAGCAA ACTCCAGGTT 264 0 

CCTGAAATAT CCAGGAATAT ATGTCAGTGA TGACTATTCT CAAATGCTGG CAAATCCAGG 2700 

CTGGTGTTCT- GTCTGGGCTC AGACATCCAC ATAACCCTGT CACCCACAGA CCGCCGTCTA 2760 

ACTCAAAGAC TTCCTCTGGC TCCCCAAGGC TGCAAAGCAA AACAGACTGT GTTTAACTGC 282 0 

TGCAGGGTCT TTTTCTAGGG TCCCTGAACG CCCTGGTAAG GCTGGTGAGG TCCTGGTGCC 2880 

TATCTGCCTG GAGGCAAAGG CCTGGACAGC TTGACTTGTG GGGCAGGATT CTCTGCAGCC 2940 

CATTCCCAAG GGAGACTGAC CATCATGCCC TCTCTCGGGA GCCCTAGCCC TGCTCCAACT 3000 
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CCATACTCCA CTCCAAGTGC CCCACCACTC CCCAACCCCT CTCCAGGCCT GTCAAGAGGG 3060 

AGGAAGGGGC CCCATGGCAG CTCCTGACCT TGGGTCCTGA AGTGACCTCA CTGGCCTGCC 3120 

ATGCCAGTAA CTGTGCTGTA CTGAGCACTG AACCACATTC AGGGAAATGG CTTATTAAAC 3180 

TTTGAAGCAA CTGTGAATTC ATTCTGGAGG GGCAGTGGAG ATCAGGAGTG ACAGATCACA 3240 

GGGTGAGGGC CACCTCCACA CCCACCCCCT CTGGAGAAGG CCTGGAAGAG CTGAGACCTT 3300 

GCTTTGAGAC TCCTCAGCAC CCCTCCAGTT TTGCCTGAGA AGGGGCAGAT GTTCCCGGAG 3360 

CAGAAGACGT CTCCCCTTCT CTGCCTCACC TGGTCGCCAA TCCATGCTCT CTTTCTTTTC 3420 

TCTGTCTACT CCTTATCCCT TGGTTTAGAG GAACCCAAGA TGTGGCCTTT AGCAAAACTG 3480 

GACAATGTCC AAACCCACTC ATGACTGCAT GACGGAGCCG AGCCATGTGT CTTTACACCT 3540 

CGCTGTTGTC ACATCTCAGG GAACTGACCC TCAGGCACAC CTTGCAGAAG GCAAGGCCCT 3600 

GCCCTGCCCA ACCTCTGTGG TCACCCATGC ATCTTCCACT GGAACGTTTC ACTGCAAACA 3660 

CACCTTGGAG AAGTGGCATC AGTCAACAGA GAGGGGCAGG GAAGGAGACA CCAAGCTCAC 3720 

CCTTCGTCAT GGACCGAGGT TCCCACTCTG GGCAAAGCCC CTCACACTGC AAGGGATTGT 3780 

AGATAACACT GACTTGTTTG TTTTAACCAA TAACTAGCTT CTTATAATGA TTTTTTTACT 3 840 

AATGATACTT ACAAGTTTCT AGCTCTCACA GACATATAGA ATAAGGGTTT TTGCATAATA 3900 

AGCAGGTTGT TATTTAGGTT AACAATATTA ATTCAGGTTT TTTAGTTGGA AAAACAATTC 3960 

CTGTAACCTT CTATTTTCTA TAATTGTAGT AATTGCTCTA CAGATAATGT CTATATATTG 4020 

GCCAAACTGG TGCATGACAA GTACTGTATT TTTTTATACC TAAATAAAGA AAAATCTTTA 4080 
GCCTGGGCAA CAAAAAAA 



Seq ID No: 105 Protein sequence : 
Protein Accession #: NP 001786.1 



1 11 21 31 41 51 

I I I I I I 

MQRLMMLLAT SGACLGLLAV AAVAAAGANP AQRDTHSLIiP THRRQKRDWI WNQMHIDEEK 60 

NTSLPHHVGK IKSSVSRKNA KYLLKGEYVG KVFRVDAETG DVFAIERIjDR ENISEYHLTA 120 

VIVDKDTGEN LETPSSFTIK VHDVNDNWPV FTHRLFNASV PESSAVGTSV ISVTAVDADD 180 

PTVGDHASVM YQILKGKEYF AIDNSGRIIT ITKSLDREKQ ARYEIWEAR DAQGLRGDSG 240 

TATVLVTLQD INDNFPFFTQ TKYTFWPED TRVGTSVGSL FVEDPDEPQN RMTKYSILRG 300 

DYQDAFTIET NPAHNEGIIK PMKPLDYEYI QQYSFIVEAT DPTIDLRYMS PPAGNRAQVI 360 

INITDVDEPP IFQQPFYHFQ LKENQKKPLI GTVLAMDPDA ARHSIGYSIR RTSDKGQFFR 420 

VTKKGDIYNE KELDREVYPW YNLTVEAKEL DSTGTPTGKE SIVQVHIEVL DENDNAPEFA 480 

KPYQPKVCEN AVHGQLVLQI SAIDKDITPR NVKFKFTLNT ENNFTLTDNH DNTANITVKY 540 

GQFDREHTKV HFLPWISDN GMPSRTGTST LTVAVCKCNE QGEFTFCEDM AAQVGVSIQA 600 

WAILLCILT ITVITLLIFL RRRLRKQARA HGKSVPE1HE QLVTYDEEGG GEMDTTSYDV 660 

SVLNSVRRGG AKPPRPALDA RPSLYAQVQK PPRHAPGAHG GPGEMAAMIE VKKDEADHDG 720 

DGPPYDTLHI YGYEGSESIA ESLSSLGTDS SDSDVDYDFL NDWGPRFKML AEliYGSDPRE 780 
ELLY 



Seq ID NO: 106 DNA sequence 

Nucleic Acid Accession # : none found 

Coding sequence: 1-474 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ACA GTACTCT GTGCAAAAAA CCTGGTGAAA AAGGATTTTT TCCGACTTCC TGATCCATTT 60 

GCTA^GGTGG TGGTTGATGG ATCTGGGCAA TGCCATTCTA CAGATACTGT GAAGAATACG 120 

CTTGATCCAA AGTGGAATCA GCATTATGAC CTGTATATTG GAAAGTCTGA TTCAGTTACG 180 

ATCAGTGTAT GGAATCACAA GAAGATCCAT AAGAAACAAG GTGCTGGATT TCTCGGTTGT 24 0 

GTTCGTCTTC TTTCCAATGC CATCAACCGC CTCAAAGACA CTGGTTATCA GAGGTTGGAT 300 

TTATGCAAAC TCGGGCCAAA TGACAATGAT ACAGTTAGAG GACAGATAGT AGTAAGTCTT 360 

CAGTCCAGAG ACCGAATAGG CACAGGAGGA CAAGTTGTGG ACTGCAGTCG TTTATTTGAT 420 

AACGATTTAC CAGACGGAGC TCATTATTTG TGGACTTGGA AAGATAGATG TTAATGACTG 480 

GAAGGTAAAC ACCCGGTTAA AACACTGTAC ACCAGACAGC AACATTGTCA AATGGTTCTG 540 

GAAAGCTGTG GAGTTTTTTG ATGAAGAGCG ACGAGCAAGA TTGCTTCAGT TTGTGACAGG 600 

ATCCTCTCGA GTGCCTCTGC AGGGCTTCAA AGCATTGCAA GGTGCTGCAG GCCCGAGACT 660 

CTTTACCATA CACCAGATTG ATGCCTGCAC TAACAACCTG CCGAAAGCCC ACACTTGCTT 720 

CAATCGAATA GACATTCCAC CCTATGAAAG CTATGAAAAG CTATATGAAA AGCTGCTAAC 780 

AGCCATTGAA GAAACATGTG GATTTGCTGT GGAATGACAA GCTTCAAGGA TTTACCCAGG 84 0 
AC 

Seq ID No: 107 Protein sequence: 
Protein Accession #: none found 



1 11 21 31 41 51 

I I I I I I 

TVLCAKNLVK KDFFRLPDPF AKVWDGSGQ CHSTDTVKNT LDPKWNQHYD LYIGKSDSVT 60 

ISVWNHKKIH KKQGAGFLGC VRLLSNAINR LKDTGYQRLD LCKLGPNDND TVRGQIWSL 120 

QSRDRIGTGG QWDCSRLFD NDLPDGAHYL WTWKDRC 



Seq ID NO: 108 DNA sequence 

Nucleic Acid Accession #: NM_002318.1 

Coding sequence: 248-2572 (underlined sequences correspond to start and stop codons) 
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- I I I I 

5 ACTCCAGCGC GCGGCTACCT ACGCTTGGTG CTTGCTTTCT 

CCGCCCCCTC TGCTCGAGAA AGGGGCTCAG CGGCGGCGGA 

GAGAGCGCGG TCCCAGCCCG GCCACTGCGG ATCCCTGAAA 

CTGTACCCCG CCTGTCCCTC CCAGCTGCGC AGGGCCCCTT 

GACAGGGATG GAGAGGCCTC TGTGCTCCCA CCTCTGCAGC 

10 CCTGTCCCCC CTGAGCCTGG CACAGTATGA CAGCTGGCCC 
GCAACCGGCT CCTGAGTATC ACCAGCCCCA GGCCCCCGCC 
GCGCCTGGCT GGGCAGAAGA GGAAGCACAG CGAGGGCCGG 
CCAGTGGGGC ACCGTGTGCG ATGACGACTT CTCCATCCAC 
GGAGCTGGGC TATGTGGAGG CCAAGTCCTG GACTGCCAGC 

15 AGGGCCCATC TGGTTAGACA ATCTCCACTG TACTGGCAAC 
CACCTCCAAT GGCTGGGGCG TCACTGACTG CAAGCACACG 
CAGCGACAAA AGGATTCCTG GGTTCAAATT TGACAATTCG 
CCTGAATATC CAGGTGGAGG ACATTCGGAT TCGAGCCATC 
CACCCCAGTG ATGGAGGGCT ACGTGGAGGT GAAGGAGGGC 

20 TGACAAGCAC TGGACGGCCA AGAATTCCCG CGTGGTCTGC 
GGAGAGGACA TACAATACCA AAGTGTACAA AATGTTTGCC 
CTGGCCATTC TCCATGGACT GCACCGGCAC AGAGGCCCAC 
CCCCCAGGTG TCACTGGACC CCATGAAGAA TGTCACCTGC 
GGTGAGTTGT GTGCCTGGGC AGGTCTTCAG CCCTGACGGA 

25 ATACAAGCCA GAGCAACCCC TGGTGCG ACT GAGAGGCGGT 
CGTGGAGGTG CTCAAAAATG GAGAATGGGG GACCGTCTGC 
GTCGGCCAGT GTGGTCTGCA GAGAGCTGGG CTTTGGGAGT 
CTCCCGACTG GGGCAAGGGA TCGGACCCAT CCACCTCAAC 
TGAGAAGTCC ATTATAGACT GCAAGTTCAA TGCCGAGTCT 

30 GGATGCTGGT GTGAGATGCA ACACCCCTGC CATGGGCTTG 
CGGCGGCCGC AATC CCTACG AGGGCCGAGT GGAGGTGCTG 
TGTGTGGGGG ATGGTGTGTG GCCAAAACTG GGGCATCGTG 
CCAGCTGGGC CTGGGATTCG CCAGCAACGC CTTCCAGGAG 
TGTCAACAGC AACAAAGTGG TCATGAGTGG AGTGAAGTGC 

35 GGCGCACTGC CGCCACGACG GGGAGGACGT GGCCTGCCCC 
GGCCGGAGTT GCCTGCTCAG AAACCGCCCC TGACCTGGTC 
GCAGACCACC TACCTGGAGG ACCGGCCCAT GTTCATGCTG 
CTGCCTCTCG GCCTCAGCCG CGCAGACCGA CCCCACCACG 
CTTCTCCTCC CAGATCCACA ACAATGGCCA GTCCGACTTC 

40 CGCGTGGATC TGGCACGACT GTCACAGGCA CTACCACAGC 
TGACCTGCTG AACCTCAATG GCACCAAGGT GGCAGAGGGC 
GGAGGACACA GAATGTGAAG GAGACATCCA GAAGAATTAC 
TCAGGGCATC ACCATGGGCT GCTGGGACAT GTACCGCCAT 
TGACATCACT GACGTGCCCC CTGGAGACTA CCTGTTCCAG 

45 CGAGGTTGCA GAATCCGATT ACTCCAACAA CATCATGAAA 
CCACCGCATC TGGATGTACA ACTGCCACAT AGGTGGTTCC 
AAAGTTTGAG CACTTCAGCG GGCTCTTAAA CAACCAGCTG 
GCGTGGTCAA CTCCTGTCTT CAGGCCACAC CACATCTTCC 
CTGAGTCTGA ACGAATGCCA CGTGCCCTCA CCCAGCCCGG 

50 CTACAGCTGT GTCTAAGCTC AGGAGGAAAG GGACCCTCCC 
ACCTGACCCT TGGGGCCTGA GAAGGCCTTG GGGGGGTGGG 
GAGCAGCACC AAGAGCCAGT CTTGACCGGG ATGAGGCCCA 
TGTCCCATTC AAGCCACCGA GCTCACCACA GACACAGTGG 
ACACGTGGAC AAATGCGGGC TCATCAGCCC CCCCAGAGAG 

55 CTCCTCCTCT TAGGTCATTT TCAGCAAACT TGAATATCTA 
CCTCCAGTCT ATTATAGTCA CATAGATAAT GGTGCCACGT 
CAGACTTGGT GCTTCCCTCT CCACAACCCC CACCCCTTGT 
TATTTTCACA GACTTTTGAA GCACAAATTT ATTGGCATTT 
TTGGAAGTAC AAATCTAAGG AAAAACCAAC CCACTGTGTA 

60 TTCCAATTCT GTGGGTTTTT GATTCAACGG TGCTATAACC 
GCTCACTGAG CACCATGTGT CATCACAGAC ACTTACACAT 
GAAAGATTTA TG 

Seq ID No: 109 Protein sequence: 
65 Protein Accession #: NP 002309.1 



1 11 21 31 

„ | I I I 

70 MERPLCSHLC SCLAMLALLS PLSLAQYDSW PHYPEYFQQP 
AGQKRKHSEG RVEVYYDGQW GTVCDDDFSI HAAHWCREL 
IWLDNLHCTG NEATLAACTS NGWGVTDCKH TEDVGWCSD 
IQVEDIRIRA ILSTYRKRTP VMEGYVEVKE GKTWKQICDK 
TYNTKVYKMF ASRRKQRYWP FSMDCTGTEA HISSCKLGPQ 

75 CVPGQVFSPD GPSRFRKAYK PEQPLVRLRG GAYIGEGRVE 
SWCRELGFG SAKEAVTGSR LGQGIGPIHL NEIQCTGNEK 
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41 


51 




| 

CCAGCCATCG 


I 

GAGACCAGAG 


60 


AGCGGAGGGG 


GACCACCGTG 


120 


CCAAAAAGCT 


CCTGCTGCTT 


180 


CGTGGGATCA 


TCAGCCCGAA 


240 


TGCCTGGCTA 


TGCTGGCCCT 


300 


CATTACCCCG 


AGTACTTCCA 


360 


AACGTGGCCA 


AGATTCAGCT 


420 


GTGGAGGTGT 


ACTATGATGG 


480 


GCTGCCCACG 


TCGTCTGCCG 


540 


TCCTCCTACG 


GCAAGGGAGA 


600 


GAGGCGACCC 


TTGCAGCATG 


660 


GAGGATGTCG 


GTGTGGTGTG 


720 


TTGATCAACC 


AGATAGAGAA 


780 


CTCTCAACCT 


ACCGCAAGCG 


840 


AAGACCTGGA 


AGCAGATCTG 


900 


GGCATGTTTG 


GCTTCCCTGG 


960 


TCACGGAGGA 


AGCAGCGCTA 


1020 


ATCTCCAGCT 


GCAAGCTGGG 


1080 


GAGAATGGGC 


TGCCGGCCGT 


1140 


CCCTCGAGAT 


TCCGGAAAGC 


1200 


GCCTACATCG 


GGGAGGGCCG 


1260 


GACGACAAGT 


GGGACCTGGT 


1320 


GCCAAAGAGG 


CAGTCACTGG 


1380 


GAGATCCAGT 


GCACAGGCAA 


1440 


CAGGGCTGCA 


ACCACGAGGA 


1500 


CAGAAGAAGC 


TGCGCCTGAA 


1560 


GTGGAGAGAA 


ACGGGTCCCT 


1620 


GAGGCCATGG 


TGGTCTGCCG 


1680 


ACCTGGTATT 


GGCACGGAGA 


1740 


TCGGGAACGG 


AGCTGTCCCT 


1800 


CAGGGCGGAG 


TGCAGTACGG 


1860 


CTCAATGCGG 


AGATGGTGCA 


1920 


CAGTGTGCCA 


TGGAGGAGAA 


1980 


GGCTACCGCC 


GGCTCCTGCG 


2040 


CGGCCCAAGA 


ACGGCCGCCA 


2100 


ATGGAGGTGT 


TCACCCACTA 


2160 


CACAAGGCCA 


GCTTCTGCTT 


2220 


GAGTGTGCCA 


ACTTCGGCGA 


2280 


GACATCGACT 


GCCAGTGGGT 


2340 


GTTGTTATTA 


ACCCCAACTT 


2400 


TGCAGGAGCC 


GCTATGACGG 


2460 


TTCAGCGAAG 


AGACGGAAAA 


2520 


TCCCCGCAGT 


AAAGAAGCCT 


2580 


ATGGGACTTC 


CCCCCAACAA 


2640 


CCCCCACCCT 


GTCCAGACCC 


2700 


ATCATTCATG 


GGGGGCTGCT 


2760 


GTTTGTCCAC 


AGAGCTGCTG 


2820 


CAGACAGGTT 


GTCATCAGCT 


2880 


AGCCGCGCTC 


TTCTCCAGTG 


2940 


GGTCAGGCCG 


AACCCCATTT 


3000 


GACCTCTCTT 


CCAATGAAAC 


3060 


GTTTTCTGAT 


TTGGTGAGCT 


3120 


TTTTCAAGAT 


ACTATTATTA 


3180 


AATATTGGAC 


ATCTGGGCCC 


3240 


AGTGACTCAT 


CTTCCTGTTG 


3300 


AGGGTCCTGG 


GTGACAGGGC 


3360 


ACTTGAAACT 


TGGAATAAAA 


3420 


41 


51 




1 

APEYHQPQAP 


1 

ANVAKIQLRL 


60 


GYVEAKSWTA 


SSSYGKGEGP 


120 


KRIPGFKFDN 


SLINQIENLN 


180 


HWTAKNSRW 


CGMFGFPGER 


240 


VSLDPMKNVT 


CENGLPAWS 


300 


VLKNGEWGTV 


CDDKWDLVSA 


360 


SIIDCKFNAE 


SQGCNHEEDA 


420 
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GVRCNTPAMG LQKKLRIINGG RNPYEGRVEV LVERNGSLVW GMVCGQNWGI VEAMWCRQL 480 

GLGFASNAFQ ETWYWHGDVN SNKWMSGVK CSGTELSLAH CRHDGEDVAC PQGGVQYGAG 540 

VACSETAPDL VLNAEMVQQT TYLEDRPMFM LQCAMEENCL SASAAQTDPT TGYRRLLRFS 600 

SQIHNNGQSD FRPKNGRHAW IWHDCHRHYH SMEVFTHYDL LNLNGTKVAE GHKASFCLED 660 

TECEGDIQKN YECANFGDQG ITMGCWDMYR HDIDCQWVDI TDVPPGDYLF QWINPNFEV 720 
AESDYSNNIM KCRSRYDGHR IWMYNCHIGG SFSEETEKKF EHFSGLLNNQ LSPQ 

Seq ID NO: 110 DNA sequence 

Nucleic Acid Accession #: none found, CAT_73007_3 

Coding sequence: 1-495 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

I 1. I I I I 

CGGACGCGTG GGTCGACCCA CGCGTCCGCC CACGCGTCCG TATGGACAGA GCCTCCACTG 60 

GCTGCTGCCT GCCCGCCACA TACCCAGCTG ACATGGGCAC CGCAGGAGCC ATGCAGCTGT 120 

CTGGGTGATC CTGGGCTTCC TCCTGTTCCG AGGCCACAAC TCCCAGCCCA CAATGACCCA 180 

ACCTCTAGCT CTCAGGGAGG CCTTGGCGGT CTAAGTCTGA CCACAGAGCC AGTTTCTTCC 240 

ACCCAGGATA CATCCCTTCC TCAGAGGCTA ACAGGCCAAG CCATCTGTCC AGCACTGGTA 300 

CCCAGGCGCA GGTGTCCCCA GCAGTGGAAG AGACGGAGGC ACAAGCAGAG ACACATTTCA 360 

ACTGTTCCCC CCAATTCAAC CACCATGAGC CTGAGCATGA GGGAAGATGC GACCATCCTG 420 

CCAGCCCCAC GTCAGAGACT GTGCTCACTG TGGCTGCATT TGGGATGGAG TCGGGTGGAG 480 

GCCCACTCTG GCTAGGGGGC GGCAGGCTGA GAGCTCACCT GTTCAGCAGA GAAGTGGAAC 540 

CACTTTGCTC CTGGAGCCTG TCTACCACAG TGTTATCAGC TTCATTGTCA TCCTGGTGGT 600 

GTGGTGATCA TCCTAGTTGG TGTGGTCAGC CTGAGGGTTC AGTGTCGGAA GAGCAAGGAG 660 

TCTGAAGATC CCAGAACCTG GGAGTACAGG GCGTGTCTGA CAAGCTGGTC ACAGACCATG 720 

GCGAGAACGA CAGCATCGCC CATTATCACA TGGAAGACAT CACACGACTT AGGGCAACAC 780 

GCACTCAGCA GCGAGCATCA AAGGAGCCTA CGCATGGCCC AGACTGAGAG CAAGCACAAA 840 
GGGC 

Seq ID No: 111 Protein sequence: 

Protein Accession #: none found, CAT 73007_3 



1 11 21 31 41 51 

I I I I I I 

RTRGSTHASA HASVWTEPPL AAACPPHTQL TWAPQEPCSC LGDPGLPPVP RPQLPAHNDP 60 

TSSSQGGLGG LSLTTEPVSS TQDTSLPQRL TGQAICPALV PRRRCPQQWK RRRHKQRHIS 120 

TVPPNSTTMS LSMREDATIL PAPRQRLCSL WLHLGWSRVE AHSG 



Seq ID NO: 112 DNA sequence 

Nucleic Acid Accession #: NM_005424.1 

Coding sequence: 37-34 53 {underlined sequences correspond to start and stop codons) 



CGCTCGTCCT 
TTGCTCCCCA 
GCCAACCTGC 
GGGGCGGGGA 
ATCGTGCGCA 
ACGCTTCGCG 
GCTGGGGCGC 
CCAGACAAGG 
CACAAGGAGA 
GACTGGCATG 
TCGAGCGGCA 
CGGCTCATCG 
CCAGGTTGCC 
GGCTTCACTG 
CAGGAGCAGT 
TATGGCTGCT 
GGTCATTTTG 
CGGTTCAGTG 
CGGATCCCCC 
CGGATCAACT 
AAGCCAGACG 
GCTGAGTTCG 
TCCACATCTG 
CCCCTGGCTG 
GTCTCGTTCT 
AGTACCATGG 
CTGAGGCCAA 
GAGGGGGCCT 
CCGTGGTTGG 
CCCTTGGTGC 



11 
I 

GGCTGGCCTG 
TCCTCTTCTT 
GGCTCACGGA 
GGGGCTCGGA 
CCCCGCCCGG 
GCTTCTCCAA 
GGCGCACGCG 
TCACACACAC 
AGCAGACAGA 
AAGCCCAGGA 
TCTACAGTGC 
TGCGGGGTTG 
TACATGGAGG 
GCACCCGCTG 
GCCCAGGCAT 
CTTGTGGATC 
GGGCTGATTG 
GTTGTGTCTG 
AGATCCTCAA 
GTGCAGCTGC 
GCACTGTGCT 
AGGTGCCCCG 
GCGGCCAAGA 
CACCTCGGCT 
CTGGGGATGG 
ACTGGTCGAC 
AGACAGGATA 
GGGGGCCTCC 
AGGGCTGGCA 
CCGGGCCACT 



21 



GGTCGGCCTC 
GGCTTCTCAT 
CCCCCAGCGC 
CGCCTGGGGC 
GCCACCCCTG 
GCCCTCGGAC 
CGTCATCTAC 
TGTGAACAAA 
CGTGATCTGG 
TGGGCGGTTC 
CACTTACCTG 
TGGGGCTGGG 
TGTCTGCCAC 
TGAACAGGCC 
ATCAGGCTGC 
TGGCTGGAGA 
CCGACTCCAG 
CCCCTCTGGG 
CATGGCCTCA 
AGGGAACCCC 
CCTGTCCACC 
CTTGGTTCTT 
CAGCCGGCGC 
CCTGACCAAG 
ACCCATCTCC 
CATTGTGGTG 
CAGTGTTCGT 
CACCCTCATG 
TGTGGAAGGC 
GGTGGGCGAC 



31 
I 

TGGAGT ATG G 
GTGGGCGCGG 
TTCTTCCTGA 
CCGCCCCTGC 
CGCCTGGCGC 
CTCGTGGGCG 
GTGCACAACA 
GGTGACACCG 
AAGAGCAACG 
CTGCTGCAGC 
GAAGCCAGCC 
CGCTGGGGGC 
GACCATGACG 
TGCAGAGAGG 
CGGGGCCTCA 
GGAAGCCAGT 
TGCCAGTGTC 
TGGCATGGAG 
GAACTGGAGT 
TTCCCCGTGC 
AAGGCCATTG 
GCGGACAGTG 
TTCAAGGTCA 
CAGAGCCGCC 
ACTGTCCGCC 
GACCCCAGTG 
GTGCAGCTGA 
ACCACAGACT 
ACTGACCGGC 
GGTTTCCTGC 



41 
I 

TCTGGCGGGT 
CGGTGGACCT 
CTTGCGTGTC 
TGCTGGAGAA 
GCAACGGTTC 
TCTTCTCCTG 
GCCCTGGAGC 
CTGTACTTTC 
GATCCTACTT 
TCCCAAATGT 
CCCTGGGCAG 
CAGGCTGTAC 
GCGAATGTGT 
GCCGTTTTGG 
CCTTCTGCCT 
GCCAAGAAGC 
AGAATGGTGG 
TGCACTGTGA 
TCAACTTAGA 
GGGGCAGCAT 
TGGAGCCAGA 
GGTTCTGGGA 
ATGTGAAAGT 
AGCTTGTGGT 
TGCACTACCG 
AGAACGTGAC 
GCCGGCCAGG 
GTCCTGAGCC 
TGCGAGTGAG 
TGCGCCTGTG 



51 



GCCCCCTTTC 
GACGCTGCTG 
TGGGGAGGCC 
GGACGACCGT 
GCACCAGGTC 
CGTGGGCGGT 
CCACCTGCTT 
TGCACGTGTG 
CTACACCCTG 
GCAGCCACCA 
CGCCTTCTTT 
CAAGGAGTGC 
ATGCCCCCCT 
GCAGAGCTGC 
CCCAGACCCC 
TTGTGCCCCT 
CACTTGTGAC 
GAAGTCAGAC 
GACGATGCCC 
AGAGCTACGC 
GAAGACCACA 
GTGCCGTGTG 
GCCCCCCGTG 
CTCCCCGCTG 
GCCCCAGGAC 
GTTAATGAAC 
GGAAGGAGGA 
TTTGTTGCAG 
CTGGTCCTTG 
GGACGGGACA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
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CGGGGGCAGG AGCGGCGGGA 
GGACTCACGC CTGGCACCCA 
GGCCCGGCCT CGCCCCCTGC 
CACCTCCACG CCCAGGCCCT 
GCTCTGCCTG GGCCAATATC 
GACCCACTGT GGATAGACGT 
AACGCCAGCA CGCGCTACCT 
AGCAACACAG TAGAAGAGTC 
GAGAGCCGGG CAGCTGAAGA 
GTGTCTGCCA CCTGCCTCAC 
AGCTGCCTGC ATCGGAGACG 
CTGCAGTTCA GCTCAGGGAC 
CTGAGCTACC CAGTGCTAGA 
AACTTCGGCC AGGTCATCCG 
ATCAAAATGC TGAAAGAGTA 
GAAGTTCTGT GCAAATTGGG 
AACCGAGGTT AC TTGTATAT 
CTGCGGAAAA GCCGGGTCCT 
TCTACCCTTA GCTCCCGGCA 
TACCTGAGTG AGAAGCAGTT 
GAGAACCTAG CCTCCAAGAT 
AAGAAGACGA TGGGGCGTCT 
GTCTATACCA CCAAGAGTGA 
CTTGGAGGTA CACCCTACTG 
GGCTACCGCA TGGAGCAGCC 
TGCTGGCGGG ACCGTCCCTA 
CGCATGCTGG AAGCCAGGAA 
GCGGGCATTG ATGCCACAGC 
GCTGGCCGGA GCAAACTCTG 
CTTAAGCTGC CTCAAGGAAT 
GGTGGGCTTA GGGGAACTGG 
TCCTTCTTTC TAGTTCAGCT 
CAAACCCCCA CTCCAGCTCC 
CAGCTACTCC CACTCCCGGC 
AAAAA 

Seq ID No: 113 Protein sequence: 
Protein Accession #: NP_005415.1 

1 11 21 31 41 51 

I I I I I 1 

MWJRVPPFLL PILFLASHVG AAVDLTLIAN LRLTDPQRFF LTCVSGEAGA GRGSDAWGPP 60 
LLLEKDDRIV RTPPGPPLRL ARNGSHQVTL RGFSKPSDLV GVFSCVGGAG ARRTRVIYVH . 120 
NSPGAHLLPD KVTHTVNKGD TAVLSARVHK EKQTDVIWKS NGSYFYTLDW HEAQDGRFLL 180 
QLPNVQPPSS GIYSATYLEA SPLGSAFFRL IVRGCGAGRW GPGCTKECPG CLHGGVCHDH 24 0 
DGECVCPPGF TGTRCEQACR EGRFGQSCQE QCPGISGCRG LTFCLPDPYG CSCGSGWRGS 300 
QCQEACAPGH FGADCRLQCQ CQNGGTCDRF SGCVCPSGWH GVHCEKSDRI PQILNMASEL 360 
EFNLETMPRI NCAAAGNPFP VRGSIELRKP DGTVIjLSTKA IVEPEKTTAE FEVPRLVLAD 42 0 
SGFWECRVST SGGQDSRRFK VNVKVPPVPL AAPRLLTKQS RQLWSPLVS FSGDGPISTV 4 80 
RLHYRPQDST MDWSTIWDP SENVTLMNLR PKTGYSVRVQ LSRPGEGGEG AWGPPTLMTT 54 0 
DCPEPLLQPW LEGWHVEGTD RLRVSWSLPL VPGPLVGDGF LLRLWDGTRG QERRENVSSP 600 
QARTALLTGL TPGTHYQLDV QLYHCTLLGP ASPPAHVLLP PSGPPAPRHL HAQALSDSEI 660 
QLTWKHPEAL PGPISKYWE VQVAGGAGDP LWIDVDRPEE TSTIIRGLNA STRYLFRMRA 720 
SIQGLGDWSN TVEESTLGNG LQAEGPVQES RAAEEGLDQQ LILAWGSVS ATCLTILAAL 780 
LTLVCIRRSC LHRRRTFTYQ SGSGEETILQ FSSGTLTLTR RPKLQPEPLS YPVLEWEDIT 840 
FEDLIGEGNF GQVIRAMIKK DGIiKMNAAIK MLKEYASEND HRDFAGELEV LCKLGHHPNI 900 
INLLGACKNR GYLYIAIEYA PYGNLLDFLR KSRVLETDPA FAREHGTAST LSSRQLLRFA 960 
SDAANGMQYIi SEKQFIHRDL AARNVLVGEN LASKIADFGL SRGEEVYVKK TMGRLPVRWM 1020 
AIESLNYSVY TTKSDVWSFG VLLWEIVSLG GTPYCGMTCA ELYEKLPQGY RMEQPRNCDD 1080 
EVYELMRQCW RDRPYERPPF AQIALQLGRM LEARKAYVNM SLFENFTYAG IDATAEEA 



Seq ID NO: 114 DNA sequence 

Nucleic Acid Accession #: NM_002632.l 

Coding sequence: 322-771 (underlined sequences correspond to start and stop codons) 

i 

1 11 21 31 41 SI 

I I I I I I 

GGGATTCGGG CCGCCCAGCT ACGGGAGGAC CTGGAGTGGC ACTGGGCGCC CGACGGACCA 60 

TCCCCGGGAC CCGCCTGCCC CTCGGCGCCC CGCCCCGCCG GGCCGCTCCC CGTCGGGTTC 120 

CCCAGCCACA GCCTTACCTA CGGGCTCCTG ACTCCGCAAG GCTTCCAGAA GATGCTCGAA 180 

CCACCGGCCG GGGCCTCGGG GCAGCAGTGA GGGAGGCGTC CAGCCCCCCA CTCAGCTCTT 24 0 

CTCCTCCTGT GCCAGGGGCT CCCCGGGGGA TGAGCATGGT GGTTTTCCCT CGGAGCCCCC 300 

TGGCTCGGGA CGTCTGAGAA G ATG CCGGTC ATGAGGCTGT TCCCTTGCTT CCTGCAGCTC 360 

CTGGCCGGGC TGGCGCTGCC TGCTGTGCCC CCCCAGCAGT GGGCCTTGTC TGCTGGGAAC 420 

GGCTCGTCAG AGGTGGAAGT GGTACCCTTC CAGGAAGTGT GGGGCCGCAG CTACTGCCGG 480 



GAACGTCTCA TCCCCCCAGG CCCGCACTGC CCTCCTGACG I860 

CTACCAGCTG GATGTGCAGC TCTACCACTG CACCCTCCTG 192 0 

ACACGTGCTT CTGCCCCCCA GTGGGCCTCC AGCCCCCCGA 198 0 

CTCAGACTCC GAGATCCAGC TGACATGGAA GCACCCGGAG 204 0 

CAAGTACGTT GTGGAGGTGC AGGTGGCTGG GGGTGCAGGA 2100 

GGACAGGCCT GAGGAGACAA GCACCATCAT CCGTGGCCTC 2160 

CTTCCGCATG CGGGCCAGCA TTCAGGGGCT CGGGGACTGG 2220 

CACCCTGGGC AACGGGCTGC AGGCTGAGGG CCCAGTCCAA 2280 

GGGCCTGGAT CAGCAGCTGA TCCTGGCGGT GGTGGGCTCC 2340 

CATCCTGGCC GCCCTTTTAA CCCTGGTGTG CATCCGCAGA 2400 

CACCTTCACC TACCAGTCAG GCTCGGGCGA GGAGACCATC 2460 

CTTGACACTT ACCCGGCGGC CAAAACTGCA GCCCGAGCCC 2520 

GTGGGAGGAC ATCACCTTTG AGGACCTCAT CGGGGAGGGG 2580 

GGCCATGATC AAGAAGGACG GGCTGAAGAT GAACGCAGCC 2640 

TGCCTCTGAA AATGACCATC GTGACTTTGC GGGAGAACTG 2700 

GCATCACCCC AACATCATCA ACCTCCTGGG GGCCTGTAAG 2760 

CGCTATTGAA TATGCCCCCT ACGGGAACCT GCTAGATTTT 2820 

AGAGACTGAC CCAGCTTTTG CTCGAGAGCA TGGGACAGCC 28 80 

GCTGCTGCGT TTCGCCAGTG ATGCGGCCAA TGGCATGCAG 294 0 

CATCCACAGG GACCTGGCTG CCCGGAATGT GCTGGTCGGA 3000 

TGCAGACTTC GGCCTTTCTC GGGGAGAGGA GGTTTATGTG 3060 

CCCTGTGCGC TGGATGGCCA TTGAGTCCCT GAACTACAGT 3120 

TGTCTGGTCC TTTGGAGTCC TTCTTTGGGA GATAGTGAGC 3180 

TGGCATGACC TGTGCCGAGC TCTATGAAAA GCTGCCCCAG 324 0 

TCGAAACTGT GACGATGAAG TGTACGAGCT GATGCGTCAG 3300 

TGAGCGACCC CCCTTTGCCC AGATTGCGCT ACAGCTAGGC 3360 

GGCCTATGTG AACATGTCGC TGTTTGAGAA CTTCACTTAC 3420 

TGAGGAGGCC TGAG CTGCCA TCCAGCCAGA ACGTGGCTCT 3480 

CTGTCTAACC TGTGACCAGT CTGACCCTTA CAGCCTCTGA 3540 

TTTTTTAACT TAAGGGAGAA AAAAAGGGAT CTGGGGATGG 3600 

GTTCCCATGC TTTGTAGGTG TCTCATAGCT ATCCTGGGCA 366 0 

GCCCCACAGG TGTGTTTCCC ATCCCACTGC TCCCCCAACA 3720 

TTCGCTTAAG CCAGCACTCA CACCACTAAC ATGCCCTGTT 3780 

CTGTCATTCA GAAAAAAATA AATGTTCTAA TAAGCTCCAA 3840 
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GCGCTGGAGA 
AGCCCATCCT 
TGTGTGCCGG 
CGGCCCTCCT 
5 CGGGAGAAGA 
TGGAGGAGAG 
TCCTGCTGGT 
CCTTCAAGAC 
TGAGAGAAAG 
10 ACACGTGGCC 
GCAGAAGGAA 
GCAGCCCTTG 
ACGGCCTGGT 
CTTCTGAAGA 
TCCTTGTCCC 
TTTCCGGCCG 
GGCTGGAGAA 
GGGAGGAGCC 
CTGGCACCCC 
20 ATAAAGTATT 



15 



GGCTGGTGGA 
GTGTCTCCCT 
TGGAGACGGC 
ACGTGGAGCT 
TGAAGCCGGA 
AGACCCCGCA 
ACCTGCCCTC 
GAGGGGCAGG 
AGAGAAGCCA 
TCGTGAGGGG 
AGAAGGGGGC 
CTTTCGGAGC 
GGTGGGAAGG 
TCAGAACATT 
CCGTGATCTC 
AGGTGCCACC 
AGAGCTGCCT 
TGTGCGTCCC 
CACAAGCTGT 
CTAGTGTGGA 



CGTCGTGTCC 
GCTGCGCTGC 
CAATGTCACC 
GACGTTCTCT 
AAGGTGCGGC 
CCCGGCTCGT 
TATTTATTAG 
GAAGGACAGG 
GCCACAGACC 
CAAGCTAGGC 
CCTGCTACCT 
TCCTGTCCAA 
CCGGCAGCGG 
CAGCTCTGGA 
CCCTCACACT 
ACCCTGCCCC 
GGATGAGAAA 
AGCTGAAGGC 
CCCTGCAGGG 
AACGC , 



GAGTACCCCA 
ACCGGCTGCT 
ATGCAGCTCC 
CAGCACGTTC 
GATGCTGTTC 
GTATTTATTA 
CCAACTGTTT 
ACCCTCAGGA 
CCTGGGAGCT 
CCCAGAGGCC 
GTTCTTGGGC 
AGTAGGGATG 
GCGGAGGGGA 
GAACAGTGGT 
TTGCCATTTG 
CACTAAGAGA 
CAGCTCAGCC 
AGTGGCAGGG 
CCATCTGACT 



GCGAGGTGGA 
GCGGCGATGA 
TAAAGATCCG 
GCTGCGAATG 
CCCGGAGGTA 
CCGTCACACT 
CCCTGCTGAA 
ATTCAGTGCC 
TCCGCTTTGA 
CTGGAGGTCT 
CTCAGGCTCT 
CGGATTCTGC 
TTCAGCCACT 
TGCCTGGGGG 
CTTGTACTGG 
CACATACAGA 
AGTGGGGATG 
GAGCAGGTTC 
GCCAAGCCAG 



GCACATGTTC 54 0 

GAATCTGCAC 600 

TTCTGGGGAC 660 

CCGGCCTCTG 720 

^CCCACCCCT 780 

CTTCAGTGAC 840 

TGCCTCGCTC 900 

TTCAACAACG 960 

AAGAAGCAAG 1020 

CCAGGGGCCT 1080 

GCACAGACAA 1140 

TGGGGCCGCC 1200 

TCCCCCTCTT 1260 

CTTTTGCCAC 1320 

GACATTGTTC 1380 

GTGGGCCCCG 1440 

AGGTCACCAG 1500 

CCCAAGGGCC 1560 

ATTCTCTTGA 1620 



25 



Seq ID No: 115 Protein sequence: 
Protein Accession #: NP 002623.1 



11 



30 



35 



40 



45 



50 



55 



60 



65 



70 



21 31 41 51 

I ! I I I 

MPVMRLFPCF LQLLAGLALP AVPPQQWALS AGNGSSEVEV VPFQEVWGRS YCRALERLVD 

WSEYPSEVE HMFSPSCVSL LRCTGCCGDE NLHCVPVETA NVTMQLLKIR SGDRPSYVEL 

TFSQHVRCEC RPLREKMKPE RCGDAVPRR 



60 
120 



Seq ID NO: 116 DNA sequence 

Nucleic Acid Accession NM_007361.1 

Coding sequence: 1-4131 (underlined sequences correspond to start and stop codons) 



11 



75 



ATGGAGGGGG 
CTGCAGTTGC 
GAGTCGTGGT 
GCTGGCGAAT 
ACGGCATCAT 
CCACCGACTT 
GCCGAGTCCT 
TGCGCGCTGG 
CTGGGAGCAG 
AACACTTTCC 
CCTGCCAACG 
CAGCTTCCAG 
CCATATTTCA 
CTGGGGATCC 
AGGCCAGCTG 
TTCAGCCATG 
GTGAATGAGG 
AGCAGCATTG 
ACCTTGGATC 
AAAGGCCAAG 
GACAGAGATT 
ATCCAGCCCT 
CCTGAAGAAG 
CGAGGGACGT 
TATAATGCTG 
TTCTGCACGG 
GGGAAGCACT 
CTCCACGTGG 
GGCAATGATG 
CTCCTCCCCC 
GGCTCTGAGA 
TTCTACCCGG 
AACTACCTGA 
ACAGCCCACA 
ACAAGTTCCA 
ATCCACCAGA 
ACCACCCAGC 
CTTAGATTTG 
GTGAATCCTT 
ACAGGTGTAG 



ACCGGGTGGC 
TAATGTTGCG 
GGGACCAGCT 
CCCCTGCACT 
CTCCACTCAG 
CCCGGCCATC 
GTACCGAGAG 
CTTCCCGCGC 
GTAGGCGCTT 
AGGCAGTTTT 
GCCTGCAGTT 
CTCGGGTGGG 
GCTTGACTAG 
CTGGAGTGTG 
CAGTTGGAGA 
CTACAGCCCT 
AGGAAGCTGA 
ATGTTTCCTT 
CTCACACCAA 
TTGAGCCCTG 
CACTGGCTCC 
ACCCAGATGG 
AAATTGTTCT 
ATGAGGTGGG 
CCAACAAGGA 
ACTATGCCAC 
GTCTGCCTGA 
GCCATACACC 
GCAGAGCCTA 
TCACACCAAT 
ACGG CTTCAG 
GAGAGGAGAC 
GCATTAAGAC 
TCTCTCCCTA 
GAGACTACTC 
ACATCACTTA 
AGCTGAACGT 
CTGTGACCAA 
GCTATGATGG 
ATTACACCTG 



21 
I 

CGGGCGGCCG 
GGCCGCGGCG 
CCTGCAGGAA 
TCTTACGAAG 
GACTTCCCCA 
GCCCCTTTTC 
GACACCTCCC 
TCTGCGCGCT 
ACGAGGAGGT 
GGCATCTGAT 
CCTTGGAACC 
CTTCTGCCGA 
CACTGAACAG 
GGCTTTCCAT 
CCTTTCCGCT 
GGAAAGTGAC 
ATACCTTCCG 
CCAATCCAAA 
AGAAGGAACA 
GGATGAGAGA 
TTCCTGGGAA 
AGGGCCAGTG 
TCGAAGTTAC 
ACTGGAAGAC 
AACCTGTGAA 
TGGCTTCTGC 
GGGGGCACCT 
CGTGCACTTC 
CACGGCCATC 
TGGAGGCCTG 
CCTCGCAGGT 
GGTTCGTATC 
CAACATTCAA 
CAAGGAGCTG 
TCTGACTTTT 
CCAGGTGTGC 
GGACCGGGTC 
TCAAATTGGC 
GAGCCACATG 
TGAGTGCGCA 



31 
I 

GTGCTGTCGT 
CTGCACCCAG 
GGCGACGACG 
CCCGATTCAG 
GGGAAACGCA 
TGGCGGACAT 
CCGCAGTGCT 
TTTTACCCCC 
CAAACGCGGG 
GGGTCTGATA 
CGCCCCAAAG 
GGGGAGGCTG 
TCTGTGAAAA 
ATCGGCAGCA 
GCCCACTCTT 
TATAATGAGG 
GGTGAACCAG 
GTGGATACAA 
TCTCTGGGAG 
GAGACCAGAA 
ACCCCACCAC 
CCTTCGGAAA 
CCTG CTTCAG 
AACATAGGTT 
CACAACCACA 
TGCCACTGCC 
CACCGAGTGA 
ACTGATGTGG 
AGCCACATCC 
TTTGGCTGGC 
GCTGCCTTTA 
ACTCAAACTG 
GGCCAGGTGC 
TACCACTACT 
GGTGCAATCA 
AGGCACGCCC 
TTTGCCTTGT 
CCGGTCAAAG 
TGTGACACAA 
TCTGGGTACC 



41 



51 



CGTTACCAGT 
ACGAGCTCTT 
TAAAGCTCAG 
CAACCTCTAC 
GTATGTGGAC 
CGACACGAGC 
GGGCCTGGCC 
ACCCACGCCT 
CGCTGCCCTC 
GCTACGCCCT 
AGTCTTACAA 
ATGATCTGAA 
ATCTCTATCA 
CTTCCCCGTT 
CTGTTCCCCT 
ACAATTTGGA 
AGGAGGCATT 
AG CCTTTAG A 
AGGTAGGGGG 
GCCCAGCTCC 
CGTACCCCGA 
TGGATGTTCC 
GTCACACTAC 
CCAACACCGA 
GACAATGCTC 
AATCCAAGTT 
ATGGGAAAGT 
ACCTGCATGC 
CACAGCCAGC 
TCTTTGCTTT 
CCCATGACAT 
CTGAGGGACT 
CTTACGTCCC 
CCGACTCCAC 
ACCAAACATG 
CCAGACACCC 
ATAATGATGA 
AAGATTCAGA 
CAGCACGGTG 
AGGGAGATGG 



GCTACTGCTG 
CCCACACGGG 
CCGTGGTGAA 
GTGGGCACCA 
TATGATTTCC 
CACGGCAGAG 
GCCCGCTATG 
TCCTGGCCAC 
GGGAGAGCTG 
CTTTCTTTAT 
TGTCCAGCTT 
GTCAGAAGGA 
ACTAAGCAAC 
GGACAATGTC 
GGGACGTTCC 
TTACTACGAT 
GAATGGCCAC 
GGAATCTTCC 
CCCAGATTTA 
ACCAGAGGTA 
AAACGGAAGC 
CCCAGCTCAT 
ACCCTTAAGT 
GGTCTTCACG 
CCGGCATGCC 
TTATGGAAAT 
GAGTGGCCAC 
GTATATCGTG 
AGCCCAGGCC 
AGAAAAACCT 
GGAAGTTACA 
TGACCCAGAG 
AGCAAATTTC 
TGTGACCTCT 
GTCCTACCGC 
GTCCTTCCCC 
AGAAAGAGTG 
CCCCACTCCG 
CCATCCAGGG 
ACGGAACTGT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 



234 



WO 02/079492 



PCT/US02/049I5 



GTGGATGAAA ATGAATGTGC AACTGGCTTT CATCGCTGTG GCCCCAACTC TGTATGTATC 2460 

AACTTGCCTG GAAGCTACAG GTGTGAGTGC CGGAGTGGTT ATGAGTTTGC AGATGACCGG 2520 

CATACTTGCA TCTTGATCAC CCCACCTGCC AACCCCTGTG AGGATGGCAG TCATACCTGT 2580 

GCTCCTGCTG GGCAGGCCCG GTGTGTTCAC CATGGAGGCA GCACGTTCAG CTGTGCCTGC 264 0 

CTGCCTGGTT ATGCCGGCGA TGGGCACCAG TGCACTGATG TAGATGAATG CTCAGAAAAC 2700 

AGATGTCACC CTGCAGCTAC CTGCTACAAT ACTCCTGGTT CCTTCTCCTG CCGTTGTCAA 2760 

CCCGGATATT ATGGGGATGG ATTTCAGTGC ATACCTGACT CCACCTCAAG CCTGACACCC 2820 

TGTGAACAAC AGCAGCGCCA TGCCCAGGCC CAGTATGCCT ACCCTGGGGC CCGGTTCCAC 28 80 

ATCCCCCAAT GCGACGAGCA GGGCAACTTC CTGCCCCTAC AGTGTCATGG CAGCACTGGT 2940 

TTCTGCTGGT GCGTGGACCC TGATGGTCAT GAAGTTCCTG GTACCCAGAC TCCACCTGGC 3000 

TCCACCCCGC CTCACTGTGG ACCATCACCA GAGCCCACCC AGAGGCCCCC GACCATCTGT 3060 

GAGCGCTGGA GGGAAAACCT GCTGGAGCAC TACGGTGGCA CCCCCCGAGA TGACCAGTAC 3120 

GTGCCCCAGT GCGATGACCT GGGCCACTTC ATCCCCCTGC AGTGCCACGG AAAGAGCGAC 3180 

TTCTGCTGGT GTGTGGACAA AGATGGCAGA GAGGTGCAGG GCACCCGCTC CCAGCCAGGC 3240 

ACCACCCCTG CGTGTATACC CACCGTCGCT CCACCCATGG TCCGGCCCAC GCCCCGGCCA 3300 

GATGTGACCC CTCCATCTGT GGGCACCTTC CTGCTCTATA CTCAGGGCCA GCAGATTGGC 3360 

TACTTACCCC TCAATGGCAC CAGGCTTCAG AAGGATGCAG CTAAGACCCT GCTGTCTCTG 3420 

CATGGCTCCA TAATCG'TGGG AATTGATTAC GACTGCCGGG AGAGGATGGT GTACTGGACA 3480 

GATGTTGCTG GACGGACAAT CAGCCGTGCC GGTCTGGAAC TGGGAGCAGA GCCTGAGACG 3540 

ATCGTGAATT CAGGTCTGAT AAGCCCTGAA GGACTTGCCA TAGACCACAT CCGCAGAACA 3600 

ATGTACTGGA CGGACAGTGT CCTGGATAAG ATAGAGAGCG CCCTGCTGGA TGGCTCTGAG 3660 

CGCAAGGTCC TCTTCTACAC AGATCTGGTG AATCCCCGTG CCATCGCTGT GGATCCAATC 3720 

CGAGGCAACT TGTACTGGAC AGACTGGAAT AGAGAAGCTC CTAAAATTGA AACGTCATCT 3780 

TTAGATGGAG AAAACAGAAG AATTCTGATC AATACAGACA TTGGATTGCC CAATGGCTTA 3840 

ACCTTTGACC CTTTCTCTAA ACTGCTCTGC TGGGCAGATG CAGGAACCAA AAAACTGGAG 3900 

TGTACACTAC CTGATGGAAC TGGACGGCGT GTCATT C AAA ACAACCTCAA GTACCCCTTC 3960 

AGCATCGTAA GCTATGCAGA TCACTTCTAC CACACAGACT GGAGGAGGGA TGGTGTTGTA 4020 

TCAGTAAATA AACATAGTGG CCAGTTTACT GATGAGTATC TCCCAGAACA ACGATCTCAC 4080 

CTCTACGGGA TAACTGCAGT CTACCCCTAC TGCCCAACAG GAAGAAAG TA AG TACAGTAA 4140 

TGTAAAGGAA GACTTGGAGT TTACAATCAG AACCTGGACC CTAAAGAACA GTGACTGCAA 4200 

AGGCAAAGAA AGTAAAAAAG GAATTGGCCA TTAGACGTTC CTGAGCATCC AAGATGAACA 4260 

TTTTGTAGTG CAAAAAGACT TTTGTGAAAA GCTGATACCT CAATCTTTAC TACTGTATTT 4320 

TTAAAAATGA AGGTTGTTAT TGCAAGTTTA AAAAGGTAAC AGAATTTTAA CTGTTGCTTA 4380 

TTAAAGCAAC TTCTTGTAAA CATTTATCAT TAATATTTAA AAGATCAAAT TCATTCAACT 4440 

AAGAATTAGA GTTTAAGACT CTAAACCTGA TTTTTGCCAT GGATTCCTTC TGGCCAAGAA 4500 

ATTAAAGCAC ATGTGATCAA TATAACAATA TAATCCTAAA CCTTGACAGT TGGAGAAGCC 4560 

AATGCAGAAC TGATGGGAAA GGACCAATTA TTTATAGTTT CCCAACAAAA GTTCTAAGAT 4620 

TTTTTACCTC TGCATCAGTG CATTTCTATT TATATCAAAA GGTGCTAAAA TGATTCAATT 4680 

TGCATTTTCT GATCCTGTAG TGCCTCTATA GAAGTACCCA CAGAAAGTAA AGTATCACAT 4740 

TTATAAATAC CAAAGATGTA ACAATTTTAA AATTTTCTAG ATTACTCCAA TAAAGTGTTT 4800 
TAAGTTTAAA AAAAAAAAAA AAAAAAAAA 



Seq ID No: 117 Protein sequence : 
Protein Accession #: NP_031387.l 

1 11 21 31 41 51 

I I I • I I I 

MEGDRVAGRP VLSSLPVLLL LQIiLMLRAAA LHPDELFPHG ESWWDQLLQE GDDVKLSRGE 60 

AGESPALLTK PDSATSTWAP TASSPLRTSP GKRSMWTMIS PPTSRPSPLF WRTSTRATAE 120 

AESCTERTPP PQCWAWPPAM CALASRALRA FYPHPRLPGH LGAGRRLRGG QTRALPSGEL 180 

NTFQAVLASD GSDSYALFLY PANGLQFLGT RPKESYNVQL QLPARVGFCR GEADDLKSEG 240 

PYFSLTSTEQ SVKNLYQLSN LGIPGVWAFH IGSTSPLDNV RPAAVGDLSA AHSSVPLGRS 300 

FSHATALESD YNEDNLDYYD VNEEEAEYLP GEPEEALNGH SSIDVSFQSK VDTKPLEESS 360 

TLDPHTKEGT SLGEVGGPDL KGQVEPWDER ETRSPAPPEV DRDSLAPSWE TPPPYPENGS 420 

IQPYPDGGPV PSEMDVPPAH PEEEIVLRSY PASGHTTPLS RGTYEVGLED NIGSNTEVFT 480 

YNAANKETCE HNHRQCSRHA FCTDYATGFC CHCQSKFYGN GKHCLPEGAP HRVNGKVSGH 540 

LHVGHTPVHF TDVDLHAYIV GNDGRAYTAI SHIPQPAAQA LLPLTPIGGL FGWLFALEKP 600 

GSENGFSLAG AAFTHDMEVT FYPGEETVRI TQTAEGLDPE NYLSIKTNIQ GQVPYVPANF 660 

TAHISPYKEL YHYSDSTVTS TSSRDYSLTF GAINQTWSYR IHQNITYQVC REAPRHPSFP 720 

TTQQLNVDRV FALYNDEERV LRFAVTNQIG PVKEDSDPTP VNPCYDGSHM CDTTARCHPG 780 

TGVDYTCECA SGYQGDGRNC VDENECATGF HRCGPNSVCI NLPGSYRCEC RSGYEFADDR 840 

HTCILITPPA NPCEDGSHTC APAGQARCVH HGGSTFSCAC LPGYAGDGHQ CTDVDECSEN 900 

RCHPAATCYN TPGSFSCRCQ PGYYGDGFQC IPDSTSSLTP CEQQQRHAQA QYAYPGARFH 960 

IPQCDEQGNF LPLQCHGSTG FCWCVDPDGH EVPGTQTPPG STPPHCGPSP EPTQRPPTIC 1020 

ERWRENLLEH YGGTPRDDQY VPQCDDLGHF IPLQCHGKSD FCWCVDKDGR EVQGTRSQPG 1080 

TTPACIPTVA PPMVRPTPRP DVTPPSVGTF LLYTQGQQIG YLPLNGTRLQ KDAAKTLLSL 1140 

HGSIIVGIDY DCRERMVYWT DVAGRTISRA GLELGAEPET IVNSGLISPE GLAIDHIRRT 1200 

MYWTDSVIiDK IESALLDGSE RKVLFYTDLV NPRAIAVDPI RGNLYWTDWN REAPKIETSS 1260 

LDGENRRILI NTDIGLPNGL TFDPFSKLLC WADAGTKKLE CTLPDGTGRR VIQNNLKYPF 1320 
SIVSYADHFY HTDWRRDGW SVNKHSGQFT DEYLPEQRSH LYGITAVYPY CPTGRK 

Seq ID NO: 118 DNA sequence 

Nucleic Acid Accession #: NM_003088.1 

Coding sequence: 112-1593 (underlined sequences correspond to start and stop codons) 
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1 11 21 31 41 51 

I I I I I I 

GCGGAGGGTG CGTGCGGGCC GCGGCAGCCG AACAAAGGAG CAGGGGCGCC GCCGCAGGGA 60 

CCCGCCACCC ACCTCCCGGG GCCGCGCAGC GGCCTCTCGT CTACTGCCAC CATGACCGCC 120 

AACGGCACAG CCGAGGCGGT GCAGATCCAG TTCGGCCTCA TCAACTGCGG CAACAAGTAC 180 

CTGACGGCCG AGGCGTTCGG GTTCAAGGTG AACGCGTCCG CCAGCAGCCT GAAGAAGAAG 240 

CAGATCTGGA CGCTGGAGCA GCCCCCTGAC GAGGCGGGCA GCGCGGCCGT GTGCCTGCGC 300 

AGCCACCTGG GCCGCTACCT GGCGGCGGAC AAGGACGGCA ACGTGACCTG CGAGCGCGAG 360 

GTGCCCGGTC CCGACTGCCG TTTCCTCATC GTGGCGCACG ACGACGGTCG CTGGTCGCTG 420 

CAGTCCGAGG CGCACCGGCG CTACTTCGGC GGCACCGAGG ACCGCCTGTC CTGCTTCGCG 480 

CAGACGGTGT CCCCCGCCGA GAAGTGGAGC GTGCACATCG CCATGCACCC TCAGGTCAAC 540 

ATCTACAGTG TCACCCGTAA GCGCTACGCG CACCTGAGCG CGCGGCCGGC CGACGAGATC 600 

GCCGTGGACC GCGACGTGCC CTGGGGCGTC GACTCGCTCA TCACCCTCGC CTTCCAGGAC 660 

CAGCGCTACA GCGTGCAGAC CGCCGACCAC CGCTTCCTGC GCCACGACGG GCGCCTGGTG 720 

GCGCGCCCCG AGCCGGCCAC TGGCTACACG CTGGAGTTCC GCTCCGGCAA GGTGGCCTTC 780 

CGCGACTGCG AGGGCCGTTA CCTGGCGCCG TCGGGGCCCA GCGGCACGCT CAAGGCGGGC 840 

AAGGCCACCA AGGTGGGCAA GGACGAGCTC TTTGCTCTGG AGCAGAGCTG CGCCCAGGTC 900 

GTGCTGCAGG CGGCCAACGA GAGGAACGTG TCCACGCGCC AGGGTATGGA CCTGTCTGCC 960 

AATCAGGACG AGGAGACCGA CCAGGAGACC TTCCAGCTGG AGATCGACCG CGACACCAAA 1020 

AAGTGTGCCT TCCGTACCCA CACGGGCAAG TACTGGACGC TGACGGCCAC CGGGGGCGTG 1080 

CAGTCCACCG CCTCCAGCAA GAATGCCAGC TGCTACTTTG ACATCGAGTG GCGTGACCGG 1140 

CGCATCACAC TGAGGGCGTC CAATGGCAAG TTTGTGACCT CCAAGAAGAA TGGGCAGCTG 1200 

GCCGCCTCGG TGGAGACAGC AGGGGACTCA GAGCTCTTCC TCATGAAGCT CATCAACCGC 1260 

CCCATCATCG TGTTCCGCGG GGAGCATGGC TTCATCGGCT GCCGCAAGGT CACGGGCACC 1320 

CTGGACGCCA ACCGCTCCAG CTATGACGTC TTCCAGCTGG AGTTCAACGA TGGCGCCTAC 1380 

AACATCAAAG ACTCCACAGG CAAATACTGG ACGGTGGGCA GTGACTCCGC GGTCACCAGC 1440 

AGCGGCGACA CTCCTGTGGA CTTCTTCTTC GAGTTCTGCG ACTATAACAA GGTGGCCATC 1500 

AAGGTGGGCG GGCGCTACCT GAAGGGCGAC CACGCAGGCG TCCTGAAGGC CTCGGCGGAA 1560 

ACCGTGGACC CCGCCTCGCT CTGGGAGTAC TAG GGCCGGC CCGTCCTTCC CCGCCCCTGC 1620 

CCACATGGCG GCTCCTGCCA ACCCTCCCTG CTAACCCCTT CTCCGCCAGG TGGGCTCCAG 1680 

GGCGGGAGGC AAGCCCCCTT GCCTTTCAAA CTGGAAACCC CAGAGAAAAC GGTGCCCCCA 1740 

CCTGTCGCCC CTATGGACTC CCCACTCTCC CCTCCGCCCG GGTTCCCTAC TCCCCTCGGG 1800 

TCAGCGGCTG CGGCCTGGCC CTGGGAGGGA TTTCAGATGC CCCTGCCCTC TTGTCTGCCA 1860 

CGGGGCGAGT CTGGCACCTC TTTCTTCTGA CCTCAGACGG CTCTGAGCCT TATTTCTCTG 1920 

GAAGCGGCTA AGGGACGGTT GGGGGCTGGG AGCCCTGGGC GTGTAGTGTA ACTGGAATCT 1980 

TTTGCCTCTC CCAGCCACCT CCTCCCAGCC CCCCAGGAGA GCTGGGCACA TGTCCCAAGC 204 0 

CTGTCAGTGG CCCTCCCTGG TGCACTGTCC CCGAAACCCC TGCTTGGGAA GGGAAGCTGT 2100 

CGGGAGGGCT AGGACTGACC CTTGTGGTGT TTTTTTGGGT GGTGGCTGGA AACAGCCCCT 2160 

CTCCCACGTG GGAGAGGCTC AGCCTGGCTC CCTTCCCTGG AGCGGCAGGG CGTGACGGCC 2220 

ACAGGGTCTG CCCGCTGCAC GTTCTGCCAA GGTGGTGGTG GCGGGCGGGT AGGGGTGTGG 2280 

GGGCCGTCTT CCTCCTGTCT CTTTCCTTTC ACCCTAGCCT GACTGGAAGC AGAAAATGAC 2340 

CAAATCAGTA TTTTTTTTAA TGAAATATTA TTGCTGGAGG CGTCCCAGGC AAGCCTGGCT 2400 

GTAGTAGCGA GTGATCTGGC GGGGGGCGTC TCAGCACCCT CCCCAGGGGG TGCATCTCAG 2460 

CCCCCTCTTT CCGTCCTTCC CGTCCAGCCC CAGCCCTGGG CCTGGGCTGC CGACACCTGG 2520 

GCCAGAGCCC CTGCTGTGAT TGGTGCTCCC TGGGCCTCCC GGGTGGATGA AGCCAGGCGT 2580 

CGCCCCCTCC GGGAGCCCTG GGGTGAGCCG CCGGGGCCCC CCTGCTGCCA GCCTCCCCCG 2640 

TCCCCAACAT GCATCTCACT CTGGGTGTCT TGGTCTTTTA TTTTTTGTAA GTGTCATTTG 2700 

TATAACTCTA AACGCCCATG ATAGTAGCTT CAAACTGGAA ATAGCGAAAT AAAATAACTC 2760 
AGTCTGC 



Seq ID No: 119 Protein sequence : 
Protein Accession #: NP_003079.1 

1 11 21 31 41 51 

I I I I I I 

MTANGTAEAV QIQFGLINCG NKYLTAEAFG FKVNASASSL KKKQIWTLEQ PPDEAGSAAV 60 

CLRSHLGRYL AADKDGNVTC EREVPGPDCR FLIVAHDDGR WSLQSEAHRR YFGGTEDRLS 120 

CFAQTVSPAE KWSVHIAMHP QVNIYSVTRK RYAHLSARPA DEIAVDRDVP WGVDSLITLA 180 

FQDQRYSVQT ADHRFLRHDG RLVARPEPAT GYTLEFRSGK VAFRDCEGRY LAPSGPSGTL 240 

KAGKATKVGK DELFALEQSC AQWLQAANE RNVSTRQGMD LSANQDEETD QETFQLEIDR 300 

DTKKCAFRTH TGKYWTLTAT GGVQSTASSK NASCYFDIEW RDRRITLRAS NGKFVTSKKN 360 

GQLAASVETA GDSELFLMKL INRPIIVFRG EHGFIGCRKV TGTLDANRSS YDVFQLEFND 420 

GAYNIKDSTG KYWTVGSDSA VTSSGDTPVD FFFEFCDYNK VAIKVGGRYL KGDHAGVLKA 4 80 
SAETVDPASL WEY 

Seq ID NO: 120 DNA sequence 

Nucleic Acid Accession #t NM_006404.1 

Coding sequence: 25-741 (underlined sequences correspond to start and stop 



1 11 21 31 41 51 

I I I I I I 

CAGGTCCGGA GCCTCAACTT CAGGATGTTG ACAACATTGC TGCCGATACT GCTGCTGTCT 60 
GGCTGGGCCT TTTGTAGCCA AGACGCCTCA GATGGCCTCC AAAGACTTCA TATGCTCCAG 120 
ATCTCCTACT TCCGCGACCC CTATCACGTG TGGTACCAGG GCAACGCGTC GCTGGGGGGA 180 



236 



WO 02/079492 



PCT/US02/04915 



CACCTAACGC 
TTGCAGGAGC 
TTCCACGGCC 
CGCTGCTTCC 
GTGGCTGTGA 
GACACCCAGG 
CGCACTCGGT 
CATATTTCCG 
CTGGGCGTCC 
ACAGGTGGAC 
GGCTGGCAAG 
AGGTTTGGAG 
GGAGATGGAG 
GCTTTGCTGA 
GAGTTGGGGC 
TCAAAAGATA 
CAGGTGTGTC 
GAAGTGGTGG 
AATATTAATA 



ACGTGCTGGA 
CCGAGAGCTG 
TCGTGCGCCT 
TGGGCTGTGA 
ATGGGAGCTC 
TCACCTCCGG 
ATGAACTGCG 
CGGAAAACAC 
TGGTGGGCGG 
GGCGATGTTA 



GGAAAGTTTC 
TGACAGCTCC 
AGGAGAGGTG 
ATTAGTCTGA 
AGGAAGCCTA 
TAACCAAATA 
AGACTTGGGA 
AAATGTAAAA 
AATTTCTTAT 



AGGCCCAGAC 
GGCGCGCACG 
GGTGCACCAG 
GCTGCCTCCC 
CTTTGTGAGT 
AGTGGTCACC 
GGAATTCCTG 
GAAAGGGAGC 
TTTCATCATT 
ATTACTCTCC 
AGCTCACTGT 
TTTCTTCTCC 
GACAAAGTAC 
TAAGTGAATG 
TGCGCCATCC 
AACAAGTCAT 
TGGGACGCTG 
TCCAAGTCAT 
ATTT 



ACCAACACCA 
CAGAGTGGCC 
GAGCGGACCT 
GAGGGCTCTA 
TTCCGGCCGG 
TTCACCCTGC 
GAGGACACCT 
CAAACAAGCC 
GCTGGTGTGG 
AGCCCCGTCA 
GAAGCCAGAC 
CACATCTGCC 
TTGGTTTGCT 
TTTATCTATC 
TCCAAAGACA 
CCACAATCAA 
ATATAATAGG 
ATGGCAGTGA 



CGATCATTCA 
TGCAGTCCTA 
TGGCCTTTCC 
GAGCCCATGT 
AGAGAGCCTT 
AGCAGCTCAA 
GTGTGCAGTA 
GCTCCTACAC 
CTGTAGGCAT 
GAAGGGGCTG 
TCCCCAACTG 
CACTGAAGAT 
AAGAACCTAA 
TTTGTGGAAA 
GACAGAATCA 
AATACAACAT 
GTAGAAAGAA 
TCAATTATTA 



GCTGCAGCCC 240 

CCTGCTCCAG 300 

TCTGACCATC 360 

CTTCTTCGAA 420 

GTGGCAGGCA 480 

TGCCTACAAC 540 

TGTGCAGAAA 600 

TTCGCTGGTC 660 

CTTCCTGTGC 720 

GATTGATGGA 7 BO 

AAACACCAGA 840 

TTGAGGGAGG 900 

GAACGTGTAT 960 

ACAGATAATG 1020 

CCTGAGGCGT 1080 

TCAATACTTC 1140 

GTAACACGAA 1200 

ATCAATTAAT 1260 



Seq ID No: 121 Protein sequence; 
Protein Accession #: NP 006395.1 



1 11 21 31 41 51 

I I I I I I 

MLTTLLPILL LSGWAFCSQD ASDGLQRLHM LQI SYFRDPY HVWYQGNASL GGHLTHVLEG 
PDTNTTIIQL QPLQEPESWA RTQSGLQSYL LQPHGLVRLV HQERTLAFPL TIRCFLGCEL 
PPEGSRAHVF FEVAVNGSSF VSFRPERALW QADTQVTSGV VTFTLQQLNA YNRTRYELRE 
FLEDTCVQYV QKHISAENTK GSQTSRSYTS LVLGVLVGGF IIAGVAVGIF LCTGGRRC 



60 
120 
180 



Seq ID NO: 122 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 2-505 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I i 

CGAGAAGCTG GGAGAGACAC CACTTGTCCC TGAACAAGAC AATTCAGTAA CATCTATTCC 60 

TGAGATTCCT CGATGGGGAT CACAGAGCAC GATGTCTACC CTTCAAATGT CCCTTCAAGC 120 

CGAGTCAAAG GCCACTATCA CCCCATCAGG GAGCGTGATT TCCAAGTTTA ATTCTACGAC 180 

TTCCTCTGCC ACTCCTCAGG CTTTCGACTC CTCCTCTGCC GTGGTCTTCA TATTTGTGAG 24 0 

CACAGCAGTA GTAGTGTTGG TGATCTTGAC CATGACAGTA CTGGGGCTTG TCAAGCTCTG 3 00 

CTTTCACGAA AGCCCCTCTT CCCAGCCAAG GAAGGAGTCT ATGGGCCCGC CGGGCCTGGA 360 

GAGTGATCCT GAGCCCGCTG CTTTGGGCTC CAGTTCTGCA CATTGCACAA ACAATGGGGT 420 

GAAAGTCGGG GACTGTGATC TGCGGGACAG AGCAGAGGGT GCCTTGCTGG CGGAGTCCCC 4 80 

TCTTGGCTCT AGTGATGCA T AG GGAAACAG GGGACATGGG CACTCCTGTG AACAGTTTTT 54 0 

CACTTTTGAT GAAACGGGGA ACCAAGAGGA ACTTACTTGT GTAACTGACA ATTTCTGCAG 600 

AAATCCCCCT TCCTCTAAAT TCCCTTTACT CCACTGAGGA GCTAAATCAG AACTGCACAC 66 0 

TCCTTCCCTG ATGATAGAGG AAGTGGAAGT GCCTTTAGGA TGGTGATACT GGGGGACCGG 72 0 

GTAGTGCTGG GGAGAGATAT TTTCTTATGT TTATTCGGAG AATTTGGAGA AGTGATTGAA 780 

CTTTTCAAGA CATTGGAAAC AAATAGAACA CAATATAATT TACATTAAAA AATAATTTCT 840 

ACCAAAATGG AAAGGAAATG TTCTATGTTG TTCAGGCTAG GAGTATATTG GTTCGAAATC 900 
CCAGGGAAAA AAATAAAAAT AAAAAATTAA AGGATTGTTG ATAAAA 

Seq ID NO: 123 Protein sequence: 
Protein Accession #: none found 

1 11 21 31 41 51 

I I i i i i 

EKLGETPLVP EQDNSVTSIP EIPRWGSQST MSTLQMSLQA ESKATITPSG SVISKFNSTT 60 

SSATPQAFDS SSAWFIFVS TAWVLVILT MTVLGLVKLC FHESPSSQPR KESMGPPGLE 120 
SDPEPAALGS SSAHCTNNGV KVGDCDLRDR AEGALLAESP LGSSDA 



Seq ID NO: 124 DNA sequence 

Nucleic Acid Accession #: NM_006500.1 

Coding sequence: 27-1967 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

l I I l I I 

ACTTGCGTCT CGCCCTCCGG CCAAGC ATGG GGCTTCCCAG GCTGGTCTGC GCCTTCTTGC 60 

TCGCCGCCTG CTGCTGCTGT CCTCGCGTCG CGGGTGTGCC CGGAGAGGCT GAGCAGCCTG 120 

CGCCTGAGCT GGTGGAGGTG GAAGTGGGCA GCACAGCCCT TCTGAAGTGC GGCCTCTCCC 180 

AGTCCCAAGG CAACCTCAGC CATGTCGACT GGTTTTCTGT CCACAAGGAG AAGCGGACGC 240 

TCATCTTCCG TGTGCGCCAG GGCCAGGGCC AGAGCGAACC TGGGGAGTAC GAGCAGCGGC 3 00 

TCAGCCTCCA GGACAGAGGG GCTACTCTGG CCCTGACTCA AGTCACCCCC CAAGACGAGC 360 
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GCATCTTCTT GTGCCAGGGC AAGCGCCCTC 
TCTACAAAGC TCCGGAGGAG CCAAACATCC 
GTAAGGAGCC TGAGGAGGTC GCTACCTGTG 
TCATCTGGTA CAAGAATGGC CGGCCTCTGA 
5 CGTCCCAGAC TGTGGAGTCG AGTGGTTTGT 
TGGTTAAAGA AGACAAAGAT GCCCAGTTTT 
GGAACCACAT GAAGGAGTCC AGGGAAGTCA 
TGTGGCTGGA AGTGGAGCCC GTGGGAATGC 
GTTTGGCTGA TGGCAACCCT CCACCACACT 

10 GGGAGGCAGA GGAAGAGACA ACCAACGACA 
AGGAACACAG TGGGCGCTAT GAATGTCAGG 
TGAGTGAACC ACAGGAACTA CTGGTGAACT 
CCCCTGAGAG ACAGGAAGGC AGCAGCCTCA 
ACCTCGAGTT CCAGTGGCTG AGAGAAGAGA 

15 TTCAGTTGCA TGACCTGAAA CGGGAGGCAG 
CCAGCATACC CGGCCTGAAC CGCACACAGC 
GGATGGCATT CAAGGAGAGG AAGGTGTGGG 
GTGAAGCGTC AGGGCACCCC CGGCCCACCA 
AACAAGACCA AGATCCACAG CGAGTCCTGA 

20 TGTTGGAGAC AGGTGTTGAA TGCACGGCCT 
TCTTCCTGGA GCTGGTCAAT TTAACCACCC 
TCAGCACTTC CACTGCCAGT CCTCATACCA 
TGCCGGAGCC GGAGAGCCGG GGCGTGGTCA 
TGGCGGTGCT GGGCGCTGTC CTCTATTTCC 

25 GCTCAGGGAA GCAGGAGATC ACGCTGCCCC 
TTAAGTCAGA TAAGCTCCCA GAAGAGATGG 
GGGCTCCGGG AGACCAGGGA GAGAAATACA 
CAGCTCCCTT CCCTGCCTGG ACCATTCCCA 
CCTCCAAAGG GACTAGAGAG AAGCCTCCTG 

30 GGCCACTGGG TTAGGACCTG AGGACCTCAC 
GTCCACCACC ATCTCCTGCA CGTTGAGTGA 
CCGAGCGGGT AGGAGAGTTT CTTGCAGAAC 
AAATACC TGG CTCCTGCCAG CAGCTGAGCT 
CAAAGGCTGG CTTCCACCAT CCAGGTGCAC 

35 GCCTGCTCAT GTTGAAGTGC GCTGTTCACA 
AGAAGCAGCT GCAGTGTTGC TGCCACCACC 
ACATTTTTTC TTTGGTCAGA AGCCAGGAAC 
GGCCAGGTGT GGTGGCTCAC GCCTGTAATC 
TCACAAAGTC AGGACGAGAC CATCCTGGCT 

40 TACAAAAAAA AATTAGCTAG GCGTAGTGGT 
CTGAAGCAGG AGAATGGTAT GAATCCAGGA 
CACTGCACTC CAGCCTGGGC AACACAGCGA 
ACGCGTACCT GCGGTGAGGA AGCTGGGCGC 
TCCCCGTGTT CACTTGCTCC CATAGCCCTC 

45 GGGGAGCAGA CAAAGATGAG GTCTACACTG 
TTAGCACCAA ACTTCTACAA ACCAAGCTCA 
AGAATGGTAC TTAGGGATGG AAAACGGGGC 
CTGTGTGTAT GCATACATAT GTGTGTATAT 
TTGTTTCCTT TATATATGTA TGTATATATA 

50 AAAGCTTAAT TGTCCCAGAA AATCATACAT 
AACCTGGGGG CCTGTGAAAC TACAACCAAA 
AGAGATCAGG GGTTACCTCT GCTTCTGAGC 
CTACCCTACT TTTCAGCAGC AAAACGTCCC 
TGTTAGCAGG AGCTATGTCC CTTCCTATCG 

55 

Seq ID No: 125 Protein sequence: 
Protein Accession #: NP 006491.1 



GGTCCCAGGA GTACCGCATC CAGCTCCGCG 420 

AGGTCAACCC CCTGGGCATC CCTGTGAACA 480 

TAGGGAGGAA CGGGTACCCC ATTCCTCAAG 540 

AGGAGGAGAA GAACCGGGTC CACATTCAGT 600 

ACACCTTGCA GAGTATTCTG AAGGCACAGC 660 

ACTGTGAGCT CAACTACCGG CTGCCCAGTG 720 

CCGTCCCTGT TTTCTACCCG ACAGAAAAAG 780 

TGAAGGAAGG GGACCGCGTG GAAATCAGGT 840 

TCAGCATCAG CAAGCAGAAC CCCAGCACCA 900 

ACGGGGTCCT GGTGCTGGAG CCTGCCCGGA 960 

CCTGGAACTT GGACACCATG ATATCGCTGC 102 0 

ATGTGTCTGA CGTCCGAGTG AGTCCCGCAG 1080 

CCCTGACCTG TGAGGCAGAG AGTAGCCAGG 1140 

CAGACCAGGT GCTGGAAAGG GGGCCTGTGC 1200 

GAGGCGGCTA TCGCTGCGTG GCGTCTGTGC 1260 

TGGTCAAGCT GGCCATTTTT GGCCCCCCTT 1320 

TGAAAGAGAA TATGGTGTTG AATCTGTCTT 1380 

TCTCCTGGAA CGTCAACGGC ACGGCAAGTG 1440 

GCACCCTGAA TGTCCTCGTG ACCCCGGAGC 1500 

CCAACGACCT GGGCAAAAAC ACCAGCATCC 1560 

TCACACCAGA CTCCAACACA ACCACTGGCC 1620 

GAGCCAACAG CACCTCCACA GAGAGAAAGC 1680 

TCGTGGCTGT GATTGTGTGC ATCCTGGTCC 1740 

TCTATAAGAA GGGCAAGCTG CCGTGCAGGC 1800 

CGTCTCGTAA GACCGAACTT GTAGTTGAAG 1860 

GCCTCCTGCA GGGCAGCAGC GGTGACAAGA 1920 

TCGATCTGAG GCATTAGCCC CGAATCACTT 1980 

GCTCCCTGCT CACTCTTCTC TCAGCCAAAG 2040 

CTCCCCTCAC CTGCACACCC CCTTTCAGAG 2100 

TTGGCCCTGC AAGCCGCTTT TCAGGGACCA 2160 

AGCTCATCCC AAGCAAGGAG CCCCAGTCTC 2220 

GTGTTTTTTC TTTACACACA TTATGGCTGT 2280 

GGGTAGCCTC TCTGAGCTGG TTTCCTGCCC 2340 

CACTGAAGTG AGGACACACC GGAGCCAGGC 2400 

CCCGCTCCGG AGAGCACCCC AGCGGCATCC 2460 

CTCCTGCTCG CCTCTTCAAA GTCTCCTGTG 2520 

TGGTGTCATT CCTTAAAAGA TACGTGCCGG 2580 

CCAGCACTTT GGGAGGCCGA GGCGGGCGGA 264 0 

AACACGGTGA AACCCTGTCT CTACTAAAAA 2700 

TGGCACCTAT AGTCCCAGCT ACTCGGAAGG 2760 

GGTGGAGCTT GCAGTGAGCC GAGACCGTGC 2820 

GACTCCGTCT CGAGGAAAAA AAAAGAAAAG 2880 

TGTTTTCGAG TTCAGGTGAA TTAGCCTCAA 2940 

TTGATGGATC ACGTAAAACT GAAAGGCAGC 3000 

TCCTTCATGG GGATTAAAGC TATGGTTATA 306 0 

GGGCCCCAAC CCTAGAAGGG CCCAAATGAG 3120 

CTGGCTAGAG CTTCGGGTGT GTGTGTCTGT 3180 

ATGGTTTTGT CAGGTGTGTA AATTTGCAAA 324 0 

TATATGAAAA TAT AT AT ATA TATGAAAAAT 3300 

TGCTTTTTTA TTCTACATGG GTACCACAGG 3360 

AGGCACACAA AACCGTTTCC AGTTGGCAGC 3420 

AAATGGCTCA AGCTCTACCA GAGCAGACAG 34 80 

GTATGACGCA GCACGAAGGG CCTGGCAGGC 3540 
TTTCCGTCCA CTT 



60 1 11 21 31 41 51 

I I I I I I 

MGLPRLVCAF LLAACCCCPR VAGVPGEAEQ PAPELVEVEV GSTALLKCGL SQSQGNLSHV 60 

DWFSVHKEKR TLIFRVRQGQ GQSEPGEYEQ RLSLQDRGAT LALTQVTPQD ERIFLCQGKR 120 

PRSQEYRIQL RVYKAPEEPN IQVNPLGIPV NSKEPEEVAT CVGRNGYPIP QVIWYKNGRP 180 

65 LKEEKNRVHI QSSQTVESSG LYTLQSILKA QLVKEDKDAQ FYCELNYRLP SGNHMKESRE 240 

VTVPVFYPTE KVWLEVEPVG MLKEGDRVEI RCLADGNPPP HFSISKQNPS TREAEEETTN 300 

DNGVLVLEPA RKEHSGRYEC QAWNLDTMIS LLSEPQELLV NYVSDVRVSP AAPERQEGSS 360 

LTLTCEAESS QDLEFQWLRE ETDQVLERGP VLQLHDLKRE AGGGYRCVAS VPSIPGLNRT 420 

QLVKIiAIFGP PWMAFKERKV WVKENMVLNIj SCEASGHPRP TISWNVNGTA SEQDQDPQRV 480 

70 LSTLNVLVTP ELLETGVECT ASNDLGKNTS ILFLELVNLT TLTPDSNTTT GLSTSTASPH 540 

TRANSTSTER KLPEPESRGV VIVAVIVCIL VLAVLGAVLY FLYKKGKLPC RRSGKQEITL 600 
PPSRKTELW EVKSDKLPEE MGLLQGSSGD KRAPGDQGEK YIDLRH 



75 Seq ID NO: 126 DNA sequence 

Nucleic Acid Accession #: NM 001955.1 



238 
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Coding sequence: 337-975 (underlined sequences correspond to start and stop'codons) 



15 



GG AG CTGTTT 
AAGTCAGACG 
AGCTCTCCAC 
AGGCGCTGCC 
1 0 AGGAACCCGC 
GTTTGAACGG 
TCTCTGCTGT 
AGCGCGGTGG 
CGGTCCAAGC 
CTGGACATCA 
AGGTCCAAGA 
TGCCAATGTG 
CTCAGGGCTG 
TCCAAGCTTG 
AGTTCAGAGG 
TCTTTTCATG 
CGAGCACATT 
TGTGGCCGAC 
TTCCTGACTG 
25 TCCCCCAACC 
TGGGGATGAC 



20 



n 
I 

ACCCCCACTC 
CGCCTCTGCA 
CACCGCCGCG 
TTTTCTCCCC 
AGCGCTTTGA 
GAGGTTTTTG 
TTGTGGCTTG 
GTGAGAACGG 
GCTGCTCCTG 
TTTGGGTCAA 
GAGCCTTGGA 
CTAGCCAAAA 
AAGACATTAT 
GGAAAAAGTG 
AACACCTAAG 
ATCCCAAGCT 
G GTGA CAGAC 
TCTGCACTCT 
GCAAAGGACC 
ATCTTCACTG 
AATGGACCTC 



21 



TAATAGGGGT 
TCTGCGCCAG 
TGCGCCTGCA 
GTTAAAGGGC 
GGGACCTGAA 
ATCCCTTTTT 
CCAAGGAGCT 
CGGGGAGAAA 
CTCGTCCCTG 
CACTCCCGAG 
GAATTTACTT 
AGACAAGAAG 
GGAGAAAGAC 
TATTTATCAG 
ACAAACCAGG 
GAAAGGCAAG 
TTCGGGGCCT 
CCACCCTGGC 
AGCGTCCTCG 
GCTTCCATCA 
TCAGCAGAAA 



31 
I 

TCAATATAAA 
GCGAACGGGT 
GACGCTCCGC 
ACTTGGGCTG 
GCTGTTTTTC 
TTCAG AATG G 
CCAGAAACAG 
CCCACTCCCA 
ATGGATAAAG 
CACGTTGTTC 
CCCACAAAGG 
TGCTGGAATT 
TGGAATAATC 
CAGTTAGTGA 
TCGGAGACCA 
CCCTCCAGAG 
GTCTGAAGCC 
TGGGATCAGA 
TTCAAAACAT 
GTGGTAACTG 
CACACAGTCA 



41 
I 

AAGCCGGCAG 
CCTGCGCCTC 
TCGCTGCCTT 
AAGGATCGCT 
TTCGTTTTCC 
ATTATTTGCT 
CAGTCTTAGG 
GTCCACCCTG 
AGTGTGTCTA 
CGTATGGACT 
CAACAGACCG 
TTTGCCAAGC 
ATAAGAAAGG 
GAGGAAGAAA 
TGAGAAACAG 
AGCGTTATGT 
ATAGCCTCCA 
GCAGGAGCAT 
TCCAAGAAAG 
CTTTGGTCTC 
CATTCGAATT 



51 



AGAGCTGTCC 60 

CTGCAGTCCC 120 

CTCTCCTGGC 180 

TTGAGATCTG 24 0 

TTTGGGTTCA 300 

CATGATTTTC 360, 

CGCTGAGCTC 420 

GCGGCTCCGC 480 

CTTCTGCCAC 540 

TGGAAGCCCT 600 

TGAGAATAGA 660 

AGGAAAAGAA 720 

AAAAGACTGT 780 

AATCAGAAGA 840 

CGTCAAATCA 900 

GACCCACAAC 960 

CGGAGAGCCC 1020 

CCTCTGCTGG 1080 

GTTAAGGAGT 1140 

TTCTTTCATC 1200 
C 



30 



Seq ID No: 127 Protein sequence; 
Protein Accession #: NP 001946.1 



1 11 21 31 41 51 

I I I I I I 

MDYLLMIFSL LFVACQGAPE TAVLGAELSA VGENGGEKPT PSPPWRLRRS KRCSCSSLMD 

35 KECVYFCHLD IIWVNTPEHV VPYGLGSPRS KRALENLLPT KATDRENRCQ CASQKDKKCW 

NFCQAGKELR AEDIMEKDWN NHKKGKDCSK LGKKCIYQQL VRGRKIRRSS EEHLRQTRSE 
TMRNSVKSSF HDPKLKGKPS RERYVTHNRA HW 



60 
120 
180 



40 Seq ID NO: 128 DNA sequence 

Nucleic Acid Accession #: NM_001721.1 

Coding sequence: 34-2061 (underlined sequences correspond to start and stop codons) 



45 



ii 



21 



31 



41 



51 



55 



GCAAGCACGG 
CTTCTTCTCA 
50 CTTTTTGTTT 
AGCAGAAAAG 
GAGCAGACGC 
TATGTCTATG 
ATAAGGGGTA 
TTCCTGTGTT 
GCTAATCTGC 
GTGCTGAAGA 
ACTCTAGCCC 
AGTACCAGTC 
60 TTCAACATGC 
AAAAGTAGCA 
CACACCACCT 
AACCTGGATG 
CTCAGACAAA 
65 TACACAGTGT 
CACGTGCATA 
ATTCCAAAGC 
CACCCTGTGT 
TGGGAACTGA 
70 GTGGTCCAGC 
GGCTCCATGT 
CCCAAGCTGG 
GAATATATAA 
CCTTCCCAGC 
75 CACCAATTCA 
GTGAAAGTAT 



AACAAGCTGA 
AAAGATCACA 
TGACCAAAAC 
GATCCATTGA 
CTGTAGAGAG 
CATCAAATGA 
ACCCCCACCT 
GCCAGCAGAG 
ATACTGCAGT 
TACCTCGGGC 
AATATGACAA 
TAGCGCAATA 
AGTATATTCC 
GCAGCAGTGA 
CAAAGATTTC 
ATTATGACTG 
AGGGAAAAGA 
CCTTATTTAG 
CAAATGCTGA 
TTATTCATTA 
CAACAAAGGC 
AAAGAGAAGA 
TGGGCAAGTG 
CAGAAGATGA 
TTAAATTCTA 
GCAATGGCTG 
TCTTAGAAAT 
TACACCGGGA 
CTGACTTTGG 



GACGGATGAT 
GCAAAAGAAG 
AAACCTTTCC 
AATTAAGAAA 
ACAGTACCCA 
AGAGAGCCGA 
GCTGGTCAAG 
CTGTAAAGCA 
CAATGAAGAG 
AGTTCCTGTT 
CGAATCAAAG 
TGACAGCAAC 
AAGGGAAGAC 
AGATGTTGCA 
ATGGGAATTC 
GTTTGCTGGT 
AGGAGCATTT 
TAAGGCTGTG 
GAACAAATTA 
TCATCAACAC 
CAACAAGGTC 
GATTACCTTG 
GAAGGGGCAG 
ATTCTTTCAG 
TGGAGTGTGT 
CTTGCTGAAT 
GTGCTACGAT 
CTTGGCTGCT 
AATGACAAGG 



AATATGGATA 
AAAATGTCAC 
TACTATGAAT 
ATCAGATGTG 
TTTCAGATTG 
AGTCAGTGGT 
TACCATAGTG 
GCCCCAGGAT 
AAACACAGAG 
CTCAAAATGG 
AAAAACTATG 
TCAAAGAAAA 
TTCCCTGACT 
AGCAGTAACC 
CCTGAGTCAA 
AACATCTCCA 
ATGGTTAGAA 
AATGATAAAA 
TACCTGGCAG 
AATTCAGCAG 
CCCGACTCTG 
TTGAAGGAGC 
TATGATGTTG 
GAGGCCCAGA 
TCAAAGGAAT 
TACCTGAGGA 
GTCTGTGAAG 
CGTAACTGCT 
TATGTTCTTG 



CAAAATCTAT 
CAAATAATTA 
ATGACAAAAT 
TGGAGAAAGT 
TCTATAAAGA 
TGAAAGCATT 
GGTTCTTCGT 
GTACCCTCTG 
TTCCCACCTT 
ATGCACCATC 
GCTCCCAGCC 
TCTATGGCTC 
GGTGGCAAGT 
AAAAAGAAAG 
GTTCATCTGA 
GATCACAATC 
ATTCGAGCCA 
AAGGAACTGT 
AAAACTACTG 
GCATGATCAC 
TGTCCCTGGG 
TGGGAAGTGG 
CTGTTAAGAT 
CTATGATGAA 
ACCCCATATA 
GTCACGGAAA 
GCATGGCCTT 
TGGTGGACAG 
ATGACCAGTA 



TCTAGAAGAA 60 

CAAAGAACGG 120 

GAAAAGGGGC 180 

AAATCTCGAG 240 

TGGGCTTCTC 300 

ACAAAAAGAG 360 

GGACGGGAAG 420 

GGAAGCATAT 4 80 

CCCAGACAGA 540 

TTCAAGTACC 600 

ACCATCTTCA 660 

CCAGCCAAAC 720 

AAGAAAACTG 780 

AAATGTGAAT 840 

AGAAGAGGAA 900 

TGAACAGTTA 960 

AGTGGGAATG 1020 

CAAACATTAC 108 0 

TTTTGATTCC 114 0 

ACGGCTCCGC 1200 

AAATGGAATC 1260 

CCAGTTTGGA 1320 

GATCAAGGAG 1380 

ACTCAGCCAT 1440 

CATAGTGACT 1500 

AGGACTTGAA 1560 

CTTGGAGAGT 162 0 

AGATCTCTGT 1680 

TGTCAGTTCA 1740 



239 
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10 



GTCGGAACAA 
AGCAGCAAGT 
AAGCAGCCCT 
AGGCTTTACC 
CACGAGCTTC 
CGGGAAAAAG 
CTGGCCAGCA 
TTTTAATAGT 
ATTCCCTTGA 
TTTCCAGCCT 
AAAGACTGAG 
ATTGTCACAA 



AGTTTCCAGT 
CAGACGTATG 
ATGACTTGTA 
GGCCCCACCT 
CAGAAAAGCG 
ACAAGCATTG 



TTTTCATTCA 
GTTCTCTGTA 
AATTTAGATC 
ATAGCAGAAG 
CAGAACTGAA 
CAATTAAATA 



CAAGTGGTCA 
GGCATTTGGG 
TGACAACTCC 
GGCATCGGAC 
TCCCACATTT 
AAjGAAGAAAT 
TTTTAAGGAA 
TTGTCTATTA 
AAATTAGTAA 
CACATTTTCA 
AAATTACTTA 
TACTACCAAG 



GCTCCAGAGG 
ATCCTGATGT 
CAGGTGGTTC 
ACCATCTACC 
CAGCAACTCC 
TAGGAGTGCT 
AGTAGGAAGG 
TTTAGAAATG 
TTTTGTTTTA 
GACTGCAATA 
TTGGATATTC 
TACAGAAATG 



TGTTTCATTA 
GGGAGGTGTT 
TGAAGGTCTC 
AGATCATGTA 
TGTCTTCCAT 
GATAAGAATG 
CATAAGTAAT 
AACAAGGCAG 
TGCTGCTCCT 
TAGAGACTGT 
ATTCTTTTCT 
TGGAAAAAAA 



CTTCAAATAC 1800 

CAGCCTGGGG 18 SO 

CCAGGGCCAC 1920 

CAGCTGCTGG 1980 

TGAACCACTT 2040 

AATATAGATG 2100 

TTTAGCTAGT 2160 

GAAACAAAAG 2220 

GATATAACAC 2280 

GTTCATGTGT 2340 

TTATATTGTC 2400 
AAACCG 



Seq ID No: 129 Protein sequence; 
15 Protein Accession #: NP 001712.1 



20 



25 



30 



MDTKSILEEL 
RCVEKVNLEE 
HSGFFVDGKF 
KMDAPSSSTT 
PDWWQVRKLK 
ISRSQSEQLL 
IiAENYCFDSI 
KELGSGQFGV 
KEYPIYIVTE 
NCLVDRDLCV 
LMWEVFSLGK 
QLLSSIEPLR 



11 
I 

LLKRSQQKKK 
QTPVERQYPF 
LCCQQSCKAA 
LAQYDNESKK 
SSSSSEDVAS 
RQKGKEGAFM 
PKLIHYHQHN 
VQLGKWKGQY 
YISNGCLLNY 
KVSDFGMTRY 
QPYDLYDNSQ 
EKDKH 



21 
i 

MSPNNYKERL 
QIVYKDGLLY 
PGCTLWEAYA 
NYGSQPPSSS 
SNQKERNVNH 
VRNSSQVGMY 
SAGMITRLRH 
DVAVKMIKEG 
LRSHGKGLEP 
VLDDQYVSSV 
WLKVSQGHR 



31 
I 

FVLTKTNLSY 
VYASNEESRS 
NLHTAVNEEK 
TSLAQYDSNS 
TTSKISWEFP 
TVSLFSKAW 
PVSTKANKVP 
SMSEDEFFQE 
SQLIjEMCYDV 
GTKFPVKWSA 
LYRPHLASDT 



41 

I 

YEYDKMKRGS 
QWLKALQKEI 
HRVPTFPDRV 
KKIYGSQPNF 
ESSSSEEEEN 
DKKGTVKHYH 
DSVSLGNGIW 
AQTMMKLSHP 
CEGMAFLESH 
PEVFHYFKYS 
IYQIMYSCWH 



51 
I 

RKGSIEIKKI 60 

RGNPHLLVKY 120 

LKIPRAVPVL 180 

NMQYIPREDF 240 

LDDYDWFAGN 300 

VHTNAENKLY 360 

ELKREEITLL 420 

KLVKFYGVCS 480 

QFIHRDLAAR 540 

SKSDVWAFGI 600 

ELPEKRPTFQ 660 



Seq ID NO: 130 DNA sequence 
35 Nucleic Acid Accession #: NM_012072.2 

Coding sequence: 149-2107 (underlined sequences correspond to start and stop codons) 



40 



45 



50 



55 



60 



65 



70 



75 



AAAGCCCTCA 
CCCCTTGGGG 
TCCCGCAGAG 
GCTGCTGCTC 
CGTGGGGACC 
CCACTGCAAC 
CGTCCAGCGA 
CAAGTTCTGG 
GAAGGGCTTC 
GCTCCGGAAC 
GCTCCTTCCC 
CGGAAGTAAC 
GGCCCTGGGG 
CTTGGAGGCT 
CGAGACTCAG 
CAGCTCGGGC 
CCACCAGGAC 
CCGGCTGCTG 
TCGTGGGGGG 
CCAAGGGTAC 
CTCCCCCTGT 
TGGCTATGAG 
GGGTCGCTCG 
TGAGGAGGGC 
TGTGGGCCCG 
CTGTGGCTGC 
TGTGTCTCTG 
GAGCACCGTG 
GGCTACACCC 
ACTCAAGATG 
CGCCACAGCT 
AAACAACGAT 
GGCCATCCTA 
GAAGAGGGAG 
TCCAGAGCGA 
CTGCTGAAAG 



11 
I 

GCCTTTGTGT 
CCCAGCTGGG 
GGCCACACAG 
CTGACCCAGC 
GCCTGCTACA 
CAGAACGGGG 
GTACTGGCCC 
ATTGGGCTCC 
AGCTGGGTGG 
TCGTGCATCT 
AACCGCCTGC 
ATTGAGGGCT 
GGCCCAGGTC 
GTGCCCTTTG 
AGTCATTATT 
CCCCTCTGTG 
TGCTTTGAAG 
GATGACCTGG 
GCCACGTGCG 
CAGCTGGACT 
GCCCAGGAGT 
CCGGGCGGTC 
CCTTGCGCCC 
TACGTCCTGG 
GGGGGCCCCC 
CTGCCAGGCT 
GGACCACCAT 
CCCCGCGCTG 
ACCACAAGTA 
CTGGCCCCCA 
GCCTCTGGCC 
GGCACTGACG 
CTCCTGCTGG 
GAGAAGAAGG 
GCTGAGAGCA 
TGAGGTGGCC 



21 
I 

CCTTCTCTGC 
AG CCGAG AT A 
AGACCGGGAT 
CCGGGGCGGG 
CGGCCCACTC 
GCAACCTGGC 
AGCTCCTGAG 
AGCGAGAGAA 
GCGGGGGGGA 
CCAAGCGCTG 
CCAAGTGGTC 
TCGTGTGCAA 
AGGTGACCTA 
CCTCTGCGGC 
TCCTGTGCAA 
TCAGCCCCAA 
GGGGGGATGG 
TGACCTGTGC 
TCCTGGGACC 
CGAGTCAGCT 
GTGTCAACAC 
CTGGAGAGGG 
AGGGCTGCAC 
CCGGGGAGGA 
TCTGCGACAG 
GGGTGCTGGC 
CTGGGCCCCC 
CAACAGCCAG 
GACCTTCGCT 
GTGGGTCCTC 
CCCAGGAGCC 
GGCAAAAGCT 
CCCTGGCTCT 
AGAAGAAGCC 
GGGCCATGGA 
CTAGAGACAC 



31 
I 

GCCGGAGTGG 
GAAGCTCCTG 
GGCCACCTCC 
GACGGGAGCT 
GGGCAAGCTG 
CACTGTGAAG 
GCGGGAGGCA 
GGGCAAGTGC 
GGACACGCCT 
TGTGTCTCTG 
TGAGGGCCCC 
GTTCAGCTTC 
CACCACCCCC 
CAATGTAGCC 
GGAGAAGGCC 
GTATGGCTGC 
CTCCTTCCTC 
CTCTCGAAAC 
CCATGGGAAA 
GGACTGTGTG 
CCCTGGGGGC 
GGCCTGTCAG 
CAACACAGAT 
CGGGACTCAG 
CTTGTGCTTC 
CCCAAATGGG 
CGATGAGGAG 
TCCCACAAGG 
GTCATCTGAC 
AGGCGTCTGG 
TGCAGGTGGG 
GCTTTTATTC 
GGGGCTACTG 
CCAGAATGCG 
GAACCAGTAC 
TAGAGTCACC 



41 


51 




1 

CTGCAGCTCA 


1 

CCCCTCAGCT 


60 


TCGCCGCTGG 


GCTTCTCGCC 


120 


ATGGGCCTGC 


TGCTGCTGCT 


180 


GACACGGAGG 


CGGTGGTCTG 


240 


AGCGCTGCCG 


AGGCCCAGAA 


300 


AGCAAGGAGG 


AGGCCCAGCA 


360 


GCCCTGACGG 


CGAGGATGAG 


420 


CTGGACCCTA 


GTCTGCCGCT 


480 


TACTCTAACT 


GGCACAAGGA 


540 


CTGCTGGACC 


TGTCCCAGCC 


600 


TGTGGGAGCC 


CAGGCTCCCC 


660 


AAAGGCATGT 


GCCGGCCTCT 


720 


TTCCAGACCA 


CCAGTTCCTC 


780 


TGTGGGGAAG 


GTGACAAGGA 


840 


CCCGATGTGT 


TCGACTGGGG 


900 


AACTTCAACA 


ATGGGGGCTG 


960 


TGCGGCTGCC 


GACCAGGATT 


1020 


CCTTGCAGCT 


CCAGCCCATG 


1080 


AACTACACGT 


GCCGCTGCCC 


1140 


GACGTGGATG 


AATGCCAGGA 


1200 


TTCCGCTGCG 


AATGCTGGGT 


1260 


GATGTGGATG 


AGTGTGCTCT 


1320 


GGCTCATTTC 


ACTGCTCCTG 


1380 


TGCCAGGACG 


TGGATGAGTG 


1440 


AACACACAAG 


GGTCCTTCCA 


1500 


GTCTCTTGCA 


CCATGGGGCC 


1560 


GACAAAGGAG 


AGAAAGAAGG 


1620 


GGCCCCGAGG 


GCACCCCCAA 


1680 


GCCCCCATCA 


CATCTGCCCC 


1740 


AGGGAGCCCA 


GCATCCATCA 


1800 


GACTCCTCCG 


TGGCCACACA 


1860 


TACATCCTAG 


GCACCGTGGT 


1920 


GTCTATCGCA 


AGCGGAGAGC 


1980 


GCAGACAGTT 


ACTCCTGGGT 


2040 


AGTCCGACAC 


CTGGGACAGA 


2100 


AGCCACCATC 


CTCAGAGCTT 


2160 



240 
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TGAACTCCCC ATTCCAAAGG GGCACCCACA 
CAAACAATTG TAAGTCTCCT CCTTAAAGGC 
TGTTTGATGT TCCTGAAGTG GAAGCTGTGT 
TCTATAATGA TTGTTACTCC CCCTCCCTTT 
5 GGGTGTGAGG AGGCTGGGGC TAAGGGGCTC 
ATCTAAGAGG AAAAGGTGAG TTGCTCATGC 
CCTAGGATGA AAACTAAATC AATTAATTAT 
TCAAAGGGAA CATGTTCGGA CTGGAAACAT 
AGCACAAGTC TTGCTAAATG TGATACTGTT 

10 TAACCTCTTA GGTGGCAAGG AGGCAGGAAG 
CTTGGGTTTA TTTGCAAAGG AAGCTTGAAA 
CAGGTGTTTG TGAAGTCACA TAATCTACGG 
CACAGATACT TGAATTAATT CATCCAAATG 
TGTGATCAAC ACTAACAAGG AAACAAATTC 

15 CCTCAGACAC CCTGCCTGTG GCCCCGCCTC 
CGAGCTCAGA CAGAGGAAGC CCTGCAGAAA 
TGAACGGGAG ATGATGCACT GTGTTTTGAA 
TCATAGTCCA CAGTTGATGC AGCATCCTGA 
CACACCAAGT AGGGAGCTAG TCAGGCAGTT 

20 TTCCTTAAAA TTGGGGGTAA GGAGGGAAGG 
TTTTTACAGC AAAAACTGCT CAAAGCCATT 
TTGCAAATAT TTCTCCCTAT GATAATGCAG 
TCTCTCTCAC ACACACACAC ACACACACAC 
CCTGGGGCAC TGGAACACAT TCCTGGGGGT 

25 TGAGTATCTC TGGGAGGCCT CATGTCTCCT 
ACAGACAGAG GAAATGTGTC TCCCTCCAAG 
GGTTTTGCCT TAGCAATGCA TCGGTCTCTG 
CAAGGTGCAG GGTTAATACT CTTGCCAGTT 
TTTTAATAGA AAACTAAAGG GGCAGGGGAA 

30 TCGATGGGGC ATTTGGAACT TCTTTTTAAA 
AACTCTGGTG TTTAACACTT AAGGGAGACA 
GGCCACGAGA CTCTAGGTGA TGTGTGAAGC 
CTGTCTGGCC ATTCAGAGGA TTCTAAAGAC 
CACTTAAATA AATG CAAATG CAACATTTCT 

35 TCATTTGGGG TGAAGGAGAC ATTTCTGTCC 
GTATGATTCC TGGGATCCAA CGAGCCCTCC 
GCCCAGGCCC ATCGTCTGTT CTCTGAATGC 
GAACCCCTCT GTGGAACCCA CAAGGGGAGA 
ACCTTCCCTG GCAGGCTGGG TCCCTCTCCT 

40 CCCACCACGG GGGGAGAGCC AGCAACCCAA 
AACCACTGGG CTCAAACACG TGCTTTATTC 
ATGGAAATTC TTGTTTGGGG GATCTTGGGG 
GCCAAGAGGC CATTAACAAA TCGTCCTTGT 
CACAGTGGGG AATCCAAGGG TCACAGTATG 

45 TCTCGCTAGA CACAGTGTTT CTGCCCAGGT 
CATGGGGACG GGGGAAGTTT TCACTTGGAG 
CCAAATAGGT CAATAATTCT GGGAGACTCT 
TCTCTCCCTC CCCTCATCCC ACATCTCAAA 
CACCCAGCTC GCCATGCCTA CTCATTCCTG 

50 CTTCTTTGTC ATTTGAGAAA GGATGCAGGA 
CAGAAAAACC AGGGCAGGAC AGTTATCGAC 
TAGAGGGACT CCACCCCTGC TCAACAGCTT 
CTCTGCCTTC GGTGGCCCAC ACACCTAAGC 
AACACATCTA CGTGTAGCAC TACGACGTTA 

55 AGGCTCTGAT TAAGGATGTG GGGAAGTGGG 
CTGGAGGCCT GTCTGTTAGC CAGTGGTGGA 
TGCCATCTTC CCTGCGATCA GGCAAAAAAG 
TGTGTTATGT CCATTTTGCA GGATGAACTG 
TTGCTTTGTC TTTTCCATCC TCATCACAAG 

60 TCTTTCGATG GATGGAGATG ATCATTAGGT 
ATTTCTGTGA AAACTAGGAG AACAGAGATG 
TAACACAGTC TTTTTAAAAC TAACATAGGA 
TCTCCATTGT CTAAATCAGG AAAACAGGAA 
TTAATGCCCC CTACATATTT CCATCACCTT 

65 TATGATCCCA GAAAACATCT GTCTCTACTT 
TGGTTTGTGC ATTTTCTCAA CTAAAAATAG 
CTAATCAAAG ACACTATTTT CATACTAGAT 
TTTAAAAATA AATTGTGTTT TGGTCTGTTC 
AAGCTCTGGA ATCCCTTTAT TGTGCTGTTG 

70 TTCAGCAGAT TTTGCCCACT ATTCCTCTGA 
GCTTGAATTA GATCCCTGCA AAGGCTTGCT 
GTAATCACTT CATGAATGCT AAATGAGAAT 
TTTGTTTGAC TAATTCTGGA ATTACAAGAT 
ATGTTTCCCA AACTGTGAGG AGGGAAGGCT 

75 CAAAATGGTG CTTTGAGGGT CAGCCTTTAG 
TCTGTTATGT GCCTATCCTA ATAAACTCTT 



TTTTTTTGAA AGACTGGACT GGAATCTTAG 2220 

CCCTTGGAAC ATGCAGGTAT TTTCTACGGG 2280 

GTTGGCGTGC CACGGTGGGG ATTTCGTGAC 2340 

TCAAATTCCA ATGTGACCAA TTCCGGATCA 2400 

CCCTGAATAT CTTCTCTGCT CACTTCCACC 2460 

TGATTAGGAT TGAAATGATT TGTTTCTCTT 2520 

TCAATTAGGT AAGAAGATCT GGTTTTTTGG 2580 

TTCTTTACAT TTGCATTCCT CCATTTCGCC 2640 

GACATCCTCC AGAATGGCCA GAAGTGCAAT 2700 

TGCCTCTTTA GTTCTTACAT TTCTAATAGC 2760 

AATATGAGAA AAGTTGCTTG AAGTGCATTA 2820 

GGCTAGGGCG AGAGAGGCCA GGGATTTGTT 2880 

TACTGAGGTT ACCACACACT TGACTACGGA 294 0 

AAGGACAACC TGTCTTTGAG CCAGGGCAGG 3000 

CACTTCATCC TGCCCGGAAT GCCAGTGCTC 3060 

GTTCCATCAG GCTGTTTCCT AAAGGATGTG 3120 

AGTTGTCATT TTAAAGCATT TTAGCACAGT 3180 

GATTTTAAAT C CTGAAGTGT GGGTGGCGCA 3240 

TGCTTAAGGA ACTTTTGTTC TCTGTCTCTT 3300 

AAGAGGGAAA GAGATGACTA ACTAAAATCA 3360 

TAAATTATAT CCTCATTTTA AAAGTTACAT 3420 

TCGATAGTGT GCACTCTTTC TCTCTCTCTC 3480 

ACACACACAC AGAGACACGG CACCATTCTG 3540 

CACCGATGGT CAGAGTCACT AGAAGTTACC 3600 

GTGGGCTTTT TACCACCACT GTGCAGGAGA 3660 

GCCCCAAAGC CTCAGAGAAA GGGTGTTTCT 3720 

AGGTGACACT CTGGAGTGGT TGAAGGGCCA 3780 

TTGAAATATA GATGCTATGG TTCAGATTGT 3840 

GTGAAAGGAA AGATGGAGGT TTTGTGCGGC 3900 

GTCATCTCAT GGTCTCCAGT TTTCAGTTGG 3960 

AAGGCTGTGT CCATTTGGCA AAACTTCCTT 4 020 

TGGGCAGTCT GTGGTGTGGA GAGCAGCCAT 4080 

ATGGCTGGAT GCGCTGCTGA CCAACATCAG 414 0 

CCCTCTGGGC CTTGAAAATC CTTGCCCTTA 4200 

TTGGCTTCCC ACAGCCCCAA CGCAGTCTGT 4260 

TATTTTCACA GTGTTCTGAT TGCTCTCACA 4320 

AGCCCTGTTC TCAACAACAG GGAGGTCATG 4380 

AATGGGTGAT AAAGAATCCA GTTCCTCAAA 444 0 

GCTGGGTGGT GCTTTCTCTT GCACACCACT 4500 

CCAGACAGCT CAGGTTGTGC ATCTGATGGA 4560 

TCCTGTTTAT TTTTGCTGTT ACTTTGAAGC 4620 

CTACAGTAGT GGGTAAACAA ATGCCCACCG 4 680 

CCTGAGGGGC CCCAGCTTGC TCGGGCGTGG 4 74 0 

GGGAGAGGTG CACCCTGCCA CCTGCTAACT 4 800 

GACCTGTTCA GCAGCAGAAC AAGCCAGGGC 4 860 

ATGGACACCA AGACAATGAA GATTTGTTGT 4 920 

TGGAAAAAAC TGAATATATT CAGGACCAAC 4980 

GCAGACAATG TAAAGAGAGA ACATCTCACA 504 0 

AATTTCAGGT GCCATCACTG CTCTTTCTTT 5100 

GGACAATTCC CACAGATAAT CTGAGGAATG 5160 

AATGCATTAG AACTTGGTGA GCATCCTCTG 5220 

GGCTTCCAGG CAAGACCAAC CACATCTGGT 5280 

GTCATCGTCA TTGCCATAGC ATCATGATGC 5340 

TGTTTGGGTA ATGTGGGGAT GAACTGCATG 5400 

CTGCGGTCAC TGTCGGCCTT GCAAGGCCAC 5460 

GGAGCAAGGC TTCAGGAAGG GCCAGCCACA 5520 

TGGAATTAAA AAGTCAAACC TTTATATGCA 5580 

AGTTTAAAAG AATTTTTTTT TCTCTTCAAG 5640 

CCCTTGTTTG AGTGTCTTAT CCCTGAGCAA 5700 

ACTTTTGTTT CAACCTTTAT TCCTGTAAAT 5760 

AGATTTGACA AAAAAAAATT GAATTAAAAA 5820 

AAGCCTTTCC TATTATTTCT CTTCTTAGCT ' 5880 

AACACAGCTT TCTAGCAGCT GCAAAATGGT 5940 

GAACAATAGC TTTAGCTTGG GAATCTGAGA 6000 

CGGCTGCAAA ACCCATGGTT TAAATCTATA 6060 

AGATGATAAT CCGAATTCTC CATATATTCA 6120 

TCCTGAGACA AATACTCACT GAAGGGCTTG 6180 

TTGTAGATAA TGCCCTTCTA TTTTAGGTAG 6240 

CTCTTATCTG CAAGGTGGCA AGCAGTTCTT 6300 

GCTGAAGTTC TTTGCATAGA TTTGGCTTAA 6360 

CTGTGATGTC AGATGTAATT GTAAATGTCA 6420 

GTAAGTATTT TTAAATGTGT GTATTTCAAA 6480 

TTCTATGCAG GATTTACCTT CATCCTGTGC 6540 

CAGAGATCGA GCTTCTCCTC TGAGTTCTAA 6600 

GAAGGTGCAG CTTTGTTGTC CTTTGAGCTT 6660 
AAACACATT 
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Seq ID No: 131 Protein sequence : 
Protein Accession #: NP 036204.1 



1 11 21 31 41 51 

I 1 I I I I 

MATSMGLLLL LLLLLTOPGA GTGADTEAW CVGTACYTAH SGKLSAAEAQ NHCNQNGGNL 60 

ATVKSKEEAQ HVQRVLAQLL RREAALTARM SKFWIGLQRE KGKCLDPSLP LKGFSWVGGG 120 

EDTPYSNWHK ELRNSCISKR CVSLLLDLSQ PLLPNRLPKW SEGPCGSPGS PGSNIEGFVC 180 

KFSFKGMCRP LALGGPGQVT YTTPFQTTSS SLEAVPFASA ANVACGEGDK DETQSHYFLC 240 

KEKAPDVFDW GSSGPLCVSP KYGCNFNNGG CHQDCFEGGD GSFLCGCRPG FRLLDDLVTC 300 

ASRNPCSSSP CRGGATCVLG PHGKNYTCRC PQGYQLDSSQ LDCVDVDECQ DSPGAQECVN 360 

TPGGFRCECW VGYEPGGPGE GACQDVDECA LGRSPCAQGC TNTDGSFHCS CEEGYVLAGE 420 

DGTQCQDVDE CVGPGGPLCD SLCFNTQGSF HCGCLPGWVL APNGVSCTMG PVSLGPPSGP 480 

PDEEDKGEKE GSTVPRAATA SPTRGPEGTP KATPTTSRPS LSSDAPITSA PLKMLAPSGS 540 

SGVWREPSIH HATAASGPQE PAGGDSSVAT QNNDGTDGQK LLLFYILGTV VAILLLLALA 600 
LGLLVYRKRR AKREEKKEKK PQNAADSYSW VPERAESRAM ENQYSPTPGT DC 



Seq ID NO: 132 DNA sequence 

Nucleic Acid Accession #: NM_000963.1 

Coding sequence: 135-1949 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CAATTGTCAT ACGACTTGCA GTGAGCGTCA GGAGCACGTC CAGGAACTCC TCAGCAGCGC 60 

CTCCTTCAGC TCCACAGCCA GACGCCCTCA GACAGCAAAG CCTACCCCCG CGCCGCGCCC 120 

TGCCCGCCGC TCGG ATG CTC GCCCGCGCCC TGCTGCTGTG CGCGGTCCTG GCGCTCAGCC 180 

ATACAGCAAA TCCTTGCTGT TCCCACCCAT GTCAAAACCG AGGTGTATGT ATGAGTGTGG 240 

GATTTGACCA GTATAAGTGC GATTGTACCC GGACAGGATT CTATGGAGAA AACTGCTCAA 300 

CACCGGAATT TTTGACAAGA ATAAAATTAT TTCTGAAACC CACTCCAAAC ACAGTGCACT 360 

ACATACTTAC CCACTTCAAG GGATTTTGGA ACGTTGTGAA TAACATTCCC TTCCTTCGAA 420 

ATGCAATTAT GAGTTATGTC TTGACATCCA GATCACATTT GATTGACAGT CCACCAACTT 480 

ACAATGCTGA CTATGGCTAC AAAAGCTGGG AAGCCTTCTC TAACCTCTCC TATTATACTA 540 

GAGCCCTTCC TCCTGTGCCT GATGATTGCC CGACTCCCTT GGGTGTCAAA GGTAAAAAGC 600 

AGCTTCCTGA TTCAAATGAG ATTGTGGAAA AATTGCTTCT AAGAAGAAAG TTCATCCCTG 660 

ATCCCCAGGG CTCAAACATG ATGTTTGCAT TCTTTGCCCA GCACTTCACG CATCAGTTTT 720 

TCAAGACAGA TCATAAGCGA GGGCCAGCTT TCACCAACGG GCTGGGCCAT GGGGTGGACT 780 

TAAATCATAT TTACGGTGAA ACTCTGGCTA GACAGCGTAA ACTGCGCCTT TTCAAGGATG 840 

GAAAAATGAA ATATCAGATA ATTGATGGAG AGATGTATCC TCCCACAGTC AAAGATACTC 900 

AGGCAGAGAT GATCTACCCT CCTCAAGTCC CTGAGCATCT ACGGTTTGCT GTGGGGCAGG 960 

AGGTCTTTGG TCTGGTGCCT GGTCTGATGA TGTATGCCAC AATCTGGCTG CGGGAACACA 1020 

ACAGAGTATG CGATGTGCTT AAACAGGAGC ATCCTGAATG GGGTGATGAG CAGTTGTTCC 10 80 

AGACAAGCAG GCTAATACTG ATAGGAGAGA CTATTAAGAT TGTGATTGAA GATTATGTGC 1140 

AACACTTGAG TGGCTATCAC TTCAAACTGA AATTTGACCC AGAACTACTT TTCAACAAAC 12 00 

AATTCCAGTA CCAAAATCGT ATTGCTGCTG AATTTAACAC CCTCTATCAC TGGCATCCCC 1260 

TTCTGCCTGA CACCTTTCAA ATTCATGACC AGAAATACAA CTATCAACAG TTTATCTACA 1320 

ACAACTCTAT ATTGCTGGAA CATGGAATTA CCCAGTTTGT TGAATCATTC ACCAGGCAAA 13 80 

TTGCTGGCAG GGTTGCTGGT GGTAGGAATG TTCCACCCGC AGTACAGAAA GTATCACAGG 14 40 

CTTCCATTGA CCAGAGCAGG CAGATGAAAT ACCAGTCTTT TAATGAGTAC CGCAAACGCT 1500 

TTATGCTGAA GCCCTATGAA TCATTTGAAG AACTTACAGG AGAAAAGGAA ATGTCTGCAG 1560 

AGTTGGAAGC ACTCTATGGT GACATCGATG CTGTGGAGCT GTATCCTGCC CTTCTGGTAG 1620 

AAAAGCCTCG GCCAGATGCC ATCTTTGGTG AAACCATGGT AGAAGTTGGA GCACCATTCT 1680 

CCTTGAAAGG ACTTATGGGT AATGTTATAT GTTCTCCTGC CTACTGGAAG CCAAGCACTT 1740 

TTGGTGGAGA AGTGGGTTTT CAAATCATCA ACACTGCCTC AATTCAGTCT CTCATCTGCA 1800 

ATAACGTGAA GGGCTGTCCC TTTACTTCAT TCAGTGTTCC AGATCCAGAG CTCATTAAAA 1860 

CAGTCACCAT CAATGCAAGT TCTTCCCGCT CCGGACTAGA TGATATCAAT CCCACAGTAC 1920 

TACTAAAAGA ACGTTCGACT GAACTGTAGA AGTCTAATGA TCATATTTAT TTATTTATAT 1980 

GAACCATGTC TATTAATTTA ATTATTTAAT AATATTTATA TTAAACTCCT TATGTTACTT 2040 

AACATCTTCT GTAACAGAAG TCAGTACTCC TGTTGCGGAG AAAGGAGTCA TACTTGTGAA 2100 

GACTTTTATG TCACTACTCT AAAGATTTTG CTGTTGCTGT TAAGTTTGGA AAACAGTTTT 2160 

TATTCTGTTT TATAAACCAG AGAGAAATGA GTTTTGACGT CTTTTTACTT GAATTTCAAC 2220 

TTATATTATA AGAACGAAAG TAAAGATGTT TGAATACTTA AACACTATCA CAAGATGGCA 22 80 

AAATGCTGAA AGTTTTTACA CTGTCGATGT TTCCAATGCA TCTTCCATGA TGCATTAGAA 2340 

GTAACTAATG TTTGAAATTT TAAAGTACTT TTGGTTATTT TTCTGTCATC AAACAAAAAC 2400 

AGGTATCAGT G C ATT ATT AA ATGAATATTT AAATTAGACA TTACCAGTAA TTTCATGTCT 2460 

ACTTTTTAAA ATCAGCAATG AAACAATAAT TTGAAATTTC TAAATTCATA GGGTAGAATC 2520 

ACCTGTAAAA GCTTGTTTGA TTTCTTAAAG TTATTAAACT TGTACATATA CCAAAAAGAA 2580 

GCTGTCTTGG ATTTAAATCT GTAAAATCAG ATGAAATTTT ACTACAATTG CTTGTTAAAA 2640 

TATTTTATAA GTGATGTTCC TTTTTCACCA AGAGTATAAA CCTTTTTAGT GTGACTGTTA 2700 

AAACTTCCTT TTAAATCAAA ATGCCAAATT TATTAAGGTG GTGGAGCCAC TGCAGTGTTA 2760 

TCTCAAAATA AGAATATTTT GTTGAGATAT TCCAGAATTT GTTTATATGG CTGGTAACAT 2820 

GTAAAATCTA TAT C AG C AAA AGGGTCTACC TTTAAAATAA GCAATAACAA AGAAGAAAAC 2880 

CAAATTATTG TTCAAATTTA GGTTTAAACT TTTGAAGCAA ACTTTTTTTT ATCCTTGTGC 294 0 
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ACTGCAGGCC TGGTACTCAG ATTTTGCTAT GAGGTTAATG AAGTACCAAG CTGTGCTTGA 3000 

ATAACGATAT GTTTTCTCAG ATTTTCTGTT GTACAGTTTA ATTTAGCAGT CCATATCACA 3060 

TTGCAAAAGT AGCAATGACC TCATAAAATA CCTCTTCAAA ATGCTTAAAT TCATTTCACA 3120 

CATTAATTTT ATCTCAGTCT TGAAGCCAAT TCAGTAGGTG CATTGGAATC AAGCCTGGCT 3180 

5 ACCTGCATGC TGTTCCTTTT CTTTTCTTCT TTTAGCCATT TTGCTAAGAG ACACAGTCTT 324 0 

CTCATCACTT CGTTTCTCCT ATTTTGTTTT ACTAGTTTTA AGATCAGAGT TCACTTTCTT 3300 

TGGACTCTGC CTATATTTTC TTACCTGAAC TTTTGCAAGT TTTCAGGTAA ACCTCAGCTC 3360 

AGGACTGCTA TTTAGCTCCT CTTAAGAAGA TTAAAAGAGA AAAAAAAAGG CCCTTTTAAA 342 0 

AATAGTATAC ACTTATTTTA AGTGAAAAGC AGAGAATTTT ATTTATAGCT AATTTTAGCT 3480 

10 ATCTGTAACC AAGATGGATG CAAAGAGGCT AGTGCCTCAG AGAGAACTGT ACGGGGTTTG 3540 

TGACTGGAAA AAGTTACGTT CCCATTCTAA TTAATGCCCT TTCTTATTTA AAAACAAAAC 3600 

CAAATGATAT CTAAGTAGTT CTCAGCAATA ATAATAATGA CGATAATACT TCTTTTCCAC 3660 

ATCTCATTGT CACTGACATT TAATGGTACT GTATATTACT TAATTTATTG AAGATTATTA 3720 

TTTATGTCTT ATTAGGACAC TATGGTTATA AACTGTGTTT AAGCCTACAA TCATTGATTT 3780 

15 TTTTTTGTTA TGTCACAATC AGTATATTTT CTTTGGGGTT ACCTCTCTGA ATATTATGTA 3 840 

AACAATCCAA AGAAATGATT GTATTAAGAT TTGTGAATAA ATTTTTAGAA ATCTGATTGG 3900 

CATATTGAGA TATTTAAGGT TGAATGTTTG TCCTTAGGAT AGGCCTATGT GCTAGCCCAC 3960 

AAAGAATATT GTCTCATTAG CCTGAATGTG CCATAAGACT GACCTTTTAA AATGTTTTGA 4020 

GGGATCTGTG GATGCTTCGT TAATTTGTTC AGCCACAATT TATTGAGAAA ATATTCTGTG 4080 

20 TCAAGCACTG TGGGTTTTAA TATTTTTAAA TCAAACGCTG ATTACAGATA ATAGTATTTA 4140 

TATAAATAAT TGAAAAAAAT TTTCTTTTGG GAAGAGGGAG AAAATGAAAT AAATATCATT 4200 

AAAGATAACT CAGGAGAATC TTCTTTACAA TTTTACGTTT AGAATGTTTA AGGTTAAGAA 4260 

AGAAATAGTC AATATGCTTG TATAAAACAC TGTTCACTGT TTTTTTTAAA AAAAAAACTT 4320 

GATTTGTTAT TAACATTGAT CTGCTGACAA AACCTGGGAA TTTGGGTTGT GTATGCGAAT 4380 

25 GTTTCAGTGC CTCAGACAAA TGTGTATTTA ACTTATGTAA AAGATAAGTC TGGAAATAAA 4440 
TGTCTGTTTA TTTTTGTACT ATTTA 

Seq ID No: 133 Protein sequence; 
Protein Accession #: NP 000954.1 

30 

1 11 21 31 41 SI 

I I I I I I 

MIARALLLCA VLALSHTANP CCSHPCQNRG VCMSVGFDQY KCDCTRTGFY GENCSTPEFL 60 

35 TRIKLFLKPT PNTVHYILTH FKGFWNWNN IPFLRNAIMS YVLTSRSHLI DSPPTYNADY 120 

GYKSWEAFSN LSYYTRALPP VPDDCPTPLG VKGKKQLPDS NEIVEKLLLR RKFIPDPQGS 180 

NMMFAFFAQH FTHQFFKTDH KRGPAFTNGL GHGVDLNHIY GETLARQRKL RLFKDGKMKY 24 0 

QIIDGEMYPP TVKDTQAEMI YPPQVPEHLR FAVGQEVFGL VPGLMMYATI WLREHNRVCD 300 

VLKQEHPEWG DEQLFQTSRL XLIGETIKIV IEDYVQHLSG YHFKLKFDPE LLFNKQFQYQ 360 

40 NRIAAEFNTL YHWHPLLPDT FQIHDQKYNY QQFIYNNSIL LEHGITQFVE SFTRQIAGRV 420 

AGGRNVPPAV QKVSQASIDQ SRQMKYQSFN EYRKRFMLKP YESFEELTGE KEMSAELEAL 480 

YGDIDAVELY PALLVEKPRP DAIFGETMVE VGAPFSLKGL MGNVICSPAY WKPSTFGGEV 54 0 

GFQIINTASI QSLICNNVKG CPFTSFSVPD PELIKTVTIN ASSSRSGLDD INPTVLLKER 600 
STEL • 

45 

Seq ID NO: 134 DNA sequence 

Nucleic Acid Accession #: XM_059648.1 

Coding sequence: 35-664 (underlined sequences correspond to start and stop codons) 

50 

1 11 21 31 41 51 

I I I I I I 

AGGCTGCTGA GACTTCCCTC TAGAATCCTC CAAC ATGG AG CCTCTTGCAG CTTACCCGCT 60 

AAAATGTTCC GGGCCCAGAG CAAAGGTATT TGCAGTTTTG CTGTCTATAG TTCTATGCAC 120 

55 AGTAACGCTA TTTCTTCTAC AACTAAAATT CCTCAAACCT AAAATCAACA GCTTTTATGC 180 

CTTTGAAGTG AAGGATGCAA AAGGAAGAAC TGTTTCTCTG GAAAAGTATA AAGGCAAAGT 240 

TTCACTAGTT GTAAACGTGG CCAGTGACTG CCAACTCACA GACAGAAATT ACTTAGGGCT 300 

GAAGGAACTG CACAAAGAGT TTGGACCATC CCACTTCAGC GTGTTGGCTT TTCCCTGCAA 360 

TCAGTTTGGA GAATCGGAGC CCCGCCCAAG CAAGGAAGTA GAATCTTTTG CAAGAAAAAA 420 

60 CTACGGAGTA ACTTTCCCCA TCTTCCACAA GATTAAGATT CTAGGATCTG AAGGAGAACC 480 

TGCATTTAGA TTTCTTGTTG ATTCTTCAAA GAAGGAACCA AGGTGGAATT TTTGGAAGTA 54 0 

TCTTGTCAAC CCTGAGGGTC AAGTTGTGAA GTTCTGGAAG CCAGAGGAGC CCATTGAAGT 600 

CATCAGGCCT GACATAGCAG CTCTGGTTAG ACAAGTGATC ATAAAAAAGA AAGAGGATCT 660 

ATGAGAATGC CATTGCGTTT CTAATAGAAC AGAGAAATGT CTCCATGAGG GTTTGGTCTC 720 

65 ATTTTAAACA TTTTTTTTTT GGAGACAGTG TCTCACTCTG TCACCCAGGC TGGAGTGCAG 780 

TAGTGCGTTC TCAGCTCATT GCAACCTCTG CCTTTTTAAA CATGCTATTA AATGTGGCAA 840 

TGAAGGATTT TTTTTTAATG TTATCTTGCT ATTAAGTGGT AATGAATGTT CCCAGGATGA 900 

GGATGTTACC CAAAGCAAAA ATCAAGAGTA GCCAAAGAAT CAACATGAAA TATATTAACT 960 

ACTTCCTCTG ACCATACTAA AGAATTCAGA ATACACAGTG ACCAATGTGC CTCAATATCT 1020 

70 TATTGTTCAA CTTGACATTT TCTAGGACTG TACTTGATGA AAATGCCAAC ACACTAGACC 1080 

ACTCTTTGGA TTCAAGAGCA CTGTGTATGA CTGAAATTTC TGGAATAACT GTAAATGGTT 1140 

ATGTTAATGG AATAAAACAC AAATGTTGAA AAATGTAAAA TATATATACA TAG ATT C AAA 1200 

TCCTTATATA TGTATGCTTG TTTTGTGTAC AGGATTTTGT TTTTTCTTTT TAAGTACAGG 1260 

TTCCTAGTGT TTTACTATAA CTGTCACTAT GTATGTAACT GACATATATA AATAGTCATT 1320 

75 TATAAATGAC CGTATTATAA CA 
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Seq ID No: 135 Protein sequence: 
Protein Accession #: XP 059648.1 



1 11 21 31 41 51 

I I I I I I 

MEPLAAYPLK CSGPRAKVFA VLLSIVLCTV TLFLLQLKFL KPKINSFYAF EVKDAKGRTV 60 
SLEKYKGKVS LWNVASDCQ LTDRNYLGLK ELHKEFGPSH FSVLAFPCNQ FGESEPRPSK 120 
EVESFARKNY GVTFPIFHKI KILGSEGEPA FRFLVDSSKK EPRWNFWKYL VNPEGQWKF 180 
WKPEEPIEVI RPDIAALVRQ VIIKKKEDL 



Seq ID NO: 136 DNA sequence 

Nucleic Acid Accession #: NM_003003.1 

Coding sequence: 304-2451 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CAAGTGCCGT CGCCGCGCCC CTTCCCCCTC CCGCCTCCCC GGCCCCCTCC CCGGAACCGG 60 

CGGTCGAGCT ACGGTCGCGG ACGAGTGGAA CCGAGACTGC CCCGCGGAGC CGCCGGTATG 120 

AGCGCCCCTC GCCACCCCGT GTCCCAGGCC CGGCCTTTCT GACAAGAGCT AGACTTCGGG 180 

CTCCTTGAGG ATATTCAGTT TTGTATGTTT GAATATCCTC TCACCATGTT CAGCATAAAG 240 

TACCATTCTT AATGATTATC CTCAACAAGA CAGGTGTGAG AGGGTTGCTG TTGCATTGCA 300 

ATCATGGTGC AAAAATACCA GTCCCCAGTG AGAGTGTACA AATACCCCTT TGAATTAATT 360 

ATGGCTGCCT ATGAAAGGAG GTTCCCTACA TGTCCTTTGA TTCCGATGTT CGTGGGCAGT 420 

GACACTGTGA GTGAATTCAA GAGCGAAGAT GGGGCTATTC ATGTCATTGA AAGGCGCTGC 480 

AAGCTGGATG TAGATGCACC CAGACTGCTG AAGAAGATTG CAGGAGTTGA TTATGTTTAT 540 

TTTGTCCAGA AAAACTCACT GAATTCTCGG GAACGTACTT TGCACATTGA GGCTTATAAT 600 

GAAACGTTTT CCAATCGGGT CATCATTAAT GAGCATTGCT GCTACACCGT TCACCCTGAA 660 

AATGAAGATT GGACCTGTTT TGAACAGTCT GCAAGTTTAG ATATTAAATC TTTCTTTGGT 720 

TTTGAAAGTA CAGTGGAAAA AATTGCAATG AAACAATATA CCAGCAACAT TAAAAAAGGA 780 

AAGGAAATCA TCGAATACTA CCTTCGCCAA TTAGAAGAAG AAGGCATAAC CTTTGTGCCC 840 

CGTTGGAGTC CGCCTTCCAT CACGCCCTCT TCAGAGACAT CTTCATCATC CTCCAAGAAA 900 

CAAGCAGCGT CCATGGCCGT CGTCATCCCA GAAGCTGCCC TCAAGGAGGG GCTGAGTGGT 960 

GATGCCCTCA GCAGCCCCAG TGCACCTGAG CCCGTGGTGG GCACCCCTGA CGACAAACTA 1020 

GATGCCGACC ACATCAAGAG ATACCTGGGC GATTTGACTC CGCTGCAGGA GAGCTGCCTC 10 BO 

ATTAGACTTC GCCAGTGGCT CCAGGAGACC CACAAGGGCA AAATTCCAAA AGATGAGCAT 1140 

ATTCTTCGGT TCCTCCGTGC ACGGGATTTT AATATTGACA AAGCCAGAGA GATCATGTGT 1200 

CAGTCTTTGA CGTG GAG AAA GCAGCATCAG GTAGACTACA TTCTTGAAAC CTGGACCCCT 1260 

CCTCAGGTCC TTCAGGATTA CTACGCGGGA GGCTGGCATC ATCACGACAA AGATGGGCGG 1320 

CCCCTCTACG TGCTCAGGCT GGGGCAGATG GACACCAAAG GCTTGGTGAG AGCGCTCGGG 1380 

GAGGAAGCCC TGCTGAGATA CGTTCTCTCC GTAAATGAAG AACGGCTAAG GCGATGCGAA 1440 

GAGAATACAA AAGTCTTTGG TCGGCCTATC AGCTCATGGA CCTGCCTGGT GGACTTGGAA 1500 

GGGCTGAACA TGCGCCACTT GTGGAGACCT GGTGTGAAAG CGCTGCTGCG GATCATCGAG 1560 

GTGGTGGAGG CCAACTACCC TGAGACACTG GGCCGCCTTC TCATCCTGCG GGCGCCCAGG 1620 

GTATTTCCTG TGCTCTGGAC GCTGGTTAGT CCGTTCATTG ATGACAACAC CAGAAGGAAG 1680 

TTCCTCATTT ATGCAGGAAA TGACTACCAG GGTCCTGGAG GCCTGCTGGA TTACATCGAC 1740 

AAAGAGATTA TTCCAGATTT CCTGAGTGGG GAGTGCATGT GCGAAGTGCC AGAGGGTGGA 1800 

CTGGTCCCCA AATCTCTGTA CCGGACTGCA GAGGAGCTGG AGAACGAAGA CCTGAAGCTC 1860 

TGGACTGAGA CCATCTACCA GTCTGCAAGC GTCTTCAAAG GAGCCCCACA TGAGATTCTC 1920 

ATTCAGATTG TGGATGCCTC GTCAGTCATC ACTTGGGATT TCGACGTGTG CAAAGGGGAC 1980 

ATTGTGTTTA ACATCTATCA CTCCAAGAGG TCGCCACAAC CACCCAAAAA GGACTCCCTG 2040 

GGAGCCCACA GCATCACCTC TCCGGGTGGG AACAATGTGC AG CT CAT AG A CAAAGTCTGG 2100 

CAGCTGGGCC GCGACTACAG CATGGTGGAG TCGCCTCTGA TCTGCAAAGA AGGAGAAAGC 2160 

GTGCAGGGTT CCCATGTGAC CAGGTGGCCG' GGCTTCTACA TCCTGCAGTG GAAATTCCAC 2220 

AGCATGCCTG CGTGCGCCGC CAGCAGCCTT CCCCGGGTGG ACGACGTGCT TGCGTCCCTG 2280 

CAGGTCTCTT CGCACAAGTG TAAAGTGATG TACTACACCG AGGTGATCGG CTCGGAGGAT 2340 

TTCAGAGGTT CCATGACGAG CCTGGAGTCC AGCCACAGCG GCTTCTCCCA GCTGAGTGCC 2400 

GCCACCACCT CCTCCAGCCA GTCCCACTCC AGCTCCATGA TCTCCAGGJA_GTGCCGCGCT 2460 

GCCTGCACCT AGTGTGCAGA GGGGACGGCC GCCCCTCCTC GGACAGCAGC TGCACCCGCC 2520 

CACCCAGCGG CGACATTGTA CAGACTCCTC TCACCTCTAG ATAGCAAATA GCTCTCAGAT 2580 

GGTAAACGTA GTCGTTTGAT CCCAAAACTA CCTTGGCAGG TAGTTTTAAC TCTGATCCTA 264 0 

ACTTAACTCA ATAGCCATAG ATTTTGTATA CGTTGTGCAC AAAATCCAAC CAGAGCGCAA 2700 

GGGCTCTCTT GAAAGAAAAG TAGTTTCTGT ACCAATTAAA GGATTGACGT GGTCTCAGAT 2760 

ATTGATGCAA AAAATTTTTC CAACGAACTC CGCATTGTCC ATTAGTGAAT GAATTCCTGT 2820 

GACATCCTCC AGAGATGGCC CCTCCTCACC TGGGACGGAA GCTGCCAGCT CGCTTCCCCC 2880 

AAGCTGCCTC ATGGCCCGCA CGCCGCCTCA CGGCCCCCAT GCTTCCCGCC AGTCAAGATG 2940 

GTCTGTGGAC TTAGGGCCAG CCCTTGAGGT CCTTATCCTC TGAGGATTCA GAGGTTGCCT 3000 

GCGGAGTACC TTGTCCCAGG GCCAGACACA CCCACACCAC CCACTGTCTG CAGTGGGGCC 3060 

GGGGGCTCAG GAGGGGCTCT CAGGGACTCC TGGTGACTCC AGGAAAATGC TGCCATCGTT 3120 

AAACATTACT TTCTCTTTCC TCCTTTTCAA ATCTTTTTGA TACTTTTTAG AGCAGGATTT 3180 

TTCTGTATGT GAACTTGGGT GGGGGGGTTC TTCCCGTTTC CTTCCGTGCG TCGCCCCTCT 324 0 

CACCTGCAGT CAGCTCCCAG CCCAGTGTAG GCCATCTCCT CTGTGCCCTC TGGAGGCTCA 33 00 

TTGTCTCAGA GCCCAGACAG TTCCAGCCAC TAGGAGGCCG TCTTGGAACC AGCAAGTCGC 3360 

ATTTGCCACT TGACACTGTC CATGGGGTTT TATTAGTAGC TAAGCAGCAG CTCTCGCATC 3420 
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CACTTCAGGG TGGCGTGTGG CATGTAGGAG TCCTGCTTCT TTGTACATGG GAATTGTGGA 3480 

CTCATGCGTG TGTGTGTGTG CATGTGCTGT GTGTGTGCAT GTGTGCATGA CGGTGGGGGT 3540 

GCTGGGGGGA CGGGGTGAGT GGAAACTTAG TTTGAGTAAT GAAGGAATCT TCACAGAAGC 3600 

AAATCAGAAT ATGGGATTTG TTTGCCTTTT ACATTTTGTT TAATTCCTGA TTTTAAAGCC 3660 

TGCTCTATCT GGTACAGGCC CTTATTTTTT CAGCTTTTTA TGGGAAAAGC AGGTTATTTG 3720 

AGAATCTGTC CAGAAGTTGC ATAGGGGATG GCCTCCACGA TAAGGACATG CAACACGTGT 3780 

TTCTGTGTGC AGCAGAGGCC GTGTTTTTCA TGCCAAACCC CACGCGGCTG TCAACTGTGT 3840 

GCGTGGTAGG CATGGAGATC CTGGTTGTGC CGTCTCAGCT CCGCTCTGAA GGCACTGTGT 3900 

GGGTGCTGCG TGACTGGAGA GCTGTGTGGA GGCCATGTGT GCCCCGTGCA GGGATCAGGA 3960 

GGGCGGGGGA GGGACCGAGC AGCCCTCTTG CCCGGTCGGG TCAGCCCTAG TGGCTGCCTG 4020 

CACACTGTAG ACGTCCCAGG GCCTGTGCTG TGATCACCTG CCTTTGGACC ACATTTGTGT 4080 

TTGCTCTTAG AGATCGAGCT CCTCAGTGGT ACCTGAAGCC TTTGCTTCCG GAAAGCGCGG 414 0 

TAGGGTTCGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT AGGGCTAGTA GGTAGGGCTA 4200 

GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG GTAGGGCTGG TAGGTAGGGT TAGTAGGTAG 4260 

GGCTAGTAGG TAGGGTTCGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT AGGGCTAGTA 4320 

GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG GTAGGGCTGG TAGGTAGGGT 438 0 

TAGTAGGTAG GGCTAGTAGG TAGGGTTCGT AGGTAGGGCT AGTAGGTAGG GTTAGTAGGT 4440 

AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG GTAGGGCTGG 4500 

TAGGTAGGGT TAGTAGGTAG GGCTAGTAGG TAGGGCTAGT AGGTAGGGCT AGTAGGTAGG 4560 

GTTAGTAGGT AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG TTAGTAGGTA GGGTTCGTAG 462 0 

GTAGGGCTGG TAGGTAGGGT TAGTAGGTAG GGCTAGTAGG TAGGGCTAGT AGGTAGGGCT 4680 

AGTAGGTAGG GCTAGTAGGT AGGGCTAGTA GGTAGGGCTA GTAGGTAGGG CTAGTAGGTA 4740 

GGGTTCGTAG GTAGGGTTCG TAGGTAGGGT TCGTAGGTAG GGTTAGTAGC GCGTCTGTGC 4800 

TGCTTCCACC TGGTGCTTCC TGTTCCCAAA TCACAAGGGC CTGAAGGTGG TCCCTGCTTT 4860 

CTCTTTCTCT TTCTCTGTGT CTCAGATGGC GATTTTGCTG ACAGCTGCCA AGAAAATGCT 4920 

TCACTCAACA GTCCTCATGT GCCCAGAGAT GTTTATAGAA CTGTTTGAAT TGCAGCCATC 4980 

CCCTGCCCCC TCCCAGGCTG AAGATCTGTT CTTTTTAAGT TGATTCGGGA GTGGCATTCT 5040 

TTTATACCCA AAGACTGTAG TGCATCTTGA AGAGCTCAAA GCACATGACC GCACAAATGC 5100 

TTACAGGGTT TCCTCCCGAG TAATCCAATC TCACTCCCCT TGTAAGGGAA TTCTGGGGCA 5160 

GCTATGGTTT GAGTATGCAG TTTGCATCGT GTTTCTACCT TTAGTACCTT GCCACTCTTT 5220 

TAAAACGCTG CTGTCATTTC CCATTTCTTA GTACTAATGA TTCTTTGATT CTCCCTCTAT 5280 

TATGTCTTAA TTCACTTTCC TTCCTAAATT TGTTATTTGC ATATCAAATT CTGTAAATGT 5340 

TTTGTAAACA TATTACCTCA CTTGGTAATA CAATACTGAT AGTCTTTAAA AGATTTTTTT 5400 
ATTGTTATCA ATAATAAATG TGAACTATTT AAAG 



Seq ID No: 137 Protein sequence: 
Protein Accession #: NP 002994.1 



1 11 21 31 41 51 

I I I I I I 

MVQKYQSPVR VYKYPFELIM AAYERRFPTC PLIPMFVGSD TVSEFKSEDG AIHVIERRCK 60 

LDVDAPRLLK KIAGVDYVYF VQKNSLNSRE RTLHIEAYNE TFSNRVIINE HCCYTVHPEN 120 

EDWTCFEQSA SLDIKSFFGF ESTVEKIAMK QYTSNIKKGK EIIEYYLRQL EEEGITFVPR 180 

WSPPSITPSS ETSSSSSKKQ AASMAWIPE AALKEGLSGD ALSSPSAPEP WGTPDDKLD 24 0 

ADHIKRYLGD LTPLQESCLI RLRQWLQETH KGKIPKDEHI LRFLRARDFN IDKAREIMCQ 300 

SLTWRKQHQV DYILETWTPP QVLQDYYAGG WHHHDKDGRP LYVLRLGQMD TKGLVRALGE 360 

EALLRYVLSV NEERLRRCEE NTKVFGRPIS SWTCLVDLEG LNMRHLWRPG VKALLRIIEV 420 

VEANYPETLG RLLILRAPRV FPVLWTLVSP FIDDNTRRKF LIYAGNDYQG PGGLLDYIDK 4 80 

EIIPDFLSGE CMCEVPEGGL VPKSLYRTAE ELENEDLKLW TETIYQSASV FKGAPHEILI 54 0 

QIVDASSVIT WDFDVCKGDI VFNIYHSKRS PQPPKKDSLG AHSITSPGGN NVQLIDKVWQ 600 

LGRDYSMVES PLICKEGESV QGSHVTRWPG FYILQWKFHS MPACAASSLP RVDDVLASLQ 660 

VSSHKCKVMY YTEVIGSEDF RGSMTSLESS HSGFSQLSAA TTSSSQSHSS SMISR 



Seq ID NO: 138 DNA sequence 

Nucleic Acid Accession #: NM_00418l.l 

Coding sequence: 32-670 {underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I l 

GCAGAAATAG CCTAGGGAGA TCAACCCCGA GATGCTGAAC AAAGTGCTGT CCCGGCTGGG 60 

GGTCGCCGGC CAGTGGCGCT TCGTGGACGT GCTGGGGCTG GAAGAGGAGT CTCTGGGCTC 120 

GGTGCCAGCG CCTGCCTGCG CGCTGCTGCT GCTGTTTCCC CTCACGGCCC AGCATGAGAA 180 

CTTCAGGAAA AAGCAGATTG AAG AG CTGAA GGGACAAGAA GTTAGTCCTA AAGTGTACTT 240 

CATGAAGCAG ACCATTGGGA ATTCCTGTGG CACAATCGGA CTTATTCACG CAGTGGCCAA 300 

TAATCAAGAC AAACTGGGAT TTGAGGATGG ATCAGTTCTG AAACAGTTTC TTTCTGAAAC 360 

AGAGAAAATG TCCCCTGAAG ACAGAGCAAA ATGCTTTGAA AAGAATGAGG CCATACAGGC 420 

AGCCCATGAT GCCGTGGCAC AGGAAGGCCA ATGTCGGGTA GATGACAAGG TGAATTTCCA 4 80 

TTTTATTCTG TTTAACAACG TGGATGGCCA CCTCTATGAA CTTGATGGAC GAATGCCTTT 54 0 

TCCGGTGAAC CATGGCGCCA GTTCAGAGGA CACCCTGCTG AAGGACGCTG CCAAGGTGTG 600 

CAGAGAATTC ACCGAGCGTG AGCAAGGAGA AGTCCGCTTC TCTGCCGTGG CTCTCTGCAA 660 

GGCAGCC TAA TGCTCTGTGG GAGGGACTTT GCTGATTTCC CCTCTTCCCT TCAACATGAA 720 

AATATATACC CCCCATGCAG TCTAAAATGC TTCAGTACTT GTGAAACACA GCTGTTCTTC 780 

TGTTCTGCAG ACACGCCTTC CCCTCAGCCA CACCCAGGCA CTTAAGCACA AGCAGAGTGC 84 0 

ACAGCTGTCC ACTGGGCCAT TGTGGTGTGA GCTTCAGATG GTGAAGCATT CTCCCCAGTG 900 
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TATGTCTTGT ATCCGATATC TAACGCTTTA AATGGCTACT TTGGTTTCTG TCTGTAAGTT 960 
AAGACCTTGG ATGTGGTTAT GTTGTCCTAA AGAATAAATT TTGCTGATAG TAGC 



Seq ID No: 139 Protein sequence: 
Protein Accession #: NP 004172.1 



1 11 21 31 41 51 

I I I I I I 

MLNKVLSRLG VAGQWRFVDV LGLEEESLGS VPAPACALLL LFPLTAQHEN FRKKQIEELK 60 
GQEVSPKVYF MKQTIGNSCG TIGLIHAVAN NQDKLGFEDG SVLKQFLSET EKMSPEDRAK 120 
CFEKNEAIQA AHDAVAQEGQ CRVDDKVNFH FILFNNVDGH LYELDGRMPF PVNHGASSED 180 
TLLKDAAKVC REFTEREQGE VRFSAVALCK AA 

Seq ID NO: 140 DNA sequence 

Nucleic Acid Accession #: NM_000201.1 

Coding sequence: 58-1656 (underlined sequences correspond to start and stop codons) 



1 

r 

GCGCCCCAGT 
GCTCCCAGCA 
CCAGGACCTG 
GGCTCCGTGC 
ACCCCGTTGC 
AGCAATGTGC 
ACAGCTAAAA 
CCCTCTTGGC 
CCCCGGGCCA 
GCTGTGGGGG 
GCCAATTTCT 
AACACCTCGG 
GTCAGCCCCC 
CTGTTCCCAG 
ACAGTCACCT 
GACGAGGGCA 
CTGCAGACAG 
GTCTCAGAAG 
CTGAATGGGG 
CCAGAGGACA 
ATACACAAGA 
GATTGTCCGG 
TGGGGGAACC 
GGGGAATCAG 
ACTCAAGGGG 
GTCATCATCA 
CTCTATAACC 
CCCATGAAAC 
CCTCGGCCTT 
CATGCAGCTA 
TACAACAGCA 
CTGTAGTCAC 
TTAAAGTCTA 
ATACAACTGG 
CAGAAGAAGT 
GACGGATGCC 
TTCATTTGTT 
GGTCTCTGGC 
TACAGGTTGT 
CATTGGCCAA 
ATGGACTGGT 
CCCCCAAAAC 
CACAATGACA 
GCCTTGTCCT 
TCCTGCAGTG 
CCTCCCAGCT 
CGCTCTGTCA 
TTTGGGCTCA 
ACACCACACC 
TGCCCAGACT 



11 
I 

CGACGCTGAG 
GCCCCCGGCC 
GCAATGCCCA 
TGGTGACATG 
CTAAAAAGGA 
AAGAAGATAG 
CCTTCCTCAC 
AGCCAGTGGG 
ACCTCACCGT 
AGCCCGCTGA 
CGTGCCGCAC 
CCCCCTACCA 
GGGTCCTAGA 
TCTCGGAGGC 
ATGGCAACGA 
CCCAGCGGCT 
TGACCATCTA 
GGACCGAGGT 
TTCCAGCCCA 
ACGGGCGCAG 
ACCAGACCCG 
GAAACTGGAC 
CATTGCCCGA 
TGACTGTCAC 
AGGTCACCCG 
CTGTGGTAGC 
GCCAGCGGAA 
CGAACACACA 
CCCATATTGG 
CACCTACCGG 
TTTGGGGCCA 
ATGACTAAGC 
GCCTGATGAG 
GAAATACTGA 
GGCCCTCCAT 
AGCTTGGGCA 
ATTTTACCAG 
CTCACGGAGC 
ACACTGCAGG 
CCTGCCTTTC 
AATGGTTCAC 
TGACACCTTT 
CTCAGCGGTC 
CTTGTCCTGT 
ATCAGGGTCC 
TTGGAAGGGT 
CCCAGGCTGG 
AGTGATCCTC 
TGGCAAATTT 
TCCTTTGTGT 



21 
i 

CTCCTCTGCT 
CGCGCTGCCC 
GACATCTGTG 
CAGCACCTCC 
GTTGCTCCTG 
CCAACCAATG 
CGTGTACTGG 
CAAGAACCTT 
GGTGCTGCTC 
GGTCACGACC 
TGAACTGGAC 
GCTCCAGACC 
GGTGGACACG 
CCAGGTCCAC 
CTCCTTCTGG 
GACGTGTGCA 
CAGCTTTCCG 
GACAGTGAAG 
GCCACTGGGC 
CTTCTCCTGC 
GGAGCTTCGT 
GTGGCCAGAA 
GCTCAAGTGT 
TCGAGATCTT 
CGAGGTGACC 
AGCCGCAGTC 
GATCAAGAAA 
AGCCACGCCT 
TGGCAGTGGT 
CCCTGGGACG 
TGGTACCTGC 
CAAGAGGAAG 
AGGGGAAGTG 
AACTTGCTGC 
AGACATGTGT 
CTGCTGTCTA 
CTATTTATTG 
TCCCAGTCCA 
AGAGTGCCTG 
CCCAGAAGGA 
AGGTTCAGAG 
GTTAGCCACC 
ATGTCTGGAC 
TTGCATTTCA 
TGCAAGCAGT 
CATCCGCGTG 
AGTGCAGTGG 
CCACCTCAGC 
GATTTTTTTT 
TAGTTAATAA 



31 
I 

ACTCAGAGTT 
GCACTCCTGG 
TCCCCCTCAA 
TGTGACCAGC 
CCTGGGAACA 
TGCTATTCAA 
ACTCCAGAAC 
ACCCTACGCT 
CGTGGGGAGA 
ACGGTGCTGG 
CTGCGGCCCC 
TTTGTCCTGC 
CAGGGGACCG 
CTGGCACTGG 
GCCAAGGCCT 
GTAATACTGG 
GCGCCCAACG 
TGTGAGGCCC 
CCGAGGGCCC 
TCTGCAACCC 
GTCCTGTATG 
AATTCCCAGC 
CTAAAGGATG 
GAGGGCACCT 
GTGAATGTGC 
ATAATGGGCA 
TACAGACTAC 
CCC TGAA CCT 
GCCACACTGA 
CCGGAGGACA 
ACACCTAAAA 
GAGCAAGACT 
GTGGGGGAGA 
CTATTGGGTA 
AGCATCAAAA 
CTGACCCCAA 
AGTGTCTTTT 
TGTCACATTC 
GCAAAAAGAT 
GTGATTTTTC 
ATTACCCAGT 
TCCCCACCCA 
ATGAGTGCCC 
CTGGGAGCTT 
GGGGAAGGGG 
TGTGTGTGTG 
TGCAATCATG 
CTCCTGAGTA 
TTTTTTTTCA 
AGCTTTCTCA 



41 
I 

GCAACCTCAG 
TCCTGCTCGG 
AAGTCATCCT 
CCAAGTTGTT 
ACCGGAAGGT 
ACTGCCCTGA 
GGGTGGAACT 
GCCAGGTGGA 
AGGAGCTGAA 
TGAGGAGAGA 
AAGGGCTGGA 
CAGCGACTCC 
TGGTCTGTTC 
GGGACCAGAG 
CAGTCAGTGT 
GGAACCAGAG 
TGATTCTGAC 
ACCCTAGAGC 
AGCTCCTGCT 
TGGAGGTGGC 
GCCCCCGACT 
AGACTCCAAT 
GCACTTTCCC 
ACCTCTGTCG 
TCTCCCCCCG 
CTGCAGGCCT 
AACAGGCCCA 
ATCCCGGGAC 
ACAGAGTGGA 
GGGCATTGTC 
CACTAGGCCA 
CAAGACATGA 
CATAGCCCCA 
TGCTGAGGCC 
CACAAAGGCC 
CCCTTGATGA 
ATGTAGGCTA 
AAGGTCACCA 
CAAATGGGGC 
TATCGGCACA 
GAGGCCTTAT 
CATACATTTC 
AGGGAATATG 
GCACTATTGC 
GCCAAGGTAT 
TGTATGTGTA 
GTTCACTGCA 
GCTGGGACCA 
GAGACGGGGT 
ACTGCC 



51 
I 

CCTCGCT ATG 60 

GGCTCTGTTC 120 

GCCCCGGGGA 180 

GGGCATAGAG 240 

GTATGAACTG 300 

TGGGCAGTCA 360 

GGCACCCCTC 420 

GGGTGGGGCA 480 

ACGGGAGCCA 540 

TCACCATGGA 600 

GCTGTTTGAG 660 

CCCACAACTT 720 

CCTGGACGGG 780 

GTTGAACCCC 84 0 

GACCG CAG AG 900 

CCAGGAGACA 960 

GAAGCCAGAG 1020 

CAAGGTGACG 1080 

GAAGGCCACC 1140 

CGGCCAGCTT 1200 

GGACGAGAGG 1260 

GTGCCAGGCT 1320 

ACTGCCCATC 1380 

GGCCAGGAGC 144 0 

GTATGAGATT 1500 

CAGCACGTAC 1560 

AAAAGGGACC 1620 

AGGGCCTCTT 1680 

AGACATATGC 174 0 

C'TCAGTCAGA 1800 

CGCATCTGAT 1860 

TTGATGGATG 1920 

CCATGAGGAC 198 0 

CACAGACTTA 204 0 

CACACTTCCT 2100 

TATGTATTTA 2160 

AATGAACATA 2220 

GGTACAGTTG 2280 

TGGGACTTCT 2340 

AAAGCACTAT 2400 

TCCTCCCTTC 2460 

TGCCAGTGTT 2520 

CCCAAGCTAT 2580 

AGCTCCAGTT 2640 

TGGAGGACTC 270 0 

GACAAGCTCT 2760 

GTCTTGACCT 2820 

TAGGCTCACA 2880 

CTCGCAACAT 294 0 



Seq ID No: 141 Protein sequence: 
Protein Accession #: NP 000192.1 
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I I I I I I 

MLQFVRAGAR AWLRPTGSQG LSSLAEEAAR ATENPEOVAS EGLPEPVLRK VELPVPTHRR 60 
PVQAWVESLR GFEQERVGLA DLHPDVFATA PRLDILHQVA MWQKNFKRIS YAKTKTRAEV 120 
RGGGGKPLAA ERHWAGPAWQ H PL S ALARRR CCPWPPGPTS YYYMLPMKVR ALGLKVALTV 180 
KLAQDDLHIM DSLELPTGDP QYLTELAHYR RWGDSVLLVD LTHEEMPQSI VEATSRLKTF 240 
NLIPAVGLNV HSMLKHQTLV LTLPTVAFLE DKLLWQDSRY RPLYPFSLPY SDFPRPIiPHA 300 
TQGPAATPYH C 

Seq ID NO: 142 DNA sequence 

Nucleic Acid Accession #: NM_000270.1 

Coding sequence: 110-979 {underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I i I I I 

AACTGTGCGA ACCAGACCCG GCAGCCTTGC TCAGTTCAGC ATAGCGGAGC GGATCCGATC 60 

GGATCGGAGC ACACCGGAGC AGGCTCATCG AGAAGGCGTC TGCGAGAC CA TG GAGAACGG 120 

ATACACCTAT GAAGATTATA AGAACACTGC AGAATGGCTT CTGTCTCATA CTAAGCACCG 180 

ACCTCAAGTT GCAATAATCT GTGGTTCTGG ATTAGGAGGT CTGACTGATA AATTAACTCA 240 

GGCCCAGATC TTTGACTACA GTGAAATCCC CAACTTTCCT CGAAGTACAG TGCCAGGTCA 300 

TGCTGGCCGA CTGGTGTTTG GGTTCCTGAA TGGCAGGGCC TGTGTGATGA TGCAGGGCAG 360 

GTTCCACATG TATGAAGGGT ACCCACTCTG GAAGGTGACA TTCCCAGTGA GGGTTTTCCA 420 

CCTTCTGGGT GTGGACACCC TGGTAGTCAC CAATGCAGCA GGAGGGCTGA ACCCCAAGTT 480 

TGAGGTTGGA GATATCATGC TGATCCGTGA CCATATCAAC CTACCTGGTT TCAGTGGTCA 540 

GAACCCTCTC AGAGGGCCCA ATGATGAAAG GTTTGGAGAT CGTTTCCCTG CCATGTCTGA 600 

TGCCTACGAC CGGACTATGA GGCAGAGGGC TCTCAGTACC TGGAAACAAA TGGGGGAGCA 660 

ACGTGAGCTA CAGGAAGGCA CCTATGTGAT GGTGGCAGGC CCCAGCTTTG AGACTGTGGC 720 

AGAATGTCGT GTGCTGCAGA AGCTGGGAGC AGACGCTGTT GGCATGAGTA CAGTACCAGA 780 

AGTTATCGTT GCACGGCACT GTGGACTTCG AGTCTTTGGC TTCTCACTCA TCACTAACAA 840 

GGTCATCATG GATTATGAAA GCCTGGAGAA GGCCAACCAT GAAGAAGTCT TAGCAGCTGG 900 

CAAACAAGCT GCACAGAAAT TGGAACAGTT TGTCTCCATT CTTATGGCCA GCATTCCACT 960 

CCCTGACAAA GCCAGTTGAC CTGCCTTGGA GTCGTCTGGC ATCTCCCACA CAAGACCCAA 1020 

GTAGCTGCTA CCTTCTTTGG CCCCTTGCTG GAGTCATGTG CCTCTGTCCT TAGGTTGTAG 1080 

CAGAAAGGAA AAGATTCCTG TCCTTCACCT TTCCCACTTT CTTCTACCAG ACCCTTCTGG 1140 

TGCCAGATCC TCTTCTCAAA GCTGGGATTA CAGGTGTGAG CATAGTGAGA CCTTGGCGCT 1200 

ACAAAATAAA GCTGTTCTCA TTCCTGTTCT TTCTTACACA AGAGCTGGAG CCCGTGCCCT 1260 

ACCACACATC TGTGGAGATG CCCAGGATTT GACTCGGGCC TTAGAACTTT GCATAGCAGC 1320 

TGCTACTAGC TCTTTGAGAT AATACATTCC GAGGGGCTCA GTTCTGCCTT ATCTAAATCA 13 80 
CCAGAGACCA AACAAGGACT AATCCAATAC CTCTTGGA 

Seq ID No: 143 Protein sequence : 
Protein Accession #: NP 000261.1 



1 11 21 31 41 51 

I I I I I I 

MENGYTYEDY KNTAEWLLSH TKHRPQVAII CGSGLGGLTD KLTQAQIFDY SEIPNFPRST 60 
VPGHAGRLVF GFLNGRACWi MQGRFHMYEG YPLWKVTFPV RVFHLLGVDT LWTNAAGGL 120 
NPKFEVGDIM LIRDHINLPG FSGQNPLRGP NDERFGDRFP AMSDAYDRTM RQRALSTWKQ 180 
MGEQRELQEG TYVMVAGPSF ETVAECRVLQ KLGADAVGMS TVPEVIVARH CGLRVFGFSL 24 0 
ITNKVIMDYE SLEKANHEEV LAAGKQAAQK LEQFVSILMA SIPLPDKAS 

Seq ID NO: 144 DNA sequence 

Nucleic Acid Accession #: NM_015577.1 

Coding sequence: 112-3054 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAAGCGGCGG GCGGGGTGGA GCAGCCAGCT GGGTCCGGGG AGCGCCGCCG CCGCCTCGAT 60 

GGGGTGTTGA AAAGTCTCCT CTAGAGCTTT GGAAGGCTGA ATGCACTAAA C ATGA AGAGC 120 

TTGAAAGCGA AGTTCAGGAA GAGTGACACC AATGAGTGGA ACAAGAATGA TGACCGGCTA 180 

CTGCAGGCCG TGGAGAATGG AGATGCGGAG AAGGTGGCCT CACTGCTCGG CAAGAAGGGG 240 

GCCAGTGCCA CCAAACACGA CAGTGAGGGC AAGACCGCTT TCCATCTTGC TGCTGCAAAA 300 

GGACACGTGG AATGCCTCAG GGTCATGATT ACACATGGTG TGGATGTGAC AGCCCAAGAT 360 

ACTACCGGAC ACAGCGCCTT ACATCTCGCA GCCAAGAACA GCCACCATGA ATGCATCAGG 420 

AGGCTGCTTC AGTCTAAATG CCCAGCCGAA AGTGTCGACA GCTCTGGGAA AACAGCTTTA 480 

CATTATGCAG CGGCTCAGGG CTGCCTTCAA GCTGTGCAGA TTCTCTGCGA ACACAAGAGC 540 

CCCATAAACC TCAAAGATTT GGATGGGAAT ATACCGCTGC TTCTTGCTGT ACAAAATGGT 600 

CACAGTGAGA TCTGTCACTT TCTCCTGGAT CATGGAGCAG ATGTCAATTC CAGGAACAAA 660 

AGTGGAAGAA CTGCTCTCAT GCTGGCCTGT GAGATTGGCA GCTCTAACGC TGTGGAAGCC 720 

TTAATTAAAA AGGGTGCAGA CCTAAACCTT GTAGATTCTC TTGGATACAA TGCCTTACAT 780 

TATTCCAAAC TCTCAGAAAA TGCAGGAATT CAAAGCCTTC TATTATCAAA AATCTCTCAG 840 

GATGCTGATT TAAAGACCCC AACAAAACCA AAGCAGCATG ACCAAGTCTC TAAAATAAGC 900 
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TCAGAAAGAA GTGGAACTCC AAAAACACGC AAAGCTCCAC 
CAGTTGAGTG ATGTCTCTTC CCCAAGATCA ATAACTTCGA 
TCGGTATTTT TTGCTGAACC ACCCTTCAAG GCTGAGATCA 
GACAGACTAA GTGACAGTAC TACAGGTGCT GATAGCTTAT 
5 GACCAACAAG ATCTTCTCTC TCTATTGCAA GCAAAAGTTG 
AAGGAGTTAC AAGATAAATT ACAGGCCAAA TCACCCAAGG 
TTTGACTCAT ACCATTCCAC CCAAACTGAC TTGGGCCCAT 
ACCTCTCCCC CAGACTCCAA ATCATCTCCA TCTGTCTTAA 
ACTACTGACA ATGATGTCAG AATTCAGCAA CTGCAAGAGA 

10 AGATTAGAGA GCTCTGAAGC AGAGAGAAAA CAGCTACAGG 
GCAGAACTGG TATGCTTAAA CAACACTGAG ATTTCAGAGA 
AAACTTAAAG AAACTCAGAG CAAATACGAG GAGGCTATGA 
AAGCAGATGA AACTCGGTCT TGTCTCACCT GAAAGCATGG 
GAGCTGAGGG TCACGGAAGA GGAAATAAAT GTGCTAAAGC 

15 GAAGAAAGTG AAAGAAATAA AGAGAAAGTG AGAGAGTTAG 
GAGAAAGGTA CAGTGATTAA GCCACCTGTG GAAGAGTACG 
TGCTCTGTTA TTGAGAATAT GAATAAGGAG AAAGCATTTT 
GCCCAAGAAG AAATCATGAA ATTAAAAGAC ACACTAAAAA 
AGTGATGAAG CTGAGGACAT GAAAGAAGCC ATGAATAGGA 

20 CAGGTGAGCG AGCTGTCACA GCTGTACAAA GAAGCCCAGG 
AAGAGGAAAT CTCTAGAGGA TGTCACAGCT GAATATATCC 
CTGATGCAAT TGACAAACGT GTCCAGGGCT AAAGCAGAAG 
TCTCAGTATT CAAAAGTGTT GAATGAGTTG ACCCAGCTCA 
AAAGAGAACT CTGTCTCTAT CACAGAACAT TTGCAAGTGA 

25 GCAAAAGAGA TGGAAGAAAA AATAAGCAAT CTTAAGGAAC 
GAAGTAGCAA AGCTGGAGAA ACAACTCTTA GAAGAGAAAG 
GTACCTCGGT CTTCCTATGA AAAACTCCAG TCATCCTTAG 
GCATCGAAAT TAAAGGAATC TGTGAAAGAG AAAGAGAAGG 
ATTAGAAGTG AGGTCTCACA GGTGAAAAGA GAAAAGGAAA 

30 TCCAAAGAGC AAGAAGTAAA TGAACTTCTG CAAAAATTCC 
GCAGAAATGA AAAGATACGC TGAGAGCTCT TCAAAACTGG 
ATAAATGAGA TGTCGAAGGA AGTCACCAAA TTGAAGGAGG 
CTCTCCTACT CAACAAGCTC ATCCAAAAGG CAGAGTCAGC 
CAAGTCAAAC AGCTCCAGAA CCAGCTGGCG GAATGCAAGA 

35 TCAGTTTACA GAATGCATCT TCTGTATGCT GTGCAGGGCC 
AAAGTACTGA AGCAAATC CT TACCATGTGT AAAAACCAGT 
TTCCTTGGCA GGACACTGCC CCTTGTCATC TGTCTTTGTG 
AGCCGCTGCC ATTGTTCTCA TTCGTGGTAT GCACTGTGGC 
TCCAAAGGTT TCTGAGGACT TCTCCCAGGA GAAGACTGCC 

40 ACTTCAAACC AGCAGAGGTG AAAGTCCCTG TCATCCCTTC 
GCCATGCCCA GAGGTCTGGT CCTGATGCTG GCAGGGGGGC 
GGCTGAGTGG CTTTATCACC ACCGAGTGAT GTGCTGAGGC 
TCCATTCCTG TACTCGGGCA GTGCCATTCA GCACAGGAGA 
TCAATTCCAA AACATGATTT AATTTCTAAC TAAATTAGTA 

45 CTGCTTAAAA CCCTTCATCA TGATATCCTG TGGATTTAAA 
TTCCCATCTG CCTTATATAT CTCATCACCC TGCTTATCAA 
ATTAACTAAA ATATGAAACT TAAAAACAAA AGCAAGTTGT 
AGTAAATTGT TGACATACTG CAAATTTTCT ATGCAAACTT 
AAGCTCAGGA AATCCAAACA TTTGTGTTTC AACAAGGGAC 

50 CCAAAAGAAA TGCCTCATAG TTCTTAACCT CAACTTTTGT 
AATATT TTT A TTGGCTCATA AAGATGTTTT CATATCTGAA 
CAGTAGATTA TATTAACAAA ATACTTTTTA GGTAGCCATG 
TAACTTTTTC CTTAAAGTTT TCAGCTATAG CAAAAGGTAG 
ATGAGCTGCC ACCAACACCC CTAGAACTTT CAGCCATGGT 

55 TTTCTGAATC TAGCAAATCC TCCTTTTACC CGTTGAATGT 
CAGCGCCCAT AAATGATCTC TAGAAGGACT GTTAGTACCA 
GCTAAAAACC CTGATATGGT AATATTATGG TGCATAGCAG 
TTCTGTTCAC TTTACTTTCA GGTTAAAAAT GTTTCTAACA 
GCATTAATCT TGTTGAGGGA GAGAGACAGA ATCCTGGACT 

60 AGTAGGGCCT GCTCTGACAG GGCCCATGTC CCACAAGGCT 
TGGCTGTGCT GGATGATATG TTGATCTGTA TTGGATAAGG 
AATGGCTTTA AAGCTTGGTG TTACTTTTCT TAAGTTGTTT 
AAAAATGCTC CAAAGAAATG TGAAAGGACC TTTTGTCACA 
CAGCCCCTTC TGCCCCCGCA CAGAAATGCT GCAGAGTATA 

65 TAGGATGCCT GACGAGGTGT AGCCTTTTAT CTTGTTTCCG 
ACTCTGGTTA AATATTGAAA AGTTATATGC TGTAGTTTTT 
TACAGAAGTT ATTGGAGAAA ATAAACTTGT TTCATTTTGC 
AAAAA 

70 Seq ID No: 145 Protein sequence: 
Protein Accession #: NP 056392.1 



1 11 21 31 

75 | | | | 

MKSLKAKFRK SDTNEWNKND DRLLQAVENG DAEKVASLLG 



PCT/US02/04915 



CACCTCCTAT 


CAGTCCTACC 


960 


CTCCACTATC 


GGGAAAGGAA 


1020 


GTTCTATACG 


AGAAAACAAA 


1080 


TGGATATAAG 


TTCTGAAGCT 


1140 


CTTCCCTTAC 


CTTACACAAT 


1200 


AGGCGGAAGC 


AGACCTAAGC 


1260 


CCCTGGGAAA 


ACCTGGTGAA 


1320 


TACATTCTTT 


AGGTAAATCC 


1380 


TTTTGCAAGA 


TCTACAGAAG 


1440 


TCGAACTCCA 


ATCCCGAAGG 


1500 


ACAGCTCTGA 


CCTCAGCCAG 


1560 


AAGAAGTCCT 


TAGTGTGCAG 


1620 


ATAATTATTC 


ACATTTCCAC 


1680 


AGGATCTGCA 


GAATGCATTA 


1740 


AGGAAAAACT 


GGTAGAGAGG 


1800 


AGGAAATGAA 


AAGTTCATAT 


1860 


TGTTTGAGAA 


ATACCAAGAA 


1920 


GTCAGATGAC 


ACAGGAAGCC 


1980 


TGATAGATGA 


ACTCAATAAA 


2040 


CTGAGCTGGA 


GGATTACAGG 


2100 


ATAAAGCAGA 


GCATGAGAAA 


2160 


ATGCACTGTC 


TGAAATGAAG 


2220 


AACAACTGGT 


GGATGCACAA 


2280 


TAACCACGCT 


GCGGACTGCA 


2340 


ACCTTGCAAG 


CAAGGAAGTG 


2400 


CTGCTATGAC 


TGATGCAATG 


2460 


AGAGTGAAGT 


GAGTGTGTTG 


2520 


TCCATTCAGA 


GGTTGTCCAG 


2580 


ATATTCAGAC 


TCTCTTGAAA 


2640 


AGCAAGCTCA 


GGAAGAACTT 


2700 


AGGAAGATAA 


AGATAAAAAG 


2760 


CCTTGAACAG 


CCTCTCCCAG 


2820 


AGCTGGAGGC 


GCTGCAGCAG 


2880 


AACAACACCA 


GGAGGTCATA 


2940 


AGATGGATGA 


AGATGTCCAG 


3000 


CTCAAAAGAA 


GTAAAGTGGA 


3060 


TTAGATCCAG 


AGTTGTCGGC 


3120 


CTAGCGTAGC 


TTCTTCCCTT 


3180 


CGCCTCAGAA 


CTGCTTAGAG 


3240 


AGATTCCAGA 


GCTGGGATCA 


3300 


CCCCTCCTCC 


ATCCCTGACT 


3360 


CTCCTGCAGT 


GAATGCTCCT 


3420 


GCTCTTTTTG 


CCTTTGGCTT 


3480 


TGGCACTAGT 


TATGAAGTAT 


3540 


AACTCTAATT 


CCATGTTTTC 


3600 


TATTCAGTTT 


GATGAGCACT 


3660 


CCTTAAAAGT 


TCTTTTTTTA 


3720 


GCCTCCTGCT 


GTTATCTGTG 


3780 


AGTAAACTGT 


GTGTTTACAG 


3840 


AGAAGTATTT 


TTTTCTCTGT 


3900 


CTCCTAAATA 


AGTGAAATTA 


3960 


CTTGAGACTT 


TTTAAAAATA 


4020 


TTATGTATGC 


CAGACCTAAT 


4080 


GTCTTCAGAA 


TTGTAGCGCA 


4140 


TTTGAATGCC 


CTGACTCTAC 


4200 


ATCTGTTTTT 


CAACTTTGAA 


4260 


AGGTCTCGGA 


AAAAAAATAT 


4320 


CGCTTGCAAC 


TTCCCTTATG 


4380 


CTCCAAAGTA 


TTTAACTGAA 


4440 


GCTTGGCCTC 


AGTGGGTGCT 


4500 


ACCAATGACA 


GCAAAGCAAA 


4560 


AATTATAGTT 


AAGCAATTTC 


4620 


GCACTTCAGA 


AAATACACAA 


4680 


TAAAACTTGA 


GACATTTTTG 


4740 


GATGCATATT 


TATTACGAGT 


4800 


AGTATTTTGT 


CTTTGTAATT 


4860 


AAAAAAAAAA 


AAAAAAAAAA 


4920 


41 


51 




1 

KKGASATKHD 


1 

SEGKTAFHLA 


60 
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AAKGHVECLR VMITHGVDVT AQDTTGHSAL HLAAKNSHHE CIRRLLQSKC PAESVDSSGK 120 

TALHYAAAQG CLQAVQILCE HKSPINLKDL DGNIPLLLAV QNGHSEICHF LLDHGADVNS 18 0 

RNKSGRTALM LACEIGSSNA VEALIKKGAD LNLVDSLGYN ALHYSKLSEN AGIQSLLLSK 24 0 

ISQDADLKTP TKPKQHDQVS KISSERSGTP KTRKAPPPPI SPTQLSDVSS PRSITSTPLS 300 

GKESVFFAEP PFKAEISSIR ENKDRLSDST TGADSLLDIS SEADQQDLLS LLQAKVASLT 360 

LHNKELQDKL QAKSPKEAEA DLSFDSYHST QTDLGPSLGK PGETSPPDSK SSPSVLIHSL 420 

GKSTTDNDVR IQQLQEILQD LQKRLESSEA ERKQLQVELQ SRRAELVCLN NTEISENSSD 480 

LSQKLKETQS KYEEAMKEVL SVQKQMKLGL VSPESMDNYS HFHELRVTEE EINVLKQDLQ 540 

NALEESERNK EKVRELEEKL VEREKGTVIK PPVEEYEEMK SSYCSVIENM NKEKAFLFEK 600 

YQEAQEEIMK LKDTLKSQMT QEASDEAEDM KEAMNRMIDE LNKQVSELSQ LYKEAQAELE 660 

DYRKRKSLED VTAEYIHKAE HEKLMQLTNV SRAKAEDALS EMKSQYSKVL NELTQLKQLV 720 

DAQKENSVSI TEHLQVITTL RTAAKEMEEK ISNLKEHLAS KEVEVAKLEK QLLEEKAAMT 780 

DAMVPRSSYE KLQSSLESEV SVLASKLKES VKEKEKVHSE WQIRSEVSQ VKREKENIQT 840 

LLKSKEQEVN ELLQKFQQAQ EELAEMKRYA ESSSKLEEDK DKKINEMSKE VTKLKEALNS 900 

LSQLSYSTSS SKRQSQQLEA LQQQVKQLQN QLAECKKQHQ EVISVYRMHL LYAVQGQMDE 960 
DVQKVLKQIL TMCKNQSQKK 

Seq ID NO: 146 DNA sequence 

Nucleic Acid Accession #: NM_000459.1 

Coding sequence: 14 9-3523 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CTTCTGTGCT GTTCCTTCTT GCCTCTAACT TGTAAACAAG ACGTACTAGG ACGATGCTAA 60 

TGGAAAGTCA CAAACCGCTG GGTTTTTGAA AGGATCCTTG GGACCTCATG CACATTTGTG 120 

GAAACTGGAT GGAGAGATTT GGGGAAGC AT G GACTCTTTA GCCAGCTTAG TTCTCTGTGG 180 

AGTCAGCTTG CTCCTTTCTG GAACTGTGGA AGGTGCCATG GACTTGATCT TGATCAATTC 240 

CCTACCTCTT GTATCTGATG CTGAAACATC TCTCACCTGC ATTGCCTCTG GGTGGCGCCC 300 

CCATGAGCCC ATCACCATAG GAAGGGACTT TGAAGCCTTA ATGAACCAGC ACCAGGATCC 360 

GCTGGAAGTT ACTCAAGATG TGACCAGAGA ATGGGCTAAA AAAGTTGTTT GGAAGAGAGA 420 

AAAGGCTAGT AAGATCAATG GTGCTTATTT CTGTGAAGGG CGAGTTCGAG GAGAGGCAAT 480 

CAGGATACGA ACCATGAAGA TGCGTCAACA AGCTTCCTTC CTACCAGCTA CTTTAACTAT 540 

GACTGTGGAC AAGGGAGATA ACGTGAACAT ATCTTTCAAA AAGGTATTGA TTAAAGAAGA 600 

AGATGCAGTG ATTTACAAAA ATGGTTCCTT CATCCATTCA GTGCCCCGGC ATGAAGTACC 660 

TGATATTCTA GAAGTACACC TGCCTCATGC TCAGCCCCAG GATGCTGGAG TGTACTCGGC 720 

CAGGTATATA 'GGAGGAAACC TCTTCACCTC GGCCTTCACC AGGCTGATAG TCCGGAGATG 780 

TGAAGCCCAG AAGTGGGGAC CTGAATGCAA CCATCTCTGT ACTGCTTGTA TGAACAATGG 840 

TGTCTGCCAT GAAGATACTG GAGAATGCAT TTGCCCTCCT GGGTTTATGG GAAGGACGTG 900 

TGAGAAGGCT TGTGAACTGC ACACGTTTGG CAGAACTTGT AAAGAAAGGT GCAGTGGACA 960 

AGAGGGATGC AAGTCTTATG TGTTCTGTCT CCCTGACCCC TATGGGTGTT CCTGTGCCAC 1020 

AGGCTGGAAG GGTCTGCAGT GCAATGAAGC ATGCCACCCT GGTTTTTACG GGCCAGATTG 1080 

TAAGCTTAGG TGCAGCTGCA ACAATGGGGA GATGTGTGAT CGCTTCCAAG GATGTCTCTG 1140 

CTCTCCAGGA TGGCAGGGGC TCCAGTGTGA GAGAGAAGGC ATACCGAGGA TGACCCCAAA 1200 

GATAGTGGAT TTGCCAGATC ATATAGAAGT AAACAGTGGT AAATTTAATC CCATTTGCAA 1260 

AGCTTCTGGC TGGCCGCTAC CTACTAATGA AGAAATGACC CTGGTGAAGC CGGATGGGAC 1320 

AGTGCTCCAT CCAAAAGACT TTAACCATAC GGATCATTTC TCAGTAGCCA TATTCACCAT 13 80 

CCACCGGATC CTCCCCCCTG ACTCAGGAGT TTGGGTCTGC AGTGTGAACA CAGTGGCTGG 1440 

GATGGTGGAA AAGCCCTTCA ACATTTCTGT TAAAGTTCTT CCAAAGCCCC TGAATGCCCC 1500 

AAACGTGATT GACACTGGAC ATAACTTTGC TGTCATCAAC ATCAGCTCTG AGCCTTACTT 1560 

TGGGGATGGA CCAATCAAAT CCAAGAAGCT TCTATACAAA CCCGTTAATC ACTATGAGGC 1620 

TTGGCAACAT ATTCAAGTGA CAAATGAGAT TGTTACACTC AACTATTTGG AACCTCGGAC 1680 

AGAATATGAA CTCTGTGTGC AACTGGTCCG TCGTGGAGAG GGTGGGGAAG GGCATCCTGG 1740 

ACCTGTGAGA CGCTTCACAA CAGCTTCTAT CGGACTCCCT CCTCCAAGAG GTCTAAATCT 1800 

CCTGCCTAAA AGTCAGACCA CTCTAAATTT GACCTGGCAA CCAATATTTC CAAGCTCGGA 1860 

AGATGACTTT TATGTTGAAG TGGAGAGAAG GTCTGTGCAA AAAAGTGATC AGCAGAATAT 1920 

TAAAGTTCCA GGCAACTTGA CTTCGGTGCT ACTTAACAAC TTACATCCCA GGGAGCAGTA 1980 

CGTGGTCCGA GCTAGAGTCA ACACCAAGGC CCAGGGGGAA TGGAGTGAAG ATCTCACTGC 2040 

TTGGACCCTT AGTGACATTC TTCCTCCTCA ACCAGAAAAC ATCAAGATTT CCAACATTAC 2100 

ACACTCCTCG GCTGTGATTT CTTGGACAAT ATTGGATGGC TATT CTATTT CTTCTATTAC 2160 

TATCCGTTAC AAGGTTCAAG GCAAGAATGA AGACCAGCAC GTTGATGTGA AGATAAAGAA 2220 

TGCCACCATC ATTCAGTATC AGCTCAAGGG CCTAGAGCCT GAAACAGCAT ACCAGGTGGA 2280 

CATTTTTGCA GAGAACAACA TAGGGTCAAG CAACCCAGCC TTTTCTCATG AACTGGTGAC 2340 

CCTCCCAGAA TCTCAAGCAC CAGCGGACCT CGGAGGGGGG AAGATGCTGC TTATAGCCAT 2400 

CCTTGGCTCT' GCTGGAATGA CCTGCCTGAC TGTGCTGTTG GCCTTTCTGA TCATATTGCA 2460 

ATTGAAGAGG GCAAATGTGC AAAGGAGAAT GGCCCAAGCC TTCCAAAACG TGAGGGAAGA 2520 

ACCAGCTGTG CAGTTCAACT CAGGGACTCT GGCCCTAAAC AGGAAGGTCA AAAACAACCC 2580 

AGATCCTACA ATTTATCCAG TGCTTGACTG GAATGACATC AAATTTCAAG ATGTGATTGG 2640 

GGAGGGCAAT TTTGGCCAAG TTCTTAAGGC GCGCATCAAG AAGGATGGGT TACGGATGGA 2700 

TGCTGCCATC AAAAGAATGA AAGAATATGC CTCCAAAGAT GATCACAGGG ACTTTGCAGG 2760 

AGAACTGGAA GTTCTTTGTA AACTTGGACA CCATCCAAAC ATCATCAATC TCTTAGGAGC 2820 

ATGTGAACAT CGAGGCTACT TGTACCTGGC CATTGAGTAC GCGCCCCATG GAAACCTTCT 2880 

GGACTTCCTT CGCAAGAGCC GTGTGCTGGA GACGGACCCA GCATTTGCCA TTGCCAATAG 294 0 

CACCGCGTCC ACACTGTCCT CCCAGCAGCT CCTTCACTTC GCTGCCGACG TGGCCCGGGG 3000 

CATGGACTAC TTGAGCCAAA AACAGTTTAT CCACAGGGAT CTGGCTGCCA GAAACATTTT 3060 

AGTTGGTGAA AACTATGTGG CAAAAATAGC AGATTTTGGA TTGTCCCGAG GTCAAGAGGT 3120 
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10 



15 



GTACGTGAAA 
TTACAGTGTG 
TGTTAGCTTA 
GCCCCAGGGC 
GAGACAATGC 
CTTAAACAGA 
TACTTATGCA 
CCCTCTGTTT 
CTGCCAAAGG 
TAATATTTAA 
TCCCTCACCT 
TTTCCACAGC 
TTGCTTACAA 
ATTTTCTTTT 
TGGGTGACAT 
GCACATTGTA 
CACTTTGCAC 



AAGACAATGG 
TACACAACCA 
GGAGGCACAC 
TACAGACTGG 
TGGCGGGAGA 
ATGTTAGAGG 
GGAATTGACT 
CCCTTTCACT 
ATGTGATATA 
GACACTGAAA 
GTAGCATGCC 
CTGCAAGTTC 
GCCTAAGAAT 
CTTTTCTCTG 
TTGGGAGACA 
AAAAGTTTTA 
TGATATATCA 



GAAGGCTCCC 
ACAGTGATGT 
CCTACTGCGG 
AGAAGCCCCT 
AGCCTTATGA 
AGCGAAAGAC 
GTTCTGCTGA 
GGCATGGGAG 
TAAGTGTACA 
AATCTAAGTG 
AGTCCCGTTT 
AGT CCAGG AT 
CTTTAGAGAA 
GTAATATTGA 
TGTGACATTT 
GTTTTGATGA 
TGAGTGAATA 



AGTGCGCTGG 
ATGGTCCTAT 
GATGACTTGT 
GAACTGTGAT 
GAGGCCATCA 
CTACGTGAAT 
AGAAGCGGCC 
ACCCTTGACA 
TATGTGCTGG 
ATATAAATCA 
CATTTAGTCA 
GCTAACATCT 
GTATACATAA 
CTTGTATATT 
ATATATTGAA 
GTTGTGAGTT 
AATGTCTTGC 



ATGGCCATCG 
GGTGTGTTAC 
GCAGAACTCT 
GATGAGGTGT 
TTTGCCCAGA 
ACCACGCTTT 
TAGGACAGAA 
ACTGCTGAGA 
AATTCTAACA 
GATTCTTCTC 
TGTGACCACT 
AAAAATAGAC 
GTTTAGGATA 
TTAAGAAATA 
TTAATATCCC 
TACCTTGTAT 
CTACTCAAAA 



AGTCACTGAA 3180 

TATGGGAGAT 324 0 

ACGAGAAGCT 3300 

ATGATCTAAT 3360 

TATTGGTGTC 3420 

ATGAGAAGTT 3480 

CATCTGTATA 3540 

AAACATGCCT 3600 

AGTCATAGGT 3660 

TCTCATTTTA 3720 

CTGTCTTGTG 3780 

TTAAATCTCA 3840 

AAATAATGGG 3900 

ACAGAAAGCC 3960 

TACATGTATT 4020 

ACTGTAGGCA 4 080 
AAAAAAAA 



Seg ID No: 147 Protein sequence: 
20 Protein Accession #: NP 000450.1 



25 



30 



35 



40 



45 



MDSIiASLVLC 
FEALMNQHQD 
QASFLPATLT 
AQPQDAGVYS 
ICPPGFMGRT 
ACHPGFYGPD 
VNSGKFNPIC 
VWVCSVNTVA 
LLYKPVNHYE 
IGLPPPRGLN 
LLNNLHPREQ 
ILDGYSISSI 
SNPAFSHELV 
MAQAFQNVRE 
ARIKKDGLRM 
AIEYAPHGNL 
IHRDLAARNI 
VWSYGVLLWE 
ERPSFAQILV 



11 
I 

GVSLLLSGTV 
PLEVTQDVTR 
MTVDKGDNVN 
ARYIGGNLFT 
CEKACELHTF 
CKLRCSCNNG 
KASGWPLPTN 
GMVEKPFNIS 
AWQHIQVTNE 
LLPKSQTTLN 
YWRARVNTK 
TIRYKVQGKN 
TLPESQAPAD 
EPAVQFNSGT 
DAAIKRMKEY 
LDFLRKSRVL 
LVGENYVAKI 
IVSLGGTPYC 
SLNRMLEERK 



21 
I 

EGAMDLILIN 
EWAKKVWKR 
ISFKKVLIKE 
SAFTRLIVRR 
GRTCKERCSG 
EMCDRFQGCIi 
EEMTLVKPDG 
VKVLPKPtiNA 
IVTLNYLEPR 
LTWQPIFPSS 
AQGEWSEDLT 
EDQHVDVKIK 
LGGGKMLLIA 
LALNRKVKNN 
ASKDDHRDFA 
ETDPAFAIAN 
ADFGLSRGQE 
GMTCAELYEK 
TYVNTTLYEK 



31 



SLPLVSDAET 
EKASKINGAY 
EDAVIYKNGS 
CEAQKWGPEC 
QEGCKSYVFC 
CSPGWQGLQC 
TVLHPKDFNH 
PNVIDTGHNF 
TEYELCVQLV 
EDDFYVEVER 
AWTLSDILPP 
NATIIQYQLK 
ILGSAGMTCL 
PDPTIYPVLD 
GELEVLCKLG 
STASTLSSQQ 
VYVKKTMGRL 
LPQGYRLEKP 
FTYAGIDCSA 



41 
I 

SLTCIASGWR 
FCEGRVRGEA 
FIHSVPRHEV 
NHLCTACMNN 
LPDPYGCSCA 
EREGIPRMTP 
TDHFSVAI FT 
AVINISSEPY 
RRGEGGEGHP 
RSVQKSDQQN 
QPENIKISNI 
GLEPETAYQV 
TVLLAFLIIL 
WNDIKFQDVI 
HHPNI INLLG 
LLHFAADVAR 
PVRWMAIESL 
LNCDDEVYDL 
EEAA 



51 



PHEPITIGRD 60 

IRIRTMKMRQ 120 

PDILEVHIiPH 180 

GVCHEDTGEC 240 

TGWKGLQCNE 300 

KIVDLPDHIE 360 

IHRILPPDSG 420 

FGDGPIKSKK 480 

GPVRRFTTAS 540 

IKVPGNLTSV 600 

THSSAVISWT 660 

DIFAENNIGS 720 

QLKRANV QRR 78 0 

GEGNFGQVLK 84 0 

ACEHRGYLYL 900 

GMDYLSQKQF 960 

NYSVYTTNSD 1020 

MRQCWREKPY 1080 



Seq ID NO: 148 DNA sequence 

Nucleic Acid Accession #: NM_000552.2 

Coding sequence: 311-8752 (underlined sequences correspond to start and stop codons) 



50 



55 



60 



65 



70 



75 



AGCTCACAGC 
TATCTCCCCC 
GAGCTGTAGC 
GCACCATTGT 
GCAGGGGAAG 
GCCCTCATTT 
GCCAGGGACC 
TTTCGGAAGT 
CAGTTACCTC 
GAATGGCAAG 
TGTCAATGGT 
GCTGTATCTA 
GGCCAGGATC 
GACCTGCGGG 
AGGGACCTTG 
ACAGTGGTGT 
GCAGAAGGGC 
CCACCCTCTG 
TGCTGGGGGG 
GGAGGGAATG 
TGGTATGGAG 
CAATGAAATG 
GGATGAAGGC 
CCCTCCCGGC 
GATCTGCAGC 



11 
I 

TATTGTGGTG 
AGCAGTGGGG 
AGACCTGATT 
CCAGCAGCTG 
GCACCATTGT 
ATGATTCCTG 
CTTTGTGCAG 
GACTTCGTCA 
CTGGCAGGGG 
AGAGTGAGCC 
ACCGTGACAC 
GAAACTGAGG 
GATGGCAGCG 
CTGTGTGGCA 
ACCTCGGACC 
GAACGGGCAT 
CTGTGGGAGC 
GTGGACCCCG 
CTGGAGTGCG 
GTGCTGTACG 
TATAGGCAGT 
TGTCAGGAGC 
CTCTGCGTGG 
ACCTCCCTCT 
AATGAAGAAT 



21 
I 

GGAAAGGGAG 
ACTCCACAGC 
GAGCCTTTGC 
AGTTTCCCAG 
CCAGCAGCTG 
CCAGATTTGC 
AAGGAACTCG 
ACACCTTTGA 
GCTGCCAGAA 
TCTCCGTGTA 
AGGGGGACCA 
CTGGGTACTA 
GCAACTTTCA 
ACTTTAACAT 
CTTATGACTT 
CTCCTCCCAG 
AGTGCCAGCT 
AGCCTTTTGT 
CCTGCCCTGC 
GCTGGACCGA 
GTGTGTCCCC 
GATGCGTGGA 
AGAGCACCGA 
CTCGAGACTG 
GTCCAGGGGA 



31 



41 



51 



GGTGGTTGGT 
CCCTGGGCTA 
AGCAGCTGAG 
GGACCTTGGA 
AGTTTCCCAG 
CGGGGTGCTG 
CGGCAGGTCA 
TGGGAGCATG 
ACGCTCCTTC 
TCTTGGGGAA 
AAGAGTCTCC 
CAAGCTGTCC 
AGTCCTGCTG 
CTTTGCTGAA 
TGCCAACTCA 
CAGCTCATGC 
TCTGAAGAGC 
GGCCCTGTGT 
CCTCCTGGAG 
CCACAGCGCG 
TTGCGCCAGG 
TGGCTGCAGC 
GTGTCCCTGC 
CAACACCTGC 
GTGCCTTGTC 



GGATGTCACA 
CATAACAGCA 
AGCATGGCCT 
GATAGCCGCA 
GGACCTTGGA 
CTTGCTCTGG 
TCCACGGCCC 
TACAGCTTTG 
TCGATTATTG 
TTTTTTGACA 
ATGCCCTATG 
GGTGAGGCCT 
TCAGACAGAT 
GATGACTTTA 
TGGGCTCTGA 
AACATCTCCT 
ACCTCGGTGT 
GAGAAGACTT 
TACGCCCGGA 
TGCAGCCCAG 
ACCTGCCAGA 
TGCCCTGAGG 
GTGCATTCCG 
ATTTGCCGAA 
ACTGGTCAAT 



GCTTGGGCTT 60 

AGACAGTCCG 120 

AGGGTGGGCG 180 

GCCCTCATTT 240 

GATAGCCGCA 300 

CCCTCATTTT 3 60 

GATGCAGCCT 420 

CGGGATACTG 4 80 

GGGACTTCCA 54 0 

TCCATTTGTT 600 

CCTCCAAAGG 660 

ATGGCTTTGT 720 

ACTTCAACAA 780 

TGACCCAAGA 84 0 

G CAGTGG AG A 900 

CTGGGGAAAT 960 

TTGCCCGCTG 1020 

TGTGTGAGTG 108 0 

CCTGTGCCCA 1140 

TGTGCCCTGC 1200 

GCCTGCACAT 1260 

GACAGCTCCT 1320 

GAAAGCGCTA 1380 

ACAGCCAGTG 144 0 

CCCACTTCAA 1500 



250 
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GAGCTTTGAC AACAGATACT TCACCTTCAG 
TTGCCAGGAC CACTCCTTCT CCATTGTCAT 
CGCTGTGTGC ACCCGCTCCG TCACCGTCCG 
ACTGAAGCAT GGGGCAGGAG TTGCCATGGA 
5 AGGTGACCTC CGCATCCAGC ATACAGTGAC 
CCTGCAGATG GACTGGGATG GCCGCGGGAG 
CGGGAAGACC TGCGGCCTGT GTGGGAATTA 
CCCCTCTGGG CTGGCAGAGC CCCGGGTGGA 
GGACTGCCAG GACCTGCAGA AGCAGCACAG 

10 CAGGTTCTCC GAGGAGGCGT GCGCGGTCCT 
TGCCGTCAGC CCGCTGCCCT ACCTGCGGAA 
CGGCCGCGAG TGCCTGTGCG GCGCCCTGGC 
CGTGCGCGTC GCGTGGCGCG AGCCAGGCCG 
GTACCTGCAG TGCGGGACCC CCTGCAACCT 

15 GGAATGCAAT GAGGCCTGCC TGGAGGGCTG 
GAGGGGGGAC TGCGTGCCCA AGGCCCAGTG 
GCCAGAAGAC ATCTTCTCAG ACCATCACAC 
CTGTACCATG AGTGGAGTCC CCGGAAGCTT 
GTCTCATCGC AGCAAAAGGA GCCTATCCTG 

20 CGCTGACAAC CTGCGGGCTG AAGGGCTCGA 
GGAGTGCATG AGCATGGGCT GTGTCTCTGG 
GGAGTGCATG AGCATGGGCT GTGTCTCTGG 
TGAGAACAGA TGTGTGGCCC TGGAAAGGTG 
CCCTGGAGAA ACAGTGAAGA TTGGCTGCAA 

25 CTGCACAGAC CATGTGTGTG ATGCCACGTG 
CTTCGACGGG CTCAAATACC TGTTCCCCGG 
CTGCGGCAGT AACCCTGGGA CCTTTCGGAT 
CTCAGTGAAA TGCAAGAAAC GGGTCACCAT 
TGACGGGGAG GTGAATGTGA AGAGGCCCAT 

30 GTCTGGCCGG TACATCATTC TGCTGCTGGG 
CCTGAGCATC TCCGTGGTCC TGAAGCAGAC 
GAATTTTGAT GGCATCCAGA ACAATGACCT 
CCCTGTGGAC TTTGGGAACT CCTGGAAAGT 
GCCTCTGGAC TCATCCCCTG CCACCTGCCA 

35 TTCCTCCTGT AGAATCCTTA CCAGTGACGT 
CGAGCCATAT CTGGATGTCT GCATTTACGA 
CGCCTGCTTC TGCGACACCA TTGCTGCCTA 
GGTGACCTGG AGGACGGCCA CATTGTGCCC 
GAACGGGTAT GAGTGTGAGT GGCGCTATAA 

40 TCAGCACCCT GAGCCACTGG CCTGCCCTGT 
CCCTCCAGGG AAAATCCTGG ATGAGCTTTT 
AGTGTGTGAG GTGGCTGGCC * GGCGTTTTGC 
TGACCCTGAG CACTGCCAGA TTTGCCACTG 
CCAGGAGCCG GGAGGCCTGG TGGTGCCTCC 

45 GTATGTGGAG GACATCTCGG AACCGCCGTT 
CCTGGTCTTC CTGCTGGATG GCTCCTCCAG 
GGCCTTTGTG GTGGACATGA TGGAGCGGCT 
CGTGGTGGAG TACCACGACG GCTCCCACGC 
GTCAGAGCTG CGGCGCATTG CCAGCCAGGT 

50 CAGCGAGGTC TTGAAATACA CACTGTTCCA 
CTCCCGCATC GCCCTGCTCC TGATGGCCAG 
TGTCCGCTAC GTCCAGGGCC TGAAGAAGAA 
GCCCCATGCC AACCTCAAGC AGATCCGCCT 
CTTCGTGCTG AGCAGTGTGG ATGAGCTGGA 

55 CTGTGACCTT GCCCCTGAAG CCCCTCCTCC 
TGTGGGCCCG GGGCTCTTGG GGGTTTCGAC 
GGATGTGGCG TTCGTCCTGG AAGGATCGGA 
CAAGGAGTTC ATGGAGGAGG TGATTCAGCG 
CACGGTGCTG CAGTACTCCT ACATGGTGAC 

60 CAAAGGGGAC ATCCTGCAGC GGGTGCGAGA 
CACTGGGCTG GCCCTGCGGT ACCTCTCTGA 
GGAGCAGGCG CCCAACCTGG TCTACATGGT 
GAGGCTGCCT GGAGACATCC AGGTGGTGCC 
GGAGCTGGAG AGGATTGGCT GGCCCAATGC 

65 CCCCCGAGAG GCTCCTGACC TGGTGCTGCA 
CCCCACCCTC TCCCCTGCAC CTGACTGCAG 
TGGCTCCTCC AGTTTCCCAG CTTCTTATTT 
CATTTCAAAA GCCAATATAG GGCCTCGTCT 
CATCACCACC ATTGACGTGC CATGGAACGT 

70 TGTGGACGTC ATGCAGCGGG AGGGAGGCCC 
TGTGCGATAC TTGACTTCAG AAATGCATGG 
CATCCTGGTC ACGGACGTCT CTGTGGATTC 
CAACAGAGTG ACAGTGTTCC CTATTGGAAT 
GATCTTGGCA GGCCCAGCAG GCGACTCCAA 

75 CCCTACCATG GTCACCTTGG GCAATTCCTT 
GATTTGCATG GATGAGGATG GGAATGAGAA 



TGGGATCTGC CAGTACCTGC TGGCCCGGGA 1560 

TGAGACTGTC CAGTGTGCTG ATGACCGCGA 1620 

GCTGCCTGGC CTGCACAACA GCCTTGTGAA 1680 

TGGCCAGGAC ATCCAGCTCC CCCTCCTGAA 174 0 

GGCCTCCGTG CGCCTCAGCT ACGGGGAGGA 1800 

GCTGCTGGTG AAGCTGTCCC CCGTCTACGC 1860 

CAATGGCAAC CAGGGCGACG ACTTCCTTAC 1920 

GGACTTCGGG AACGCCTGGA AGCTGCACGG 1980 

CGATCCCTGC GCCCTCAACC CGCGCATGAC 2040 

GACGTCCCCC ACATTCGAGG CCTGCCATCG 2100 

CTGCCGCTAC GACGTGTGCT CCTGCTCGGA 2160 

CAGCTATGCC GCGGCCTGCG CGGGGAGAGG 2220 

CTGTGAGCTG AACTGCCCGA AAGGCCAGGT 2280 

GACCTGCCGC TCTCTCTCTT ACCCGGATGA 2340 

CTTCTGCCCC CCAGGGCTCT ACATGGATGA 2400 

CCCCTGTTAC TATGACGGTG AGATCTTCCA 2460 

CATGTGCTAC TGTGAGGATG GCTTCATGCA 2520 

GCTGCCTGAC GCTGTCCTCA GCAGTCCCCT 2580 

TCGGCCCCCC ATGGTCAAGC TGGTGTGTCC 2640 

GTGTACCAAA ACGTGCCAGA ACTATGACCT 2700 

CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

CTGCCTCTGC CCCCCGGGCA TGGTCCGGCA 2760 

TCCCTGCTTC CATCAGGGCA AGGAGTATGC 2820 

CACTTGTGTC TGTCGGGACC GGAAGTGGAA 2880 

CTCCACGATC GGCATGGCCC ACTACCTCAC 294 0 

GGAGTGCCAG TACGTTCTGG TGCAGGATTA 3000 

CCTAGTGGGG AATAAGGGAT GCAGCCACCC 3060 

CCTGGTGGAG GGAGGAGAGA TTGAGCTGTT 3120 

GAAGGATGAG ACTCACTTTG AGGTGGTGGA 3180 

CAAAGCCCTC TCCGTGGTCT GGGACCGCCA 3240 

ATACCAGGAG AAAGTGTGTG GCCTGTGTGG 3300 

CACCAGCAGC AACCTCCAAG TGGAGGAAGA 3360 

GAGCTCGCAG TGTGCTGACA CCAGAAAAGT 3420 

TAACAACATC ATGAAGCAGA CGATGGTGGA 3480 

CTTCCAGGAC TGCAACAAGC TGGTGGACCC 3540 

CACCTGCTCC TGTGAGTCCA TTGGGGACTG 3600 

TGCCCACGTG TGTGCCCAGC ATGGCAAGGT 3660 

CCAGAGCTGC GAGGAGAGGA ATCTCCGGGA 3720 

CAGCTGTGCA CCTGCCTGTC AAGTCACGTG 3 780 

GCAGTGTGTG GAGGGCTGCC ATGCCCACTG 3 840 

GCAGACCTGC GTTGACCCTG AAGACTGTCC 3900 

CTCAGGAAAG AAAGTCACCT TGAATCCCAG 3960 

TGATGTTGTC AACCTCACCT GTGAAGCCTG 4020 

CACAGATGCC CCGGTGAGCC CCACCACTCT 4080 

GCACGATTTC TACTGCAGCA GGCTACTGGA 414 0 

GCTGTCCGAG GCTGAGTTTG AAGTGCTGAA 4200 

GCGCATCTCC CAGAAGTGGG TCCGCGTGGC 4260 

CTACATCGGG CTCAAGGACC GGAAGCGACC 4320 

GAAGTATGCG GGCAGCCAGG TGGCCTCCAC 4380 

AATCTTCAGC AAGATCGACC GCCCTGAAGC 4440 

CCAGGAGCCC CAACGGATGT CCCGGAACTT 4500 

GAAGGTCATT GTGATCCCGG TGGGCATTGG 4560 

CAT CG AG AAG CAGGCCCCTG AGAACAAGGC 4620 

GCAGCAAAGG GACGAGATCG TTAGCTACCT 4680 

TACTCTGCCC CCCCACATGG CACAAGTCAC 474 0 

CCTGGGGCCC AAGAGGAACT CCATGGTTCT 48 00 

CAAAATTGGT GAAGCCGACT TCAACAGGAG 4860 

GATGGATGTG GGCCAGGACA GCATCCACGT 4920 

CGTGGAGTAC CCCTTCAGCG AGGCACAGTC 4980 

GATCCGCTAC CAGGGCGGCA ACAGGACCAA 5040 

CCACAGCTTC TTGGTCAGCC AGGGTGACCG 5100 

CACCGGAAAT CCTGCCTCTG ATGAGATCAA 5160 

CATTGGAGTG GGCCCTAATG CCAACGTGCA 5220 

CCCTATCCTC ATCCAGGACT TTGAGACGCT 5280 

GAGGTGCTGC TCCGGAGAGG GGCTGCAGAT 5340 

CCAGCCCCTG GACGTGATCC TTCTCCTGGA 5400 

TGATGAAATG AAGAGTTTCG CCAAGGCTTT 5460 

CACTCAGGTG TCAGTGCTGC AGTATGGAAG 5520 

GGTCCCGGAG AAAGCCCATT TGCTGAGCCT 5580 

CAGCCAAATC GGGGATGCCT TGGGCTTTGC 5640 

TGCCAGGCCG GGAGCCTCAA AGGCGGTGGT 5700 

AGTGGATGCA GCAGCTGATG CCGCCAGGTC 5760 

TGGAGATCGC TACGATGCAG CCCAGCTACG 5 820 

CGTGGTGAAG CTCCAGCGAA TCGAAGACCT 58 80 

CCTCCACAAA CTGTGCTCTG GATTTGTTAG 594 0 

GAGGCCCGGG GACGTCTGGA CCTTGCCAGA 6000 
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CCAGTGCCAC ACCGTGACTT GCCAGCCAGA TGGCCAGACC TTGCTGAAGA GTCATCGGGT 6060 

CAACTGTGAC CGGGGGCTGA GGCCTTCGTG CCCTAACAGC CAGTCCCCTG TTAAAGTGGA 6120 

AGAGACCTGT GGCTGCCGCT GGACCTGCCC CTGCGTGTGC ACAGGCAGCT CCACTCGGCA 6180 

CATCGTGACC TTTGATGGGC AGAATTTCAA GCTGACTGGC AGCTGTTCTT ATGTCCTATT 6240 

5 TCAAAACAAG GAGCAGGACC TGGAGGTGAT TCTCCATAAT GGTGCCTGCA GCCCTGGAGC 6300 

AAGGCAGGGC TGCATGAAAT CCATCGAGGT GAAGCACAGT GCCCTCTCCG TCGAGCTGCA 6360 

CAGTGACATG GAGGTGACGG TGAATGGGAG ACTGGTCTCT GTTCCTTACG TGGGTGGGAA 6420 

CATGGAAGTC AACGTTTATG GTGCCATCAT GCATGAGGTC AGATTCAATC ACCTTGGTCA 64 80 

CATCTTCACA TTCACTCCAC AAAACAATGA GTTCCAACTG CAGCTCAGCC CCAAGACTTT 6540 

10 TGCTTCAAAG ACGTATGGTC TGTGTGGGAT CTGTGATGAG AACGGAGCCA ATGACTTCAT 6600 

GCTGAGGGAT GGCACAGTCA CCACAGACTG GAAAACACTT GTTCAGGAAT GGACTGTGCA 6660 

GCGGCCAGGG CAGACGTGCC AGCCCATCCT GGAGGAGCAG TGTCTTGTCC CCGACAGCTC 6720 

CCACTGCCAG GTCCTCCTCT TACCACTGTT TGCTGAATGC CACAAGGTCC TGGCTCCAGC 6780 

CACATTCTAT GCCATCTGCC AGCAGGACAG TTGCCACCAG GAGCAAGTGT GTGAGGTGAT 6840 • 

15 CGCCTCTTAT GCCCACCTCT GTCGGACCAA CGGGGTCTGC GTTGACTGGA GGACACCTGA 6900 

TTTCTGTGCT ATGTCATGCC CACCATCTCT GGTCTACAAC CACTGTGAGC ATGGCTGTCC 6960 

CCGGCACTGT GATGGCAACG TGAGCTCCTG TGGGGACCAT CCCTCCGAAG GCTGTTTCTG 7020 

CCCTCCAGAT AAAGTCATGT TGGAAGGCAG CTGTGTCCCT GAAGAGGCCT GCACTCAGTG 7080 

CATTGGTGAG GATGGAGTCC AGCACCAGTT CCTGGAAGCC TGGGTCCCGG ACCACCAGCC 7140 

20 CTGTCAGATC TGCACATGCC TCAGCGGGCG GAAGGTCAAC TGCACAACGC AGCCCTGCCC 72 00 

CACGGCCAAA GCTCCCACGT GTGGCCTGTG TGAAGTAGCC CGCCTCCGCC AGAATGCAGA 7260 

CCAGTGCTGC CCCGAGTATG AGTGTGTGTG TGACCCAGTG AGCTGTGACC TGCCCCCAGT 7320 

GCCTCACTGT GAACGTGGCC TCCAGCCCAC ACTGACCAAC CCTGGCGAGT GCAGACCCAA 73 80 

CTTCACCTGC GCCTGCAGGA AGGAGGAGTG CAAAAGAGTG TCCCCACCCT CCTGCCCCCC 7440 

25 GCACCGTTTG CCCACCCTTC GGAAGACCCA GTGCTGTGAT GAGTATGAGT GTGCCTGCAA 7500 

CTGTGTCAAC TCCACAGTGA GCTGTCCCCT TGGGTACTTG GCCTCAACCG CCACCAATGA 7560 

CTGTGGCTGT ACCACAACCA CCTGCCTTCC CGACAAGGTG TGTGTCCACC GAAGCACCAT 7620 

CTACCCTGTG GGCCAGTTCT GGGAGGAGGG CTGCGATGTG TGCACCTGCA CCGACATGGA 7680 

GGATGCCGTG ATGGGCCTCC GCGTGGCCCA GTGCTCCCAG AAGCCCTGTG AGGACAGCTG 7740 

30 TCGGTCGGGC TTCACTTACG TTCTGCATGA AGGCGAGTGC TGTGGAAGGT GCCTGCCATC 7800 

TGCCTGTGAG GTGGTGACTG GCTCACCGCG GGGGGACTCC CAGTCTTCCT GGAAGAGTGT 7860 

CGGCTCCCAG TGGGCCTCCC CGGAGAACCC CTGCCTCATC AATGAGTGTG TCCGAGTGAA 7920 

GGAGGAGGTC TTTATACAAC AAAGGAACGT CTCCTGCCCC CAGCTGGAGG TCCCTGTCTG 7980 

CCCCTCGGGC TTTCAGCTGA GCTGTAAGAC CTCAGCGTGC TGCCCAAGCT GTCGCTGTGA 8040 

35 GCGCATGGAG GCCTGCATGC TCAATGGCAC TGTCATTGGG CCCGGGAAGA CTGTGATGAT 8100 

CGATGTGTGC ACGACCTGCC GCTGCATGGT GCAGGTGGGG GTCATCTCTG GATTCAAGCT 8160 

GGAGTGCAGG AAGACCACCT GCAACCCCTG CCCCCTGGGT TACAAGGAAG AAAATAACAC 8220 

AGGTGAATGT TGTGGGAGAT GTTTGCCTAC GGCTTGCACC ATTCAGCTAA GAGGAGGACA 8280 

GATCATGACA CTGAAGCGTG ATGAGACGCT CCAGGATGGC TGTGATACTC ACTTCTGCAA 834 0 

40 GGTCAATGAG AGAGGAGAGT ACTTCTGGGA GAAGAGGGTC ACAGGCTGCC CACCCTTTGA 8400 

TGAACACAAG TGTCTGGCTG AGGGAGGTAA AATTATGAAA ATTCCAGGCA CCTGCTGTGA 84 60 

CACATGTGAG GAGCCTGAGT GCAACGACAT CACTGCCAGG CTGCAGTATG TCAAGGTGGG 8520 

AAGCTGTAAG TCTGAAGTAG AGGTGGATAT CCACTACTGC CAGGGCAAAT GTGCCAGCAA 8580 

AGCCATGTAC TCCATTGACA TCAACGATGT GCAGGACCAG TGCTCCTGCT GCTCTCCGAC 864 0 

45 ACGGACGGAG CCCATGCAGG TGGCCCTGCA CTGCACCAAT GGCTCTGTTG TGTACCATGA 8700 

GGTTCTCAAT GCCATGGAGT GCAAATGCTC CCCCAGGAAG TGCAGCAAG T GA GGCTGCTG 8760 

CAGCTGCATG GGTGCCTGCT GCTGCCTGCC TTGGCCTGAT GGCCAGGCCA GAGTGCTGCC 8 820 

AGTCCTCTGC ATGTTCTGCT CTTGTGCCCT TCTGAGCCCA CAATAAAGGC TGAGCTCTTA 8 880 
TCTTGCTGCA TGTTCTGCTC TTGTGCCCTT CTGAGCCCAC AAT 

50 

Seq ID No: 149 Protein sequence : 
Protein Accession #: NP 000543.1 



55 1 11 21 31 41 51 

I I I I I I 

MIPARFAGVL LALALILPGT LCAEGTRGRS STARCSLFGS DFVNTFDGSM YSFAGYCSYL 60 

LAGGCQKRSF SIIGDFQNGK RVSLSVYLGE FFDIHLFVNG TVTQGDQRVS MPYASKGLYL 120 

ETEAGYYKLS GEAYGFVARI DGSGNFQVLL SDRYFNKTCG LCGNFNI FAE DDFMTQEGTL 180 

60 TSDPYDFANS WALSSGEQWC ERASPPSSSC NISSGEMQKG LWEQCQLLKS TSVFARCHPL 240 

VDPEPFVALC EKTLCECAGG LECACPALLE YARTCAQEGM VLYGWTDHSA CSPVCPAGME 300 

YRQCVSPCAR TCQSLHINEM CQERCVDGCS CPEGQLLDEG LCVESTECPC VHSGKRYPPG 360 

TSLSRDCNTC ICRNSQWICS NEECPGECLV TGQSHFKSFD NRYFTFSGIC QYLLARDCQD 420 

HSFSIVIETV QCADDRDAVC TRSVTVRLPG LHNSLVKLKH GAGVAMDGQD IQLPLLKGDL 4 80 

65 RIQHTVTASV RLSYGEDLQM DWDGRGRIaLV KLSPVYAGKT CGLCGNYNGN QGDDFLTPSG 540 

LAEPRVEDFG NAWKLHGDCQ DLQKQHSDPC ALNPRMTRFS EEACAVLTSP TFEACHRAVS 600 

PLPYLRNCRY DVCSCSDGRE CLCGALASYA AACAGRGVRV AWREPGRCEL NCPKGQVYLQ 660 

CGTPCNLTCR SLSYPDEECN EACLEGCFCP PGLYMDERGD CVPKAQCPCY YDGEIFQPED 720 

IFSDHHTMCY CEDGFMHCTM SGVPGSLLPD AVLSSPLSHR SKRSLSCRPP MVKLVCPADN 780 

70 LRAEGJJECTK TCQNYDLECM SMGCVSGCLC PPGMVRHENR CVALERCPCF HQGKEYAPGE 840 

TVKIGCNTCV CRDRKWNCTD HVCDATCSTI GMAHYLTFDG LKYLFPGECQ YVLVQDYCGS 900 

NPGTFRILVG NKGCSHPSVK CKKRVTILVE GGEIELFDGE VNVKRPMKDE THFEWESGR 960 

YIILLLGKAL SWWDRHLSI SWLKQTYQE KVCGLCGNFD GIQNNDLTSS NLQVEEDPVD 102 0 

FGNSWKVSSQ CADTRKVPLD SSPATCHNNI MKQTMVDSSC RILTSDVFQD CNKLVDPEPY 1080 

75 LDVCIYDTCS CESIGDCACF CDTIAAYAHV CAQHGKWTW RTATLCPQSC EERNLRENGY 1140 

ECEWRYNSCA PACQVTCQHP EPLACPVQCV EGCHAHCPPG KILDELLQTC VDPEDCPVCE 1200 
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VAGRRFASGK KVTLNPSDPE HCQICHCDW NLTCEACQEP GGLWPPTDA PVSPTTLYVE 1260 

DISEPPLHDP YCSRLLDLVF LLDGSSRLSE AEFEVLKAFV VDMMERLRIS QKWVRVAWE 1320 

YHDGSHAYIG LKDRKRPSEL RRIASQVKYA GSQVASTSEV LKYTLFQIFS KIDRPEASRI 1380 

ALLLMASQEP QRMSRNFVRY VQGLKKKKVI VIPVGIGPHA NLKQIRLIEK QAPENKAFVL 1440 

5 SSVDELEQQR DEIVSYLCDL APEAPPPTLP PHMAQVTVGP GLLGVSTLGP KRNSMVLDVA 1500 

FVIjEGSDKIG EADFNRSKEF MEEVIQRMDV GQDSIHVTVL QYSYMVTVEY PFSEAQSKGD 1560 

ILQRVREIRY QGGNRTNTGL ALRYLSDHSF LVSQGDREQA PNLVYMVTGN PASDEIKRLP 1620 

GDIQWPIGV GPNANVQELE RIGWPNAPIL IQDFETLPRE APDLVLQRCC SGEGLQIPTIj 1680 

SPAPDCSQPL DVILLLDGSS SFPASYFDEM KSFAKAFISK ANIGPRLTQV SVLQYGSITT 1740 

10 IDVPWNWPE KAHIjLSLVDV MQREGGPSQI GDALGFAVRY LTSEMHGARP GASKAWILV 1800 

TDVSVDSVDA AADAARSNRV TVFPIGIGDR YDAAQLRILA GPAGDSNWK LQRIEDLPTM 1860 

VTLGNSFLHK LCSGFVRICM DEDGNEKRPG DVWTLPDQCH TVTCQPDGQT LLKSHRVNCD 1920 

RGLRPSCPNS QSPVKVEETC GCRWTCPCVC TGSSTRHIVT FDGQNFKLTG SCSYVLFQNK 1980 

EQDLEVILHN GACSPGARQG CMKSIEVKHS ALSVELHSDM EVTVNGRLVS VPYVGGNMEV 2040 

15 NVYGAIMHEV RFNHLGHIFT FTPQNNEFQL QLSPKTFASK TYGLCGICDE NGANDFMLRD 2100 

GTVTTDWKTL VQEWTVQRPG QTCQPILEEQ CLVPDSSHCQ VLLLPLFAEC HKVLAPATFY 2160 

AICQQDSCHQ EQVCEVIASY AHLCRTNGVC VDWRTPDFCA MSCPPSIiVYN HCEHGCPRHC 2220 

DGNVSSCGDH PSEGCFCPPD KVMLEGSCVP EEACTQCIGE DGVQHQFLEA WVPDHQPCQI 2280 

CTCLSGRKVN CTTQPCPTAK APTCGLCEVA RLRQNADQCC PEYECVCDPV SCDLPPVPHC 2340 

20 ERGLQPTLTN PGECRPNFTC ACRKEECKRV SPPSCPPHRL PTLRKTQCCD EYECACNCVN 2400 

STVSCPLGYIi ASTATNDCGC TTTTCLPDKV CVHRSTIYPV GQFWEEGCDV CTCTDMEDAV 2460 

MGLRVAQCSQ KPCEDSCRSG FTYVLHEGEC CGRCLPSACE WTGSPRGDS QSSWKSVGSQ 2520 

WASPENPCLI NECVRVKEEV FIQQRNVSCP QLEVPVCPSG FQLSCKTSAC CPSCRCERME 2580 

ACMLNGTVIG PGKTVMIDVC TTCRCMVQVG VISGFKLECR KTTCNPCPLG YKEENNTGEC 2640 

25 CGRCLPTACT IQLRGGQIMT LKRDETLQDG CDTHFCKVNE RGEYFWEKRV TGCPPFDEHK 2700 

CIAEGGKIMK IPGTCCDTCE EPECNDITAR LQYVKVGSCK SEVEVDIHYC QGKCASKAMY 2760 
SIDINDVQDQ CSCCSPTRTE PMQVALHCTN GSWYHEVLN AMECKCSPRK CSK 

30 Seg ID NO: 150 DNA sequence 

Nucleic Acid Accession #: NM_001508.1 

Coding sequence: 1-1362 (underlined sequences correspond to start and stop codons) 

35 1 11 21 31 41 51 

I I I I I I 

ATG GCTTCAC CCAGCCTCCC GGGCAGTGAC TGCTCCCAAA TCATTGATCA CAGTCATGTC 60 

CCCGAGTTTG AGGTGGCCAC CTGGATCAAA ATCACCCTTA TTCTGGTGTA CCTGATCATC 120 

TTCGTGATGG GCCTTCTGGG GAACAGCGTC ACCATTCGGG TCACCCAGGT GCTGCAGAAG 180 

40 AAAGGATACT TGCAGAAGGA GGTGACAGAC CACATGGTGA GTTTGGCTTG CTCGGACATC 24 0 

TTGGTGTTCC TCATCGGCAT GCCCATGGAG TTCTACAGCA TCATCTGGAA TCCCCTGACC 300 

ACGTCCAGCT ACACCCTGTC CTGCAAGCTG CACACTTTCC TCTTCGAGGC CTGCAGCTAC 360 

GCTACGCTGC TGCACGTGCT GACGCTCAGC TTTGAGCGCT ACATCGCCAT CTGTCACCCC 420 

TTCAGGTACA AGGCTGTGTC GGGACCTTGC CAGGTGAAGC TGCTGATTGG CTTCGTCTGG 480 

45 GTCACCTCCG CCCTGGTGGC ACTGCCCTTG CTGTTTGCCA TGGGTACTGA GTACCCCCTG 54 0 

GTGAACGTGC CCAGCCACCG GGGTCTCACT TGCAACCGCT CCAGCACCCG CCACCACGAG 600 

CAGCCCGAGA CCTCCAATAT GTCCATCTGT ACCAACCTCT CCAGCCGCTG GACCGTGTTC 66 0 

CAGTCCAGCA TCTTCGGCGC CTTCGTGGTC TACCTCGTGG TCCTGCTCTC CGTAGCCTTC 720 

ATGTGCTGGA ACATGATGCA GGTGCTCATG AAAAG CC AG A AGGGCTCGCT GGCCGGGGGC 78 0 

50 ACGCGGCCTC CGCAGCTGAG GAAGTCCGAG AGCGAAGAGA GCAGGACCGC CAGGAGGCAG 84 0 

ACCATCATCT TCCTGAGGCT GATTGTTGTG ACATTGGCCG TATGCTGGAT GCCCAACCAG 900 

ATTCGGAGGA TCATGGCTGC GGCCAAACCC AAG CACG ACT GGACGAGGTC CTACTTCCGG 960 

GCGTACATGA TCCTCCTCCC CTTCTCGGAG ACGTTTTTCT ACCTCAGCTC GGTCATCAAC 1020 

CCGCTCCTGT ACACGGTGTC CTCGCAGCAG TTTCGGCGGG TGTTCGTGCA GGTGCTGTGC 1080 

55 TGCCGCCTGT CGCTGCAGCA CGCCAACCAC GAGAAGCGCC TGCGCGTACA TGCGCACTCC 1140 

ACCACCGACA GCGCCCGCTT TGTGCAGCGC CCGTTGCTCT TCGCGTCCCG GCGCCAGTCC 1200 

TCTGCAAGGA GAACTGAGAA GATTTTCTTA AGCACTTTTC AGAGCGAGGC CGAGCCCCAG 1260 

TCTAAGTCCC AGTCATTGAG TCTCGAGTCA CTAGAGCCCA ACTCAGGCGC GAAACCAGCC 1320 
AATTCTGCTG CAGAGAATGG TTTTCAGGAG CATGAAGTT T GA 
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Seq ID No: 151 Protein sequence : 
Protein Accession #: NP 001499.1 



65 1 11 21 31 41 51 

I I I I I 1 

MASPSLPGSD CSQIIDHSHV PEFEVATWIK ITLILVYLII FVMGLLGNSV TIRVTQVLQK 60 

KGYLQKEVTD HMVSLACSDI LVFLIGMPME FYSIIWNPLT TSSYTLSCKL HTFLFEACSY 120 

ATLLHVLTLS FERYIAICHP FRYKAVSGPC QVKLLIGFVW VTSALVALPL LFAMGTEYPL 180 

70 VNVPSHRGLT CNRSSTRHHE QPETSNMSIC TNLSSRWTVF QSSIFGAFW YLWLLSVAF 240 

MCWNMMQVLM KSQKGSLAGG TRPPQLRKSE SEESRTARRQ TIIFLRIilW TLAVCWMPNQ 300 

IRRIMAAAKP KHDWTRSYFR AYMILLPFSE TFFYLSSVIN PLLYTVSSQQ FRRVFVQVLC 360 

CRLSLQHANH EKRLRVHAHS TTDSARFVQR PLLFASRRQS SARRTEKIFL STFQSEAEPQ 420 
SKSQSLSLES LEPNSGAKPA NSAAENGFQE HEV 
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Nucleic Acid Accession #: none found 

Coding sequence: 3-65 {underlined sequences correspond to start and stop codons) 



TT ATT ATTTT 
CTTGAAAAGT 
CATGTAATTG 
CACATGTTAT 
CCCGCTCTTT 
TAAATGACAA 
ACATACTACC 
ATACGTTTAA 
GAGAGAAAGA 
AATATTAATA 
GAAGCCAGCT 
CACTATTTTG 
GTGGTTATAA 
GTAACCCAAT 
AATCAACTAG 
GTCATGAGTT 
TTGGCCTAGA 
CCCGTTGCCT 
ATGGAAAATT 
ATATTTTGAG 
TGTAATATTT 



11 
I 

GTGTAAACTA 
ATTCCAAGGA 
AAATTCATGC 
CAACCTCGTA 
GGGAATGCTA 
AACACAGCAG 
AGAGCTGCTT 
ATTTGCCCTA 
AAGGAGKAAA 
AATCATAATA 
GTTGGTAGGC 
CCAAAACCAA 
AGGGCATATT 
TTTACTTCTT 
ACAGTGGTTT 
CTTGAATCCC 
GAAGTGGCCA 
TCTGAAAAAC 
AATTTATTAA 
ATAAAATGTT 
AATTATTTTA 



21 



TATTCTGCTT 
ATATTATGAA 
AAGGAAACAA 
ACTTTTGGTG 
CATACCCATT 
TGTTTTGAGG 
GGAAAATTAA 
GGATTSAGCT 
CAGTGGTAAT 
TGACAAGACC 
ATTAATGAGT 
AGTAATTATA 
TACATAAATT 
TAAAAAGTCT 
GTTAAATTTA 
AGAGAAATAA 
TTTTATCAAC 
AGCAAGTTAT 
TTAATAGCCT 
GAATAAAACC 
TAAGTTTTAT 



31 
I 

ATAGAGAGTC 
AATAGGGCAA 
CTCATAGAAA 
CTTGCTGAAT 
TCTGGTATTT 
GAGAAAGGAC 
AGGCCACTTG 
AACAGCAAAA 
AAAAAAATCC 
CTCACTGAAT 
TTAAAATTGT 
ATACTGTGTC 
CTACTTTATT 
CAATTCAAGC 
GCAGCATACT 
TGCTTAGGAA 
AGGRAAAAAA 
TTCTTTATAT 
ATTATGTGTT 
ATGGATTATA 
AATAAAGTAT 



41 
I 

TCTGAGACTA 
CATGGACTGT 
AGATAAATAT 
CAGTCCATGA 
AAAAAATATC 
CATCATTTAT 
TGGCTTTTTC 
AAAAAAAAAA 
ATCTGTCTTC 
AAGAGTATTT 
TCTCAATTGA 
CTCCTGTAAT 
CCTCAACTTC 
TGGATTAGCC 
TCGTTCCCAT 
CTTCTCTCAA 
AAAATTTTCT 
AATTATCATT 
CTCACTTGCT 
GAGAAAAGTC 
TCCATTTCTT 



51 
I 

AAATTGACAA 60 

TTAAGATCTC 120 

GGATGCCCTT 180 

AAAGCTACAG 240 

TAGGAGGAGC 300 

AATGCTCTGT 360 

CTACCAACTG 420 

AAAAAAAARA 480 

TTGCTATGTT 540 

TCAGTCATCA 600 

AAAAACATCA 660 

TTTTTGAGAA 720 

TTTGATGAAT 780 

AGCTCAGCAT 840 

TCTAATTAAA 900 

TCTGCTTGGC 960 

CTACTACAAC 1020 

TTATTATTTT 1080 

TCTCTAAGTA 1140 

AAAATATATG 1200 
TATCTT 



Seq ID No: 153 Protein sequence: 
Protein Accession #: none found 



1 11 21 

1 I I 

IILCKLYSAY RESLRLKLTT 
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Seq ID NO: 154 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-36 (underlined sequences correspond to start and stop codons) 



11 



I 

CTGGATGATA 
CTCAAGGTTA 
ATCTTTGAAT 
AAATGAAGAG 
TGTTGTGTAG 
CCCTGTACAA 
TTCCTGGGAG 



TGGAAGAAAT 
AAATAGTTTA 
GGTCCCCTTG 
GAATTCACTA 
TTCTCTGTTC 
CCATACTGCC 
ATATAAAATA 



21 
I 

GGATGGGTTA 
AGTGCCAGAA 
GTTAAATACT 
ATATGCTACG 
CAGGGCTGCC 
TCTCAACACT 
CATAGGTTTA 



31 
I 

AGGTAAAAGG 
GAAAAGGTGG 
TAACTTTTGT 
TGATCTTTTG 
TTTGCTCCAC 
GTGTAATAGG 
GGCTGGCAAA 



41 

I 

CTGATCACAG 
GCACCAGCGA 
CATCAGTGTC 
TTTGTCATGA 
AAAGCACTGA 
CTAACACCGC 
AAAAAAAAAA 



51 
I 

ATGGGTTCCT 60 

ATTAAGAACC 120 

TGCATTTATG 180 

AAAGAGTTAC 240 

GAAGCAGTGG 300 

CCAGCGAACC 3 SO 
AAA 



Seq ID No: 155 Protein sequence 
Protein Accession #: none found 



1 11 

I I 

LDDMEEMDGL r 



21 



31 



41 



51 



Seq ID NO: 156 DNA sequence 

Nucleic Acid Accession #: NM_032961.1 

Coding sequence: 827-3949 (underlined sequences correspond to start and stop codons) 



CAGGCTCAGA 
GGGAAAGAGG 
GCAAGTAACG 
ATAGAGCGAA 
GAAAGTTTTA 
TTAAGCCCTT 
GCATCTCCTT 



11 
I 

GGCTGAAGCA 
CTTGGGAAGA 
GTGGAGATGA 
AGAGGAAAAA 
AACTGCAGAG 
TTGCATTTGC 
TTCCTTCTCC 



21 
I 

GGAGGAAGGA 
AAACAGCAGA 
GGACAGAGGG 
AATGTCAAGA 
CCGTTCTGTG 
CAGCCGTTGA 
TCTTCCTCTT 



31 
1 

AGGACTGGAA 
AAAGAAACTG 
AACCAAGACT 
AGAACATCCA 
CTTTTCCGGC 
CATTAAGAGG 
CTTCTTCTTC 



41 
i 

GGAAAAAGAG 
CT CATTAC AC 
CTGAAAGACA 
TCCGGAGAAA 
ACAAAATTAT 
CATGTTTAAC 
CTCCTCCTCC 



51 
i 

ACAGGTTAGA 60 

TTACAGAGAG 120 

AAAAATACAA 180 

TGAAGAGAAT 240 

ATCGCTGATT 300 

GGTGCCAACA 360 

TCCTCTTTTT 420 
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CCTCCTCCTC GTTCTCCTCC CATCAGCAAG AAGACAAACC GAGGACAGTC TTGAAATATC 480 

GAAATTTCCT CTTTGGGATT TGCCAGCGCC AAGACTGTCG GAATAAAGGA CGCTGACTAT 540 

TGTATTATTG TTATTTTATT AATTAGTCAG TGGAAAGATT ACAGATGAGG AAAGGGGACG 600 

CCTGTCACCC TTCCTGTGCT AAGATTTAAA AAAAAATGAG GCTGGATTGC GGGAAGCTCT 660 

AAAATGAAGC AAAAGGAGTA AGATTTTTAA AGACAGAAAG CCACAGGAGC CCCCACGTAG 720 

CGCACTTTTA TTTGTATTTT TTCAGATTTT TTTTTGTTTC GTGGTGGTGG GGGAGGTGAT 780 

TGGGTGGCTG ACTGGCTGCG GGAAGCTACT TCCTTTCCTT TTGGAG ATG A TTGTGCTATT 840 

ATTGTTTGCC TTGCTCTGGA TGGTGGAAGG AGTCTTTTCC CAGCTTCACT ACACGGTACA 900 

GGAGGAGCAG GAACATGGCA CTTTCGTGGG GAATATCGCT GAAGATCTGG GTCTGGACAT 960 

TACAAAACTT TCGGCTCGCG GGTTTCAGAC GGTGCCCAAC TCAAGGACCC CTTACTTAGA 1020 

CCTCAACCTG GAGACAGGGG TGCTGTACGT GAACGAGAAA ATAGACCGCG AACAAATCTG 1080 

CAAACAGAGC CCCTCCTGTG TCCTGCACCT GGAGGTCTTT CTGGAGAACC CCCTGGAGCT 1140 

GTTCCAGGTG GAGATCGAGG TGCTGGACAT TAATGACAAC CCCCCCTCTT TCCCGGAGCC 1200 

AGACCTGACG GTGGAAATCT CTGAGAGCGC CACGCCAGGC ACTCGCTTCC CCTTGGAGAG 1260 

CGCATTCGAC CCAGACGTGG GCACCAACTC CTTGCGCGAC TACGAGATCA CCCCCAACAG 1320 

CTACTTCTCC CTGGACGTGC AGACCCAGGG GGATGGCAAC CGATTCGCTG AGCTGGTGCT 1380 

GGAGAAGCCA CTGGACCGAG AGCAGCAAGC GGTGCACCGC TACGTGCTGA CCGCGGTGGA 1440 

CGGAGGAGGT GGGGGAGGAG- TAGGAGAAGG AGGGGGAGGT GGCGGGGGAG CAGGCCTGCC 1500 

CCCCCAGCAG CAGCGCACCG GCACGGCCCT ACTCACCATC CGAGTGCTGG ACTCCAATGA 1560 

CAATGTGCCC GCTTTCGACC AACCCGTCTA CACTGTGTCC CTACCAGAGA ACTCTCCCCC 1620 

AGGCACTCTC GTGATCCAGC TCAACGCCAC CGACCCGGAC GAGGGCCAGA ACGGTGAGGT 1680 

CGTGTACTCC TTCAGCAGCC ACATTTCGCC CCGGGCGCGG GAGCTTTTCG GACTCTCGCC 1740 * 

GCGCACTGGC AGACTGGAGG TAAGCGGCGA GTTGGACTAT GAAGAGAGCC CAGTGTACCA 1800 

AGTGTACGTG CAAGCCAAGG ACCTGGGCCC CAACGCCGTG CCTGCGCACT GCAAGGTGCT 1860 

AGTGCGAGTA CTGGATGCTA ATGACAACGC GCCAGAGATC AGCTTCAGCA CCGTGAAGGA 1920 

AGCGGTGAGT GAGGGCGCGG CGCCCGGCAC TGTGGTGGCC CTTTTCAGCG TGACTGACCG 1980 

CGACTCAGAG GAGAATGGGC AGGTGCAGTG CGAGCTACTG GGAGACGTGC CTTTCCGCCT 2040 

CAAGTCTTCC TTTAAGAATT ACTACACCAT CGTTACCGAA GCCCCCCTGG ACCGAGAGGC 2100 

GGGGGACTCC TACACCCTGA CTGTAGTGGC TCGGGACCGG GGCGAGCCTG CGCTCTCCAC 2160 

CAGTAAGTCG ATCCAGGTAC AAGTGTCGGA TGTGAACGAC AACGCGCCGC GTTTCAGCCA 2220 

GCCGGTCTAC GACGTGTATG TGACTGAAAA CAACGTGCCT GGCGCCTACA TCTACGCGGT 2280 

GAGCGCCACC GACCGGGATG AGGGCGCCAA CGCCCAGCTT GCCTACTCTA TCCTCGAGTG 2340 

CCAGATCCAG GGCATGAGCG TCTTCACCTA CGTTTCTATC AACTCTGAGA ACGGCTACTT 2400 

GTACGCCCTG CGCTCCTTCG ACTATGAGCA GCTGAAGGAC TTCAGTTTTC AGGTGGAAGC 2460 

CCGGGACGCT GGCAGCCCCC AGGCGCTGGC TGGTAACGCC ACTGTCAACA TCCTCATAGT 2520 

GGATCAAAAT GACAACGCCC CTGCCATCGT GGCGCCTCTA CCAGGGCGCA ACGGGACTCC 2580 

AGCGCGTGAG GTGCTGCCCC GCTCGGCGGA GCCGGGTTAC CTGCTCACCC GCGTGGCCGC 2640 

CGTGGACGCG GACGACGGCG AGAACGCCCG GCTCACTTAC AGCATCGTGC GTGGCAACGA 2700 

AATGAACCTC TTTCGCATGG ACTGGCGCAC CGGGGAGCTG CGCACAGCAC GCCGAGTCCC 2760 

GGCCAAGCGC GACCCCCAGC GGCCTTATGA GCTGGTGATC GAGGTGCGCG ACCATGGGCA 2820 

GCCGCCCCTT TCCTCCACCG CCACCCTGGT GGTTCAGCTG GTGGATGGCG CCGTGGAGCC 2880 

CCAGGGCGGG GGCGGGAGCG GAGGCGGAGG GTCAGGAGAG CACCAGCGCC CCAGTCGCTC 294 0 

TGGCGGCGGG GAAACCTCGC TAGACCTCAC CCTCATCCTC ATCATCGCGT TGGGCTCGGT 3000 

GTCCTTCATC TTCCTGCTGG CCATGATCGT GCTGGCCGTG CGTTGCCAAA AAGAGAAGAA 3060 

GCTCAACATC TATACTTGTC TGGCCAGCGA TTGCTGCCTC TGCTGCTGCT GCTGCGGTGG 3120 

CGGAGGTTCG ACCTGCTGTG GCCGCCAAGC CCGGGCGCGC AAGAAGAAAC TCAGCAAGTC 3180 

AGACATCATG CTGGTGCAGA GCTCCAATGT ACCCAGTAAC CCGGCCCAGG TGCCGATAGA 3240 

GGAGTCCGGG GGCTTTGGCT CCCACCACCA CAACCAGAAT TACTGCTATC AGGTATGCCT 33 00 

GACCCCTGAG TCCGCCAAGA CCGACCTGAT GTTTCTTAAG CCCTGCAGCC CTTCGCGGAG 3360 

TACGGACACT GAGCACAACC CCTGCGGGGC CATCGTCACC GGTTACACCG ACCAGCAGCC 3420 

TGATATCATC TCCAACGGAA GCATTTTGTC CAACGAGACT AAACACCAGC GAGCAGAGCT 3480 

CAGCTATCTA GTTGACAGAC CTCGCCGAGT TAACAGTTCT GCATTCCAGG AAGCCGACAT 354 0 

AGTAAGCTCT AAGGACAGTG GTCATGGAGA CAGTGAACAG GGAGATAGTG ATCATGATGC 3600 

CACCAACCGT GCCCAGTCAG CTGGTATGGA TCTCTTCTCC AATTGCACTG AGGAATGTAA 3660 

AGCTCTGGGC CACTCAGATC GGTGCTGGAT GCCTTCTTTT GTCCCTTCTG ATGGACGCCA 3720 

GGCTGCTGAT TATCGCAGCA ATCTGCATGT TCCTGGCATG GACTCTGTTC CAGACACTGA 3780 

GGTGTTTGAA ACTCCAGAAG CCCAGCCTGG GGCAGAGCGG TCCTTTTCCA CCTTTGGCAA 3840 

AGAGAAGGCC CTTCACAGCA CTCTGGAGAG GAAGGAGCTG GATGGACTGC TGACTAATAC 3900 

GCGAGCGCCT TACAAACCAC CATATTTGAC ACGGAAAAGG ATATG C TAG T CAATTCTACA 3960 

GGACTTACCT GAAGCAGCAT GATTTGCACA AAGTCGACCA AC AAAAG CAT CAACTTTTCA 4020 

ACTTCATTAT CTTGGCCATC CAGTTAGTCA TGTGTAACTG AGTATTAGAT TTCGGATGGA 4080 

GTCATCATGG CCAATTATAG GACCTAATTG CTCTCAGCAG GCCTGAGAAA TGAGTTGAAA 4140 

TGTGCAGAAC TGTAGAAACT TTAGAGGCAA CAGATTTTGC CTCCCCGATC AGTGTGTGCC 4200 

TGTTTACAGC ACTATCTATC TTTCTCTCTC CAAATGTCAC TGAGCCCTTT AGATGTTTAT 4260 

ATTCACGACG AGAAGCCAGT CATAAAGATA AAGGAAATTT GTGCATTATA AATGCAATAT 4320 

CACTGTTTTA AACTTGACTG TTTTATATTA TTTTTGTGTG ATCAAGTGTT CCGCAAGCTA 4380 

TTCCAACTTT ACAAGAGAAA TTGTGATTAT GTTCTTTTCA CCTGTGGGTT ATAAAAAATG 4440 

TTGTATTCTG AAGACCCACA AAATATCAAA GACATTCTGT AGTTTATACA CCGTGTTGCA 4500 

AAGTGTTTAC TGTACTATTT CAAAGCTTCT AAATAAATAT AAAATATATA TATTATATTA 4560 

TATAATTTTC CTAAAATGTG GTACAACTCA GTTGGTTTTT AAATGGATGC ATACAGTCCA 4620 

CATCATACAA TAAAATAAAA GGTAATTCAG GGTCCCAAAG ACAAACTTAC TAAGAAAAAA 4680 

TCATTAATAG TTTTCTCCCA ATTTCCATAT CTTACTCAAC CGTGTTTTTC CTTGTTTAAA 4740 

AGAAAATGAT GCTCTAAGCT ACAAAATTTT GTCAAAAACT CATATTGAAT TTTCAATGCC 4800 

AAAGATGTAG CTATTGATGT TATCAGACAG AGCACTGACT ATGTACTATC AAACTATCTA 4 860 

ACAATCTGCA TAAGTCTGAT TCTATTTCTA TGACTTTGAA TTTAGAATCA CTTAAAGCTT 4 920 

TTATAAAGAA TCGATAAATT CACCTGTATT TGTTGTTAGA AAAAAACTGG GTGTCTGTAC 4980 
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ATTTTGTGGT GTAAAATATG TAATTGAAGA TTACTATTTT AAGAAGTCAT CAGTCATATC 5040 

ACTCACACAG AATTTTATTT TACATAGTTT TGTGACTTAA TTACACATGA ATATAAAATC 5100 

TATAATTCTA TATGAATATA TAGAGATATA GAAACATCTG AACTGGTAAA GAATAACTAT 5160 

AAAATATGAA AGCTCTAAAT TTAAAATAAA TTTAGAGATA GAATCATGGT ACATTATTGT 5220 

TTCAGTATTC CATGTAAAAA TTTTATAGCT TAAATGTAGT CAGTGTTTGA TTAATGAAAA 5280 

AATTCTTCAT GAGTCAGCCT TCAAAAGTTA AGCTTGCCTT TTACTTTTAT GTCAACAATA 534 0 
TTAATTATTA AATTTAGTAA GACGCAAAAA AAAAAAAAAA AAAA 

Seq ID No: 157 Protein sequence ; 
Protein Accession #: NP 116586-1 



1 11 21 31 41 51 

I I I I I I 

MIVLLLFALL WMVEGVFSQL HYTVQEEQEH GTFVGNIAED LGLDITKLSA RGFQTVPNSR 60 

TPYLDLNLET GVLYVNEKID REQICKQSPS CVLHLEVFLE NPLELFQVEI EVLDINDNPP 120 

SFPEPDLTVE ISESATPGTR FPLESAFDPD VGTNSLRDYE ITPNSYFSLD VQTQGDGNRF 180 

AELVLEKPLD REQQAVHRYV LTAVDGGGGG GVGEGGGGGG GAGLPPQQQR TGTALLTIRV 24 0 

LDSNDNVPAF DQPVYTVSLP ENSPPGTLVI QLNATDPDEG QNGEWYSFS SHISPRAREI* 300 

FGLSPRTGRL EVSGELDYEE SPVYQVYVQA KDLGPNAVPA HCKVLVRVLD ANDNAPEISF 360 

STVKEAVSEG AAPGTWALF SVTDRDSEEN GQVQCELLGD VPFRLKSSFK NYYTIVTEAP 420 

LDREAGDSYT LTWARDRGE PALSTSKSIQ VQVSDVNDNA PRFSQPVYDV YVTENNVPGA 480 

YIYAVSATDR DEGANAQLAY SILECQIQGM SVFTYVSINS ENGYLYALRS FDYEQLKDFS 540 

FQVEARDAGS PQAIiAGNATV NILIVDQNDN APAIVAPLPG RNGTPAREVL PRSAEPGYLL 600 

TRVAAVDADD GENARLTYSI VRGNEMNLFR MDWRTGELRT ARRVPAKRDP QRPYELVIEV 660 

RDHGQPPLSS TATLWQLVD GAVEPQGGGG SGGGGSGEHQ RPSRSGGGET SLDLTLILII 720 

ALGSVSFIFL LAMIVLAVRC QKEKKLNIYT ClaASDCCLCC CCCGGGGSTC CGRQARARKK 780 

KLSKSDIMLV QSSNVPSNPA QVPIEESGGF GSHHHNQNYC YQVCLTPESA KTDLMFLKPC 840 

SPSRSTDTEH NPCGAIVTGY TDQQPDIISN GSILSNETKH QRAELSYLVD RPRRVNSSAF 900 

QEADIVSSKD SGHGDSEQGD SDHDATNRAQ SAGMDLFSNC TEECKALGHS DRCWMPSFVP 960 

SDGRQAADYR SNLHVPGMDS VPDTEVFETP EAQPGAERSF STFGKEKALH STLERKELDG 1020 
LLTNTRAPYK PPYLTRKRIC 

Seq ID NO: 158 DNA sequence 

Nucleic Acid Accession #: NM_022159.1 

Coding sequence: 70-1890 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GTGAAATTTA AACTCCAGTC CTGTGGCGAA AATGCTAATT GCACTAACAC AGAAGGAAGT 60 

TATTATTGT A TG TGTGTACC TGGCTTCAGA TCCAGCAGTA ACCAAGACAG GTTTATCACT 120 

AATGATGGAA CCGTCTGTAT AGAAAATGTG AATGCAAACT GCCATTTAGA TAATGTCTGT 180 

ATAGCTGCAA ATATTAATAA AACTTTAACA AAAATCAGAT CCATAAAAGA ACCTGTGGCT 24 0 

TTGCTACAAG AAGTCTATAG AAATTCTGTG ACAGATCTTT CACCAACAGA TATAATTACA 300 

TATATAGAAA TATTAGCTGA ATCATCTTCA TTACTAGGTT ACAAGAACAA CACTATCTCA 360 

GCCAAGGACA CCCTTTCTAA CTCAACTCTT ACTGAATTTG TAAAAACCGT GAATAATTTT 420 

GTTCAAAGGG ATACATTTGT AGTTTGGGAC AAGTTATCTG TGAATCATAG GAGAACACAT 4 80 

CTTACAAAAC TCATGCACAC TGTTGAACAA GCTACTTTAA GGATATCCCA GAGCTTCCAA 540 

AAGACCACAG AGTTTGATAC AAATTCAACG GATATAGCTC TCAAAGTTTT CTTTTTTGAT 600 

TCATATAACA TGAAACATAT TCATCCTCAT ATGAATATGG ATGGAGACTA CATAAATATA 66 0 

TTTCCAAAGA GAAAAGCTGC ATATGATTCA AATGGCAATG TTGCAGTTGC ATTTTTATAT 720 

TATAAGAGTA TTGGTCCTTT GCTTTCATCA TCTGACAACT TCTTATTGAA ACCTCAAAAT 780 

TATGATAATT CTGAAGAGGA GGAAAGAGTC ATATCTTCAG TAATTTCAGT CTCAATGAGC 84 0 

TCAAACCCAC CCACATTATA TGAACTTGAA AAAATAACAT TTACATTAAG TCATCGAAAG 900 

GTCACAGATA GGTATAGGAG TCTATGTGCA TTTTGGAATT ACTCACCTGA TACCATGAAT 960 

GGCAGCTGGT CTTCAGAGGG CTGTGAGCTG ACATACTCAA ATGAGACCCA CACCTCATGC 1020 

CGCTGTAATC ACCTGACACA TTTTGCAATT TTGATGTCCT CTGGTCCTTC CATTGGTATT 1080 

AAAGATTATA ATATTCTTAC AAGGATCACT CAACTAGGAA TAATTATTTC ACTGATTTGT 114 0 

CTTGCCATAT GCATTTTTAC CTTCTGGTTC TTCAGTGAAA TTCAAAGCAC CAGGACAACA 1200 

ATTCACAAAA ATCTTTGCTG TAGCCTATTT CTTGCTGAAC TTGTTTTTCT TGTTGGGATC 1260 

AATACAAATA CTAATAAGCT CTTCTGTTCA ATCATTGCCG GACTGCTACA CTACTTCTTT 1320 

TTAGCTGCTT TTGCATGGAT GTGCATTGAA GGCATACATC TCTATCTCAT TGTTGTGGGT 1380 

GTCATCTACA ACAAGGGATT TTTGCACAAG AATTTTTATA TCTTTGGCTA TCTAAGCCCA 1440 

GCCGTGGTAG TTGGATTTTC GGCAGCACTA GGATACAGAT ATTATGGCAC AACCAAAGTA 1500 

TGTTGGCTTA GCACCGAAAA CAACTTTATT TGGAGTTTTA TAGGACCAGC ATGCCTAATC 1560 

ATTCTTGTTA ATCTCTTGGC TTTTGGAGTC ATCATATACA AAGTTTTTCG TCACACTGCA 1620 

GGGTTGAAAC CAGAAGTTAG TTGCTTTGAG AACATAAGGT CTTGTGCAAG AGGAGCCCTC 1680 

GCTCTTCTGT TCCTTCTCGG CACCACCTGG ATCTTTGGGG TTCTCCATGT TGTGCACGCA 1740 

TCAGTGGTTA CAGCTTACCT CTTCACAGTC AGCAATGCTT TCCAGGGGAT GTTCATTTTT 1800 

TTATTCCTGT GTGTTTTATC TAGAAAGATT CAAGAAGAAT ATTACAGATT GTTCAAAAAT 1860 

GTCCCCTGTT GTTTTGGATG TTTAAGGTAA ACATAGAGAA TGGTGGATAA TTACAACTGC 1920 

ACAAAAATAA AAATTCCAAG CTGTGGATGA CCAATGTATA AAAATGACTC ATCAAATTAT 1980 

CCAATTATTA ACTACTAGAC AAAAAGTATT TTAAATCAGT TTTTCTGTTT ATGCTATAGG 2040 

AACTGTAGAT AATAAGGTAA AATTATGTAT CATATAGATA TACTATGTTT TTCTATGTGA 2100 

AATAGTTCTG TCAAAAATAG TATTGCAGAT ATTTGGAAAG TAATTGGTTT CTCAGGAGTG 2160 
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ATATCACTGC ACCCAAGGAA AGATTTTCTT TCTAACACGA GAAGTATATG AATGTCCTGA 2220 

AGGAAACCAC TGGCTTGATA TTTCTGTGAC TCGTGTTGCC TTTGAAACTA GTCCCCTACC 2280 

ACCTCGGTAA TGAGCTCCAT TACAGAAAGT GGAACATAAG AGAATGAAGG GGCAGAATAT 2340 

CAAACAGTGA AAAGGGAATG ATAAGATGTA TTTTGAATGA ACTGTTTTTT CTGTAGACTA 2400 

GCTGAGAAAT TGTTGACATA AAATAAAGAA TTGAAGAAAC ACATTTTACC ATTTTGTGAA 2460 

TTGTTCTGAA CTTAAATGTC CACTAAAACA ACTTAGACTT CTGTTTGCTA AATCTGTTTC 2520 
TTTTTCTAAT ATTCTAAAA 

Seq ID No: 159 Protein sequence: 
Protein Accession NP 071442.1 



1 11 21 31 41 51 

I I ! I I I 

MCVPGFRSSS NQDRFITNDG TVCIENVNAN CHLDNVCIAA NINKTLTKIR SIKEPVALLQ 60 

EVYRNSVTDL SPTDIITYIE ILAESSSLLG YKNNTISAKD TLSNSTLTEF VKTVNNFVQR 120 

DTFWWDKLS VNHRRTHLTK LMHTVEQATL RISQSFQKTT EFDTNSTDIA LKVFFFDSYN 180 

MKHIHPHMNM DGDYINIFPK RKAAYDSNGN VAYA FLYYKS IGPLLSSSDN FLLKPQNYDN 240 

SEEEERVISS VISVSMSSNP PTLYELEKIT FTLSHRKVTD RYRSLCAFWN YSPDTMNGSW 3 00 

SSEGCEIiTYS NETHTSCRCN HLTHFAILMS SGPSIGIKDY NILTRITQLG IIISLICLAI 360 

CIFTFWFFSE IQSTRTTIHK NLCCSLFLAE LVFLVGINTN TNKLFCSIIA GLLHYFFLAA 420 

FAWMCIEGIH LYLIWGVIY NKGFLHKNFY IFGYLSPAW VGFSAALGYR YYGTTKVCWL 480 

STENNFIWSF IGPACIillLV NLLAFGVIIY KVFRHTAGLK PEVSCFENIR SCARGAliALL 540 

FLLGTTWIFG VLHWHASW TAYLFTVSNA FQGMFIFLFL CVLSRKIQEE YYRLFKNVPC 600 
CFGCLR 



Seq ID NO: 160 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-216 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

i I I i i i 

TGTCTGCTTA TGCGGTGGCT CGCTGCTCAG AACAGGATGG CAGAGATGAG CACCACCATC 60 

AAAAACTCAA GGACCAGTGC TGTGGGTCCA GTCATCTGTT TCATGGAATT CACCAGTCTG 120 

GTATCTTCAA AATCCAGAAG GATGATGGCA GATGGCAGGA AGGAGGAAGA GGGTAATCTG 180 

GAAGAGTTTC CTGACCTACT CTGCTGCTGT GAT TAAA CAA CCACCAGGAA ATTTTGATGA 240 

CACTGTTCTC CTGAGCTCCT CCCTTTCCTC GGGGAAGAAA AGCATTGAAA CTACAAAAAT 300 

AAAGTGTTAT TTGGCTGGAG TGAGGTCTCA TGTCTGCTTA TGCGGTGGCT CGCTGCTCAG 360 

AACAGGGAAC CATTGGAGAT ACTCATTACT CTTTGAAGGC TTACAGTGGA ATGAATTCAA 420 

ATACGACTTA TTTGAGGAAT TGAAGTTGAC TTTATGGAGC TGATAAGAAT CTTCTTGGAG 4 80 

AAAAAAAGAC TGGTACTTCT GAATTAACCA AAATCACAGT ATTCTGAAGA TGATTCTACA 54 0 

AAGCCTGCTG TTTCTACAAA GGCTGCTGAT GATTTCTACA AAGCCTGCTG TAGTGTTGCT 600 

GTGGCCTCTG CTTAAAAAAG TAGAAAACAC ATTGATGCAG CATGTTCACC CCAACCTCCC 660 

TGCCTAAAGG CTCAGGGACC ATCTTGGAAG AGGAAGGCGC GTGAGATTGT AAGAGCCGAA 72 0 

TTAGGGGGAT GGAGTGTGGA GAATAAGGAC ACTTCATCTT GGATGCTCAC CTGCCAAATT 780 

GACTTCTGAT GAAAGCCAGC TCCAGAAATG TGCCTACAGT TACTACTTTC ACCTAAACCC 84 0 

TGCCCTTAGT CAAATCCTTC TCTTCTTCTA AG CAATC AAC TTCAATTCCT TGTATAACCC 900 

ACAGTATAAA AGGGCTTTTA TACCATTCTA TCCTATTGCA TGTAAGCCTT GGGTCTGGGA 960 

GGTAACAGTG TGGGATTCCA CCATCTCATC TCCCTGCCAC CCAAACATGC CTGCTCTTCT 1020 
TTAAGCAATA TTAAATGTTT GTACTTCA 

Seq ID No: 161 Protein sequence: 
Protein Accession #: none found 



1 11 21 31 41 51 

I I I i I I 

CLLMRWLAAQ NRMAEMSTTI KNSRTSAVGP VTCFMEFTSL VSSKSRRMMA DGRKEEEGNL 60 
EEFPDLLCCC D 

Seq ID NO: 162 DNA sequence 

Nucleic Acid Accession # : none found 

Coding sequence: 1-159 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAGACCCTCC AGAGGCAGGG CCCAGGATTG AAGAGGGAAG CCCTGCTCCA CACGTGTTCA 60 

TCAGGAAGGA CCCACAGACT GCTGCTCCTG GAGGCCTCTC GGTTTATGGA TGTGTGTTTG 120 

TTCCATAAAC CCTCAGAGGG TCACCTGGAG ACCCGC TAAA ATGCAGGTTC TTGGGCCACA 180 

TCCTAGACCT TCTGACCGAC CCAGGGAGTG GGGCCCAGGA AGCTGCATTT GACAGATATC 240 

CCCGTGTGAT CATCATGCAC ACAGGAGTGA GAGAACCAGT GTTCTCCCCG GGCAGAAGGG 300 

AAGCTCGTGT GCAGGACACC TCACACCTCC TTTCCCATTC CCCTGCCAGG CTCTCCCTGC 3 60 

TGACATTGTT TTTGCGGGAG AGCTGTGAAT TCTGAAGATT AGGTTGCTTC TCACCCCAAG 420 
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CTCCAGAAGT CCAGGCTGAG CCAAACCAAG 
GGAGGTGAGG GGACTGACTA CTTGAAGATC 
GCACCAAGGA AAGGCCATGC AAGGACACAG 
GAGCCTTCAC TAGAAACCAA ATCAGCCAGA 
5 GATGTGAAAA AATAAATTTC TGTTGATTAA 



CTTCAAGTTG TGCCTGGACT TGGAGAACCA 480 

ACATGGAGGA GGAGTCTGAT CCAGGCCCAG 540 

GGAGAAGGGC AGCTGTCTGT AAGCCAGAAA 600 

ACCTTCATCT TGGACTTTCC AGCCTTCAGA 660 
CCTAAAAAA 



10 



Seq ID No: 163 Protein sequence: 
Protein Accession #: none found 
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I I I I I I 

ETLQRQGPGL KREALLHTCS SGRTHRLLLL EASRFMDVCL FHKPSEGHIjE TR 

Seq ID NO: 164 DNA sequence 

Nucleic Acid Accession #: NM_020241.1 

Coding sequence: 4-1557 (underlined sequences correspond to start and stop codons) 



50 



I 

GCCATGCAGA 
CTGGGGGG.CG 
GACTACCTGA 
GAAGGTGCTG 
GGGGACAGGG 
TACCAGAGGA 
GGCAAACAGG 
ACGCTCTTTG 
ACCCTGCAGC 
CACGCCAATG 
CTAGCCATTG 
AAACATGACT 
CATGTCTACT 
GTGTCCCGCG 
AAGCAGTGGA 
TTCTACTTCA 
GTCCTGGCCG 
GACCTGACAC 
TCCATCTGGA 
GCCCCCGGGA 
AAGACCCACC 
CGGACCCTGA 
GGCAACCAGA 
CGGCCCAATG 
AGGGTGTGTG 
CGACGCTGGG 
CCCCCCACTC 
AGGGCCTGCC 
CGGCGAAGGT 
GCCACCCGTC 
G 



11 
I 

CCCCGCGAGC 
CCCACGGCCT 
ACCACTATCC 
ACGACCTCAA 
ACAACCTCTA 
AGCTGACCTG 
AGGGCGAGTG 
TGTGCGGTTC 
CCGTCGGAGA 
TTGCCCTCTT 
ATGCTGTCAT 
CCAAGTGGTT 
TCTTCTTCCG 
TGGCCCGAGT 
CGTCCTTCCT 
ACGTGCTGCA 
TTTTTTC CAC 
AGGTGGCAGC 
CGCCGGTGCC 
TGCAGTACAA 
CTCTGATGGA 
TGAGGCACCA 
CCGTTGTCTT 
CCAGCACCTC 
TCCACGAGCG 
GCTTCCAGAA 
TGCAGAGGGA 
CGGAAGTCAC 
GGGTGGGGCC 
CCCTTGTGAC 



21 



GTCCCCTCCC 
CTTTCCTGAG 
CGTGTTTGTG 
CATCCAGCGA 
CCGCGTAGAG 
GAGATCTAAC 
TCGAAACTTC 
CAACGCCTTC 
CAACATCAGC 
CTCTGACGGG 
CTACCGCAGC 
CAAAGAGCCT 
GGAGATTGCG 
GTGCAAGAAC 
GAAGGCGCGG 
GGCTGTCACG 
GCCCAGCAAC 
TGTGTTTGAA 
GGAGGATCAG 
TGCCTCCAGC 
CGAAGCGGTG 
GCTGACTCGA 
CCTGGGTTCT 
AGGGACGTCT 
ACGATCGTGG 
GGCCCGGGGG 
AGCGGGGACA 
ATCGGCAGCA 
CCTCTGTAAA 
CTCCCCCCTC 



31 
I 

CGCCCGGCCC 
GAGCCGCCGC 
GGCAGCGGGC 
GTCCTGCGGG 
TTGGAGCCCC 
CCCAGCGACA 
GTAAAGGTGC 
AACCCGGTGT 
GGTATGGCCC 
ATGCTCTTCA 
CTCGGGGACA 
TACTTTGTCC 
ATGGAGTTTA 
GACGTGGGAG 
CTCAACTGCT 
GGCGTGGTCA 
AGCATCCCTG 
GGCCGCTTCC 
GTGCCTCGAC 
GCCTTGCCGG 
CCCTCGCTGG 
GTGGCTGTGG 
GAGGCGGGGA 
GGGCGTGTGT 
TGGCCCCAGC 
CCTCCGAGGT 
ATGCCGGGGT 
GCTGTCTAAA 
TACGGCCCCA 
TGACCTCCAG 



41 
I 

TCCTGCTTCT 
CGCTTAGCGT 
CCGGACGCCT 
TCAACAGGAC 
CCACGTCCAC 
TAAACGTGTG 
TGCTCCTTCG 
GCGCCAACTA 
GCTGCCCGTA 
CAGCTACTGT 
GGCCCACCCT 
ATGCGGTGGA 
ACTACCTGGA 
GCTCCCCCCG 
CTGTACCCGG 
GCCTCGGGGG 
GCTCGGCTGT 
GAGAGCAGAA 
CCCGGCCCGG 
ATGACATCCT 
GCCATGCGCC 
ACGTGGGAGC 
CGGTCCTCAA 
GTCAAGTGGG 
GGCCTGGGCG 
GCCGGTTAGG 
TTCAGGCAGG 
GGGCTTGGGG 
GGGTGGTGAG 
CTGACCATGC 



51 

I 

GCTGCTGCTA 60 

GGCCCCCAGG 120 

GACCCCCGCA 180 

GCTGTTCATT 240 

GGAGCTGCGG 300 

TCGGATGAAG 360 

GGACGAGTCC 420 

CAGCATAGAC 480 

CGACCCCAAG 540 

TACCGACTTC 600 

GCGCACCGTG 660 

GTGGGGCAGC 720 

GAAGGTGGTG 780 

CGTGCTGGAG 84 0 

AGACTCCCAT 900 

CCGGCCCGTG 960 

CTGCGCCTTT 1020 

GTCCCCCGAG 1080 

GTGCTGCGCA 114 0 

CAACTTTGTC 1200 

CTGGATCCTG 1260 

CGGCCCCTGG 1320 

GTTCCTCGTC 1380 

CCACGCGTGC 144 0 

TTGGCTGAGC 1500 

AGTTTGAACC 1560 

AGACACGAGG 1620 

GCCTGGGGGG 1680 

AGAGTCCCAT 174 0 

ATGCCACGTG 1800 



Seq ID No: 165 Protein sequence: 
55 Protein Accession #: NP 064626.1 



60 



65 



70 



75 



I 

MQTPRASPPR 
GADDLNIQRV 
KQEGECRNFV 
ANVALFSDGM 
VYFFFREIAM 
YFNVLQAVTG 
IWTPVPEDQV 
TLMRHQLTRV 
VCVHERRSWW 



11 
I 

PALLLLLLLL 
LRVNRTLFIG 
KVLLLRDEST 
LFTATVTDFL 
EFNYLEKVW 
WSLGGRPW 
PRPRPGCCAA 
AVDVGAGPWG 
PQRPGRWLSR 



21 
I 

GGAHGLFPEE 
DRDNLYRVEL 
LFVCGSNAFN 
AIDAVIYRSIt 
SRVARVCKND 
LAVFSTPSNS 
PGMQYNASSA 
NQTWFLGSE 
RWGFQKARGP 



31 



41 



51 



PPPLSVAPRD 


YLNHYPVFVG 


SGPGRLTPAE 


60 


EPPTSTELRY 


QRKLiTWRSNP 


SDINVCRMKG 


120 


PVCANYSIDT 


LQPVGDNISG 


MARCPYDPKH 


180 


GDRPTLRTVK 


HDSKWFKEPY 


FVHAVEWGSH 


240 


VGGSPRVLEK 


QWTSFLKARL 


NCSVPGDSHF 


300 


IPGSAVCAFD 


LTQVAAVFEG 


RFREQKSPES 


360 


LPDDILNFVK 


THPLMDEAVP 


SLGHAPWILR 


420 


AGTVLKFLVR 


PNASTSGTSG 


RVCQVGHACR 


480 


PRCRLGV 









Seq ID NO; 166 DNA sequence 

Nucleic Acid Accession #: NM_032108.1 

Coding sequence: 39-2705 (underlined sequences correspond to start and stop codons) 



51 



1 11 21 31 41 

| I I I I I 

TCCGAGGCGT CACCTCCTCC TGTCGCCTGG CCCTCGCCATJSCAGACCCCG CGAGCGTCCC 

CTCCCCGCCC GGCCCTGCTG CTTCTGCTGC TGCTACTGGG GGGCGCCCAC GGCCTCTTTC 



60 
120 



253 



WO 02/079492 



PCT/US02/049.15 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



CTGAGGACCC 
TTGTGGGCAG 
AGCGAGTCCT 
TAGAGCTGGA 
CTAACCCCAG 
ACTTCGTAAA 
CCTTCAACCC 
TCAGCGGTAT 
ACGGGATGCT 
GCAGCCTCGG 
AGCCTTACTT 
TTGCGATGGA 
AGAACGACGT 
CGCGGCTCAA 
TCACGGGCGT 
GCAACAGCAT 
TTGAAGGCCG 
ATCAGGTGCC 
CCAGCGCCTT 
CGGTGCCCTC 
CTCGAGTGGC 
GTTCTGAGGC 
CGTCTGGGCT 
GGCCCGGCGG 
GGGGCCTGCT 
AGTACTCGGG 
CCGACGGCTC 
CCGGGGCCAG 
AGGACCGCGC 
TGGTGGGAGC 
GGGAGCTGGC 
TGCTGAGCGT 
GCGGTGGCGG 
GCTGGGCCAA 
CCACGCCCGA 
ACGCCCTGGG 
CCTCCCTCCT 
CGACCCCCGA 
CGCTCACCCC 
TGGACCCAGC 
GCCTGAGGAG 
CGTTCAACAG 
GCACAGACTT 
CCTAGG CCGG 
ACGGTGCCAG 
ACCCACCCGC 
CCCTACCCGG 
CTATGGATTT 
CTTTTGCGGT 
GGCTTGGACG 
CTGGAAAGGT 
GCCGTGTTCG 
GCTGTGGGCG 
CGTGGTGGCC 
GGGGGTCTCC 
GGACAATGCC 
CAGCAGCTGT 



GCCGCCGCTT 
CGGGCCCGGA 
GCGGGTCAAC 
GCCCCCCACG 
CGACATAAAC 
GGTGCTGCTC 
GGTGTGCGCC 
GGCCCGCTGC 
CTTCACAGCT 
GGACAGGCCC 
TGTCCATGCG 
GTTTAACTAC 
GGGAGGCTCC 
CTGCTCTGTA 
GGTCAGCCTC 
CCCTGGCTCG 
CTTCCGAGAG 
TCGACCCCGG 
GCCGGATGAC 
GCTGGGCCAT 
TGTGGACGTG 
GGGGACGGTC 
CAGTGTCTTC 
TGGCGAGACA 
GGCTGCCTTC 
GTGTATGAAG 
CTGCATCTTC 
CACCTCAGGC 
GGGGCTGGTG 
CGTGGTGTCC 
CCGGCGCAAG 
CAGCCGCCTG 
TGGCGCCGGG 
GGCCACGCTG 
GCAGACGCCG 
CCCCCGCGCC 
GCTGCTGGCG 
CGGCCGCCTC 
CCACGCCAGC 
CTCAGCCGCC 
GCCACTGGGC 
CGGCGAGGCC 
GGCCCACCTC 
GGGCCCCCCG 
AACGCCGGGG 
GGAGTGGGGG 
GGCCGCAGGA 
GAGGTTGACC 
TTTCTAACCA 
CCGGTGGGGA 
CCCTCCCCAA 
TGTGCAAGGG 
TGTGTGTCAA 
CCAGCGGCCT 
GAGGTGCCGG 
GGGGTTTCAG 
CTAAAGGGCT 



AGCGTGGCCC 
CGCCTGACCC 
AGGACGCTGT 
TCCACGGAGC 
GTGTGTCGGA 
CTTCGGGACG 
AACTACAGCA 
CCGTACGACC 
ACTGTTACCG 
ACCCTGCGCA 
GTGGAGTGGG 
CTGGAGAAGG 
CCCCGCGTGC 
CCCGGAGACT 
GGGGGCCGGC 
GCTGTCTGCG 
CAGAAGTCCC 
CCCGGGTGCT 
ATCCTCAACT 
GCGCCCTGGA 
GGAGCCGGCC 
CTCAAGTTCC 
CTGGAGGAGT 
GGGCAGCGGC 
CCCCGCTGCG 
AACTGTATCG 
CTCAGCCCGG 
TTAGGGGACT 
TCGGTGAACC 
GGCTTCAGCG 
GACAAGGAGG 
GGCGAGCGCA 
GTTCCCCCGG 
CTGCAGGGCG 
CTGCCGCAGA 
TGGGACCACG 
CCCGCCCGGG 
TATGCTGCCC 
CCGGACCGCC 
GATGGCCTCC 
CCCCACGCCC 
CGGCCTGGGG 
CTCCCCTATG 
ATGCCTTGGC 
CCCGGGGCAA 
GCCCCCTCCG 
CGCTGAGACG 
TTATGCGCGT 
ATTGCACAAC 
ATGGGGGGCC 
CCCAGGCCCC 
GCCGGGGAGG 
GTGGGCCACG 
GGGCGTTGGC 
TTAGGAGTTT 
GCAGGAGACA 
TGGGGGCCTG 



CCAGGGACTA 
CCGCAGAAGG 
TCATTGGGGA 
TGCGGTACCA 
TGAAGGGCAA 
AGTCCACGCT 
TAGACACCCT 
CCAAGCACGC 
ACTTCCTAGC 
CCGTGAAACA 
GCAGCCATGT 
TGGTGGTGTC 
TGGAGAAGCA 
CCCATTTCTA 
CCGTGGTCCT 
CCTTTGACCT 
CCGAGTCCAT 
GCGCAGCCCC 
TTGTCAAGAC 
TCCTGCGGAC 
CCTGGGGCAA 
TCGTCCGGCC 
TTGAGACCTA 
TGCTGAGCTT 
TGGTCCGAGT 
GCAGTCAGGA 
GCACCAGAGC 
GCACAGGACT 
TGCTGGTAAC 
TGGGCTGGTT 
CCATCCTGGC 
GGGCGCAGGG 
AGGCCCTGCT 
GGCCCCACGA 
AGCGCCTGCC 
GCCACCCCCT 
CCCCCGAGCA 
GGCCCGGCCG 
GGCGGGTGGT 
CGCGGCCCTG 
CTCCGGCCGC 
ACCGCCACCG 
GGGGGGCGGA 
AGTGCCAGCC 
CTCCGAGTGG 
CCACAAGGAA 
GTTTGGGGGT 
AGGTTTTGGT 
TCCGTTCTCG 
ACAGCTGCAG 
TGGCGTGTGT 
TGGGCGTGTG 
CGTGCAGGGT 
TGAGCCGACG 
GAACCCCCCC 
CGAGGAGGGC 
GGGGGCGGCG 



CCTGAACCAC 
TGCTGACGAC 
CAGGGACAAC 
GAGGAAGCTG 
ACAGGAGGGC 
CTTTGTGTGC 
GCAGCCCGTC 
CAATGTTGCC 
CATTGATGCT 
TGACTCCAAG 
CTACTTCTTC 
CCGCGTGGCC 
GTGGACGTCC 
CTTCAACGTG 
GGCCGTTTTT 
GACACAGGTG 
CTGGACGCCG 
CGGGATGCAG 
CCACCCTCTG 
CCTGATGAGG 
CCAGACCGTT 
CAATGCCAGC 
CCGGCCGGAC 
GGAGCTGGAC 
GCCTGTGGCT 
CCCCTACTGC 
CGCCTTTGAG 
CCTGCGGGCC 
GTCGTCGGTG 
CGTGGGCCTC 
GCACGGGGCG 
TCCCGGGGGC 
GGCGCCCCTG 
CCTGGACTCG 
CACTCCGCAC 
GCTCCCGGCC 
GCCCCCCGCG 
CGCCTCCCAC 
GTCCGCGCCC 
GAGCCCGCCC 
CACCCTGCGC 
CGGCTGCCAC 
CAGGACTGCG 
ACGGGAACCA 
GTGCTCAAGT 
GCACAACCAG 
GGGTGGGCGG 
TTTTTTTGCA 
GGGTGGCGGC 
ACCTAAGCCC 
GGGTGTGCGT 
TGTGCGTGCC 
GTGTGTCCAC 
CTGGGGCTTC 
CACTCTGCAG 
CTGCCCGGAA 
AAAG 



TATCCCGTGT 180 

CTCAACATCC 240 

CTCTACCGCG 300 

ACCTGGAGAT 360 

GAGTGTCGAA 420 

GGTTCCAACG 480 

GGAGACAACA 540 

CTCTTCTCTG 600 

GTCATCTACC 660 

TGGTTCAAAG 720 

TTCCGGGAGA 780 

CGAGTGTGCA 840 

TTCCTGAAGG 900 

CTGCAGGCTG 960 

TCCACGCCCA 1020 

GCAGCTGTGT 1080 

GTGCCGGAGG 1140 

TACAATGCCT 1200 

ATGGACGAGG 1260 

CACCAGCTGA 1320 

GTCTTCCTGG 1380 

ACCTCAGGGA 1440 

AGGTGTGGAC 1500 

GCAGCTTCGG 1560 

CGCTGCCAGC 1620 

GGGTGGGCCC 1680 

CAGGACGTGT 1740 

AGCCTCTCCG 1800 

GCGGCCTTCG 1860 

CGTGAGCGGC 1920 

GGCGAGGCGG 1980 

CGGGGCGGAG 2040 

ATGCAGAACG 2100 

GGGCTGCTGC 2160 

CCGCACCCCC 2220 

TCCGCTTCAT 2280 

CCTGGGGAGC 2340 

GGCGACTTCC 2400 

ACGGGCCCCT 2460 

CCGACGGGCA 2520 

CGCACCCACA 2580 

GCCCGGCCGG 2640 

CCCCCCGTGC 2700 

GGAGCGAGAG 2760 

CCCCCCCGCG 2820 

CTCGCCCTCC 2880 

GAGGACTTTG 2 94 0 

GTTTTGGTTT 3000 

AGGCAGGGGA 3060 

TCCCCCACCC 3120 

GCGTGTGCGT 3180 

AGCGAAGGCT 324 0 

GAGCGACGAT 3300 

CAGAAGGCCC 3360 

AGGGAAGCGG 3420 

GTCACATCGG 3480 



60 Seg ID No: 167 Protein sequence : 
Protein Accession #: NP 115484.1 



11 



65 



70 



75 



i 

MQTPRASPPR 
GADDLNIQRV 
KQEGECRNFV 
ANVALFSDGM 
VYFFFREIAM 
YFNVLQAVTG 
IWTPVPEDQV 
TLMRHQLTRV 
YRPDRCGRPG 
DPYCGWAPDG 
TSSVAAFWG 
GPGGRGGGGG 



PALLLLLLLL 
LRVNRTLFIG 
KVLLLRDEST 
LFTATVTDFL 
EFNYLEKVW 
WSLGGRPW 
PRPRPGCCAA 
AVDVGAGPWG 
GGETGQRLLS 
SCIFLSPGTR 
AWSGFSVGW 
GG AG VP PEAL 



21 
I 

GGAHGLFPED 
DRDNLYRVEL 
LFVCGSNAFN 
AIDAVIYRSL 
SRVARVCKND 
LAVFSTPSNS 
PGMQYNASSA 
NQTWFLGSE 
LELDAASGGIi 
AAFEQDVSGA 
FVGLRERREL 
LAPLMQNGWA 



31 



41 



PPPLSVAPRD 
EPPTSTELRY 
PVCANYSIDT 
GDRPTLRTVK 
VGGSPRVLEK 
IPGSAVCAFD 
LPDDILNFVK 
AGTVLKFLVR 
LAAFPRCWR 
STSGLGDCTG 
ARRKDKEAIL 
KATLLQGGPH 



YLNHYPVFVG 
QRKLTWRSNP 
LQPVGDNISG 
HDSKWFKEPY 
QWTSFLKARL 
LTQVAAVFEG 
THPLMDEAVP 
PNASTSGTSG 
VPVARCQQYS 
LLRASLSEDR 
AHGAGEAVLS 
DLDSGLLPTP 



51 
I 

SGPGRLTPAE 60 

SDINVCRMKG 120 

MARCPYDPKH 180 

FVHAVEWGSH 240 

NCSVPGDSHF 300 

RFREQKSPES 360 

SLGHAPWILR 420 

LSVFLEEFET 4 80 

GCMKNCIGSQ 540 

AGLVSVNLLV 600 

VSRLGERRAQ 660 

EQTPLPQKRL 72 0 



259 



WO 02/079492 



PCT/US02/04915 



PTPHPHPHAL GPRAWDHGHP LLPASASSSL LLLAPARAPE QPPAPGEPTP DGRLYAARPG 780 
RASHGDPPLT PHASPDRRRV VSAPTGPLDP ASAADGLPRP WSPPPTGSLR RPLGPHAPPA 840 
ATLRRTHTFN SGEARPGDRH RGCHARPGTD LAHLLPYGGA DRTAPPVP 



Seq ID NO: 168 DNA sequence 
Nucleic Acid Accession #: AW205664 

Coding sequence: 1-135 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CGGCACGAGG AGAACAGGGG CCTCTGCCTC AGTTTGCCCG GGAGCCAGCC AGGGCCCATC 60 

CTAATTTGGA GCACAGTCTT CCCGGTGCCT AGACATG CCA AGGCCCCTCC CACGTGGTAC 120 

ACCCTCTCCG TT TAG TACCT GACCACCTGT TTCAAAACGC AGGTGTTTCT GGTTTAGAAA 180 

CTTGGAAGGC GGAATGTGTT TTCGTGTCTT CTAGGAAGGG TCTGCTGAGG ACCAGACCAC 240 

GTAAGCCTGA GTGGATCCTG ACTCAGCTGC AGCCCTTACC TGCCTCGTGC TGATGATCTA 300 

TGCATGGCGT TATGTAGATC ACGTGCGGCA GAGACAGCCA CTGTCCTGTG TGCGGGTTTT 360 
TAAAACAGCT GCCCTGGATG AAACGGAATA AACCAGTGAT GCTAAAAAAA AAAAAAAAAA 

Seq ID No: 169 Protein sequence: 
Protein Accession #: AW205664 



1 11 ■ 21 31 41 51 

I I I I I I 

RHEENRGLCL SLPGSQPGPI LIWSTVFPVP RHAKAPPTWY TLSV 



Seq ID NO: 170 DNA sequence 
Nucleic Acid Accession #: AB033100 

Coding sequence: 32-2623 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

AGGTCTGGGG TCCTGAGGCT GCTGGCAGAC T ATGG GTACA ACGGCCAGCA CAGCCCAGCA 60 

GACGGTCTCG GCAGGCACCC CATTTGAGGG CCTACAGGGC AGTGGCACGA TGGACAGTCG 120 

GCACTCCGTC AGCATCCACT CCTTCCAGAG CACTAGCTTG CATAACAGCA AGGCCAAGTC 180 

CATCATCCCC AACAAGGTGG CCCCTGTTGT GATCACGTAC AACTGCAAGG AGGAGTTCCA 240 

GATCCATGAT GAGCTGCTCA AGGCTCATTA CACGTTGGGC CGGCTCTCGG ACAACACCCC 300 

TGAGCACTAC CTGGTGCAAG GAGCTCAGGC CTTACCCCAG GGCCGCTACT TCCTGGTGCG 360 

GGATGTCACT GAGAAGATGG ATGTG CTGGG CACCGTGGGA AGCTGTGGGG CCCCCAACTT 4 20 

CCGGCAGGTG CAGGGTGGGC TCACTGTGTT CGGCATGGGA CAGCCCAGCC TCTTAGGGTT 4 80 

CAGGCGGGTC CTCCAGAAAC TCCAGAAGGA CGGACATAGG GAGTGTGTCA TCTTCTGTGT 540 

GCGGGAGGAA MCTGTGCTTT TCCTGCGTGC AGATGAGGAC TTTGTGTCCT ACACACCTCG 600 

AGACAAGCAG AACCTTCATG AGAACCTCCA GGGCCTTGGA CCCGGGGTCC GGGTGGAGAG 660 

CCTGGAGCTG GCCATCCGGA AAGAGATCCA CGACTTTGCC CAGCTGAGCG AGAACACATA 720 

CC ATGTG TAC CATAACACCG AGGACCTGTG GGGGGAGCCC CATGCTGTGG CCATCCATGG 780 

TGAGGACGAC TTGCATGTGA CGGAGGAGGT GTACAAGCGG CCCCTCTTCC TGCAGCCCAC 84 0 

CTACAGGTAC CACCGCCTGC CCCTGCCCGA GCAAGGGAGT CCCCTGGAGG CCCAGTTGGA 900 

CGCCTTTGTC AGTGTTCTCC GGGAGACCCC CAGCCTGCTG CAGCTCCGTG ATGCCCACGG 960 

GCCTCCCCCA GCCCTCGTCT TCAGCTGCCA GATGGGCGTG GGCAGGACCA ACCTGGGCAT 1020 

GGTCCTGGGC ACCCTCATCC TGCTTCACCG CAGTGGGACC ACCTCCCAGC CAGAGGCTGC 108 0 

CCCCACGCAG GCCAAGCCCC TGCCTATGGA GCAGTTCCAG GTGATCCAGA GCTTTCTCCG 114 0 

CATGGTGCCC CAGGGAAGGA GGATGGTGGA AGAGGTGGAC AGAGCCATCA CTGCCTGTGC 1200 

CGAGTTGCAT GACCTGAAAG AAGTGGTCTT GGAAAACCAG AAGAAGTTAG AAGGTATCCG 1260 

ACCGGAGAGC CCAGCCCAGG GAAGCGGCAG CCGACACAGC GTCTGGCAGA GGGCGCTGTG 1320 

GAGCCTGGAG CGATACTTCT ACCTGATCCT GTTTAACTAC TACCTTCATG AGCAGTACCC 138 0 

GCTGGCCTTT GCCCTCAGTT TCAGCCGCTG GCTGTGTGCC CACCCTGAGC TGTACCGCCT 144 0 

GCCCGTGACG CTGAGCTCAG CAGGCCCTGT GGCTCCGAGG GACCTCATCG CCAGGGGCTC 1500 

CCTACGGGAG GACGATCTGG TCTCCCCGGA CGCGCTCAGC ACTGTCAGAG AGATGGATGT 1560 

GGCCAACTTC CGGCGGGTGC CCCGCATGCC CATCTACGGC ACGGCCCAGC CCAGCGCCAA 1620 

GGCCCTGGGG AGCATCCTGG CCTACCTGAC GGACGCCAAG AGGAGGCTGC GGAAGGTTGT 1680 

CTGGGTGAGC CTTCGGGAGG AGGCCGTGTT GGAGTGTGAC GGGCACACCT ACAGCCTGCG 1740 

GTGGCCTGGG CCCCCTGTGG CTCCTGACCA GCTGGAGACC CTGGAGGCCC AGCTGAAGGC 1800 

CCATCTAAGC GAGCCTCCCC CAGGCAAGGA GGGCCCCCTG ACCTACAGGT TCCAGACCTG I860 

CCTTACCATG CAGGAGGTCT TCAGCCAGCA CCGCAGGGCC TGTCCTGGCC TCACCTACCA 1920 

CCGCATCCCC ATGCCGGACT TCTGTGCCCC CCGAGAGGAG GACTTTGACC AGCTGCTGGA 1980 

GGCCCTGCGG GCCGCCCTCT CCAAGGACCC AGGCACTGGC TTCGTGTTCA GCTGCCTCAG 2040 

CGGCCAGGGC CGTACCACAA CTGCGATGGT GGTGGCTGTC CTGGCCTTCT GGCACATCCA 2100 

AGGCTTCCCC GAGGTGGGTG AGGAGGAGCT CGTGAGTGTG CCTGATGCCA AGTTCACTAA 2160 

GGGTGAATTT CAGGTAGTAA TGAAGGTGGT GCAGCTGCTA CCCGATGGGC ACCGTGTGAA 2220 

GAAGGAGGTG GACGCAGCGC TGGACACTGT CAGCGAGACC ATGACGCCCA TGCACTACCA 2280 

CCTGCGGGAG ATCATCATCT GCACCTACCG CCAGGCGAAG GCAGCGAAAG AGGCGCAGGA 2340 

AATGCGGAGG CTGCAGCTGC GGAGCCTGCA GTACTTGGAG CGCTATGTCT GCCTGATTCT 2400 



260 



WO 02/079492 



PCT/US02/04915 



CTTCAACGCG TACCTCCACC TGGAGAAGGC CGACTCCTGG CAGAGGCCCT TCAGCACCTG 2460 

GATGCAGGAG GTGGCATCGA AGGCTGGCAT CTACGAGATC CTTAACGAGC TGGGCTTCCC 2520 

CGAGCTGGAG AGCGGGGAGG ACCAGCCCTT CTCCAGGCTG CGCTACCGGT GGCAGGAGCA 2580 

GAGCTGCAGC CTCGAGCCCT CTGCCCCCGA GGACTTGCTG TAGGGGGCCT TACTCCCTGT 2640 

CCCCCCACCC ACAGGGCCCC ACGCAGGCCT GGGGTGTCTG AGGTGCTCTT GGCTGGGAGC 2700 

GGCCCTGAGG GGTGCTGGCC TTGAAATGAT TCCCCCACTT CCTGGAGAGA CTGAGCGGAG 2760 

TTGGGAGCCT TTTTAGAAAG AACTTTTTAT AGGACAGGGA GACAGCACAG CCATCCCTTG 2820 

CAAACCACCA AGGTGTGTGG CTGACCTCCA GGGAGGAGCA CTCACTGGAG TGCTCACAAG 2880 

GTGCACACTG CTGTGTGTAC CTTGCAGACA GGCCGGCGTT CAGCCTCCAA GGGGCTCACT 2940 

CCCCCAGTTG CCAAACACTG TGGATCTCTC TGTCCTCTTC TCCCCTCTCT CAGATTGGCC 3000 

TGGCAGCCCC TGGCACAGAG CAGACCCGGC CACTGGTAGC TCCCCACTTC CTTACTCCTG 3060 

CTGCTCTGCC ATTGCCGCTC CCCTTCTTGC TGCCCAAGCA CTGCCCTCGG GCGTCTGGCA 3120 

GCCTGAGGTG GGTGGAGGGG ACAGTGTTCT GGATAGATCT ATTATGTGAA AGGCAGCTTC 3180 

ACCCAGTTTT CTGGACTCTC ATGCCCCCAT CTCCGACCTG GGAGACTTCA GGAATGACAA 3240 

CCTACCCAGC CTGGTGGGGC TGGCAGGATG GTGGAGGTTT CTCAAGGAGC TGGAGACTTC 3300 

AGGGAGCCCC TCTCATGGGG AGGAAAGAGC TTCCAGGGGG CGAACGCAGC ACAGAGGAAG 3360 

AGGCCTGCTC CACTTGTCTG GGAACCTGGG CAGGAGGCAC AGAGGAAGCC AAGGCCTGGA 3420 

GCTGCAGGTC CCCCGGCATC TCTCTCTGTC CCGGCAGCCC AGGATGGCCT GGTGCCCCCA 3480 

CCTGCTGCAG CAGGAGCCCC AAGGAGTGCT AGCTGAGGGT GGTTGCTGGG GTGGTCCTCA 3540 

TGGACAGTGA GGTGTGCAAG GGTGCACTGA GGGTGGTGGG AGGGGATCAC CTGGGTTCCA 3600 

GGCCATCCTT GCTGAGCATC TTTGAGCCTG CCTTCCGGTG GGAGCAGAAA AGGCCAGACC 3660 

CTGCTGAGTT AGAGGCTGCT GGGATCCACT GTTTCCACAC AGCGGGAAGG CTGCTGGGAA 3720 

CAGGTGGCAG AGAAGTGCCA TGTTTGCGTT GAGCCTTGCA GCTCTTCCAG CTGGGGACTG 3780 

GTGCTTGCTG AAACCCAGGA GCTGAACAGT GAGGAGGCTG TCCACCTTGC TTGGCTCACT 3840 

GGGACCAGSA AAGCCTGTCT TTGGTTAGGC TCGTGTACTT CTGCAGGAAA AAAAAAAAAG 3900 

GATGTGTCAT TGGTCATGAT ATTTGAAAAG GGGAGGAGGC CGAAGTTGTT CCCATTTATC 3960 

CAGTATTGGA AAATATTTGA CCCCCTTGGC TGAATTCTTT TGCAGAACTA CTGTGTGTCT 4020 

GTTCACTACC TTTTCAGGTT TATTGTTTTT ATTTTTGCAT GAATTAAGAC GTTTTAATTT 4080 

CTTTGCAGAC AAGGTCTAGA TGCGGAGTCA GAGATGGGAC TGAATGGGGA GGGATCCTTT 4140 

GTGTTCTCAT GGTTGGCTCT GACTTTCAGC TGTGTTGGGA CCACTGGCTG ATCACATCAC 4200 

CTCTCTGCCT CAGTTTCCCC ATCTGTAAAA TGGGAGAATA ATACTTGCCT ACCTACCTCA 4260 

CRGGGGTGTT GTGAGGATTC ATTTGTGATT TTTTTTTTTT TTTTTGTACA GAGCTTTTAA 4320 
GCATTAAAAA CAGCTAAATG TG 



Seq ID No: 171 Protein sequence : 
Protein Accession #: BAA86588.1 



1 11 21 31 41 51 

I I I I I I 

MGTTASTAQQ TVSAGTPFEG LQGSGTMDSR HSVSIHSFQS TSLHNSKAKS IIPNKVAPW 60 

ITYNCKEEFQ IHDELLKAHY TLGRLSDNTP EHYLVQGAQA LPQGRYPLVR DVTEKMDVLG 12 0 

TVGSCGAPNF RQVQGGLTVF GMGQPSLLGF RRVLQKLQKD GHRECVIFCV REEVLFLRAD 180 

EDFVSYTPRD KQNLHENLQG LGPGVRVESL ELAIRKEIHD FAQLSENTYH VYHNTEDLWG 240 

EPHAVAIHGE DDLHVTEEVY KRPLFLQPTY RYHRLPLPEQ GSPLEAQLDA FVSVLRETPS 300 

LLQLRDAHGP PPALVFSCQM GVGRTNLGMV LGTLILLHRS GTTSQPEAAP TQAKPLPMEQ 360 

FQVIQSFLRM VPQGRRMVEE VDRAITACAE IiHDLKEWLE NQKKLEGIRP ESPAQGSGSR 420 

HSVWQRALWS LERYFYLILF NYYLHEQYPL AFALSFSRWL CAHPELYRLP VTLSSAGPVA 4 80 

PRDLIARGSL REDDLVS PDA LSTVREMDVA NFRRVPRMPI YGTAQPSAKA LGSILAYLTD 54 0 

AKRRLRKWW VSLREEAVLE CDGHTYSLRW PGPPVAPDQL ETLEAQLKAH LSEPPPGKEG 600 

PLTYRFQTCL TMQEVFSQHR RACPGLTYHR IPMPDFCAPR EEDFDQLLEA LRAALSKDPG 660 

TGFVFSCLSG QGRTTTAMW AVLAFWHIQG FPEVGEEELV SVPDAKFTKG EFQWMKWQ 720 

LLPDGHRVKK EVDAALDTVS ETMTPMHYHL REIIICTYRQ AKAAKEAQEM RRLQLRSLQY 780 

LERYVCLILF NAYLHLEKAD SWQRPFSTWM QEVASKAGIY EILNELGFPE LESGEDQPFS 840 
RLRYRWQEQS CSLEPSAPED LL 



Seq ID NO: 172 DNA sequence 

Nucleic Acid Accession #: AK021806.1 

Coding sequence: 1-645 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

ACTGTGCTTT TCCTGCGTGC AGATGAGGAC TTTGTGTCCT ACACACCTCG AGACAAGCAG 60 

AACCTTCATG AGAACCTCCA GGGCCTTGGA CCCGGGGTCC GGGTGGAGAG CCTGGAGCTG 120 

GCCATCCGGA AAGAGATCCA CGACTTTGCC CAGCTGAGCG AGAACACATA CCATGTGTAC 180 

CATAACACCG AGGACCTGTG GGGGGAG CCC CATGCTGTGG CCATCCATGG TGAGGACGAC 240 

TTGCATGTGA CGGAGGAGGT GTACAAGCGG CCCCTCTTCC TGCAGCCCAC CTACAGGTAC 300 

CACCGCCTGC CCCTGCCCGA GCAAGGGAGT CCCCTGGAGG CCCAGTTGGA CGCCTTTGTC 360 

AGTGTTCTCC GGGAGACCCC CAGCCTGCTG CAGCTCCGTG ATGCCCACGG GCCTCCCCCA 420 

GCCCTCGTCT TCAGCTGCCA GATGGGCGTG GGCAGGACCA ACCTGGGCAT GGTCCTGGGC 480 

ACCCTCATCC TGCTTCACCG CAGTGGGACC ACCTCCCAGC CAGAGGCTGC CCCCACGCAG 540 

GCCAAGCCCC TGCCTATGGA GCAGTTCCAG GTGATCCAGA GCTTTCTCCG CATGGTGCCC 600 

CAGGGAAGGA GGATGGTGGA AGAGGTGGAT AGATCTATTA TGTGAAAGGC AGCTTCACCC 660 

AGTTTTCTGG ACTCTCATGC CCCCATCTCC GACCTGGGAG ACTTCAGGAA TGACAACCTA 720 

CCCAGCCTGG TGGGGCTGGC AGGATGGTGG AGGTTTCTCA AGGAGCTGGA GACTTCAGGG 780 

AGCCCCTCTC ATGGGGAGGA AAGAGCTTCC AGGGGGCGAA CGCAGCACAG AGGAAGAGGC 840 
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10 



15 



CTGCTCCACT 
CAGGTCCCCC 
CTGCAGCAGG 
CAGTGAGGTG 
ATCCTTGCTG 
TGAGTTAGAG 
TGGCAGAGAA 
TTGCTGAAAC 
CCAGGAAAGC 
TGTCATTGGT 
ATTGGAAAAT 
ACTACCTTTT 
GCAGACAAGG 
TCTCATGGTT 
CTGCCTCAGT 
GGTGTTGTGA 
TAAAAACAGC 



TGTCTGGGAA 
GGCATCTCTC 
AGCCCCAAGG 
TGCAAGGGTG 
AGCATCTTTG 
GCTGCTGGGA 
GTGCCATGTT 
CCAGGAGCTG 
CTGTCTTTGG 
CATGATATTT 
ATTTGACCCC 
CAGGTTTATT 
TCTAGATGCG 
GGCTCTGACT 
TTCCCCATCT 
GGATTCATTT 
TAAATGTG 



CCTGGGCAGG 
TCTGTCCCGG 
AGTGCTAGCT 
CACTGAGGGT 
AGCCTGCCTT 
TCCACTGTTT 
TGCGTTGAGC 
AACAGTGAGG 
TTAGGCTCGT 
GAAAAGGGGA 
CTTGGCTGAA 
GTTTTTATTT 
GAGTCAGAGA 
TTCAGCTGTG 
GTAAAATGGG 
GTGATTTTTT 



AGGCACAGAG 
CAGCCCAGGA 
GAGGGTGGTT 
GGTGGGAGGG 
CCGGTGGGAG 
CCACACAGCG 
CTTGCAGCTC 
AGGCTGTCCA 
GTACTTCTGC 
GGAGGCCGAA 
TTCTTTTGCA 
TTGCATGAAT 
TGGGACTGAA 
TTGGGACCAC 
AGAATAATAC 



GAAGCCAAGG 
TGGCCTGGTG 
GCTGGGGTGG 
GATCACCTGG 
CAGAAAAGGC 
GGAAGGCTGC 
TTCCAGCTGG 
CCTTGCTTGG 
AGGAAAAAAA 
GTTGTTCCCA 
GAACTACTGT 
TAAGACGTTT 
TGGGGAGGGA 
TGGCTGATCA 
TTGCCTACCT 
TGTACAGAGC 



CCTGGAGCTG 900 

CCCCCACCTG 960 

TCCTCATGGA 1020 

GTTCCAGGCC 1080 

CAGACCCTGC 1140 

TGGGAACAGG 1200 

GGACTGGTGC 1260 

CTCACTGGGA 1320 

AAAAAGGATG 1380 

TTTATCCAGT 1440 

GTGTCTGTTC 1500 

TAATTTCTTT 1560 

TCCTTTGTGT 1620 

CATCACCTCT 1680 

ACCTCACGGG 1740 

TTTTAAGCAT 1800 



20 



Seq ID No: 173 Protein sequence : 
Protein Accession #: AK021806.1 



25 1 11 21 31 41 51 

I I I I I I 

TVLFLRADED FVSYTPRDKQ NLHENLQGLG PGVRVESLEL AIRKEIHDFA QLSENTYHVY 

HNTEDLWGEP HAVAIHGEDD LHVTEEVYKR PLFLQPTYRY HRLPLPEQGS PLEAQLDAFV 

SVLRETPSLL QLRDAHGPPP AIiVFSCQMGV GRTNLGMVLG TLILLHRSGT TSQPEAAPTQ 

30 AKPLPMEQFQ VIQSFLRMVP QGRRMVEEVD RSIM 



60 
120 
180 



35 



Seq ID- NO: 174 DNA sequence 

Nucleic Acid Accession #: NM_016580.2 

Coding sequence: 1212-4766 (underlined sequences correspond to start and stop codons) 



45 



50 



40 GGGAAGCGGG 
AGACCATGGG 
. TCCAGAGGTA 
AACGTCACTG 
TTTAATCCTC 
AGATGAGGTA 
AGTTGCAAAG 
CAGCTGCTTT 
ACAAGATTGG 
GAGAGTTGGG 
ACGTGCAAGC 
CCCTCAGCAG 
CCGGCCTCAT 
TTGGCATTGA 
TGCTTCTCTG 
55 ACTGGGTCAG 
GTTCCCCAGT 
CTTCCAATGA 
GACAAGGTTG 
AACACTGGAG 
60 TGGCGGTAAG 
ACTTATTTCT 
CAGAGGAAGT 
AGAGGCGGAG 
TTCAGGTGGA 
65 TGTGCCGACA 
CTCTGATCCA 
AAGGCGAGCA 
ACAGAGCTCT 
GTGAGCACTT 
TAGTGGTGAA 
ATGACAATGG 
CCAATGACAA 
CTGCACCTGG 
GGGAGGTGGA 
75 TTGATGCCAA 
CCTACGAGGT 



70 



ii 
I 

AGGAGAGCCA 
CACCCTCATA 
GCCATAGGGT 
CCTGTGACTC 
ACAGTTTCCT 
ATGGAGGCCC 
TCAGAATTTG 
CCAGTGAGAC 
GAAAAAGACA 
GAAGGGGAGA 
AGGTGGAGGC 
CCGCCTTGCT 
CCTCCTCAGC 
AACTCACAGC 
TTTCCCCAGC 
GTCCCTCTTA 
AGGGGGTCTC 
AGATGGCCTT 
TGCCGGCAGC 
GACTTGTCCA 
C ATG ATGCAA 
TTTAGGGGAT 
GCCATCTGGT 
GCAAGCTGGG 
CTCTGAGGAA 
GTGGGATCCC 
TGTGGAGATC 
GGAGCTGGAA 
TGACCCAGAC 
TGCCTTGGAT 
GGAGCTGGAC 
GAACCCCCCC 
TAGCCCTGCG 
TACGCTTCTC 
GTTCTTCCTC 
GACAGGCCAG 
GGATGTTCAG 



21 , 
I 

CACGGTCAAG 

AGTCAGTGTG 

GTGACAAGTT 

GGGGCCAGGC 

GCTGAAAGGG 

AGGAAAGTTA 

AACTCAGGCA 

AAAAACGGGT 

GGAATGAGAG 

GAAACAGGAA 

TCAAGTTTTC 

GCGTGGACAG 

GGCAGGCCAC 

CCTCCCTTTT 

CTAGCAACTG 

GACCAGCTCT 

CCCTGGCAAT 

GCCTTGGGGT 

TCTGGGGGAA 

GCCTTGAAAG 

CTTCTGCAAC 

TGTCAGGAGG 

ACAGTGATCG 

GCTGCCTTCC 

GGCTTGCTCA 

TGCCTGGTTT 

CAAGTGCTGG- 

ATCTCTGAGA 

ACAGGCCCTA 

GTCATTGTGG 

AGGGAAATCC 

AAGTCAGGTA 

TTTGCTGAGA 

ATAAAACTGA 

AGTAAGCACA 

GTCATTCTGC 

GCAAGGGACC 



31 
I 

TTGCACAGGT 
GGCAGGGACT 
GTGCAGATTA 
CCAGGCCAAA 
CTACTATTCT 
AGTGACTTGT 
GTTTACCTCT 
GATCAGGGCA 
GGGAACAATG 
ACATGACTTG 
TGCTCACTTG 
CAGCTTCCCA 
TTAGCTTCAC 
CCTGTAGGTG 
TTTGGCAGTC 
TGTCCATCAT 
TCTTGATCGG 
CCTGCTTGTT 
GGAGCACGGG 
AACTCTAGTG 
TTCTGCTGGG 
TGACCACTCT 
GGAAGCTGTC 
AGGTGTTGCA 
GCACAGGCAG 
CCTTTGATGT 
ACATCAATGA 
GCGCCTCTCT 
ACACCCTGCA 
GCCCTGATGA 
ATTCATTTTT 
CCAGCTTGGT 
GTTCACTGGC 
CCGCCACAGA 
TGCCTCCAGA 
GTCGACCTCT 
TGGGTCCCAA 



41 
I 

TCTTGCAGCT 
GCCCCAGGGC 
CAACACTCAC 
GCCCTTCCTA 
TACTCCCATC 
CCCAGATGAC 
GATGGCTGCT 
GAGTCAAGAC 
GGGGAAAAGA 
CCCGGGAGGG 
GTGATGCAGA 
TCTGGCCTGT 
AGGAAATGCT 
GGGTTTCCAT 
AGAGTCCCAC 
TTGCTGAAGT 
CGTTTGGACA 
TCATAATCAT 
GCTGATCAAG 
GTTTCTGAAT 
GCTTTTGGGG 
CACGGTGAAA 
CCAGGAACTG 
GCTGCCTCAG 
GCGGCTGGAT 
GCTTGCCACA 
CCACCAGCCA 
GCGAACCCGG 
CACCTACACT 
GACCAAACAT 
TGATCTGGTG 
CAAGGTCAAC 
ACTGGAAATC 
CCCTGACCAA 
GGTGCTGGAC 
AGACTATGAA 
TCCTATCCCA 



51 
I 

TCTGGAATCA 60 

CAATCCAAGA 120 

CCCTTGCAAT 180 

CATCATTTCG 24 0 

CCCACTCTAC 300 

ACCGCTGGTA 3 60 

CTGTTAATCA 420 

AGAGAGGTAA 4 80 

TAGGAACAAA 540 

GCATCAGTCC 600 

GGCTCCCTTT 660 

CCCCGGAGCC 720 

CTTTCTCTAA 780 

AGGAAAAAGC 840 

ATCCTGCTCA 900 

GGACCAACTA 960 

TCTCAGATCG 1020 

CTAACTATGG 1080 

CCATCCAGGA 1140 

CTAGCCCACT 1200 

CCAGGTGGCT 1260 

TACCAAGTGT 1320 

GGCCGGGAGG 1380 

GCGCTCCCCA 1440 

CGAGAGCAGC 1500 

GGGGATTTGG 1560 

CGGTTTCCCA 1620 

ATCCCCCTGG 1680 

CTGTCTCCCA 1740 

GCAGAACTCA 1800 

TTAACTGCCT 1860 

GTCTTGGACT 1920 

CAAGAAGATG 1980 

GGCCCCAATG 204 0 

ACCTTCAGTA 2100 

AAGAACCCTG 2160 

GCCCATTGCA 2220 
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AAGTTCTCAT 
CCTCCCAGCC 
TGGCAGATGA 
TGGGCCACTT 
5 CACTGGACAG 
TCCAGCCCTT 
CACCTGTGTT 
TTCACCTCAT 
ACCGCATCCA 

1 0 TCACTGCTCA 
CAGAGGACAG 
ATGCCAATGA 
TCTCCGTGCT 
GCTTGGGCCC 

1 5 TTTTGACAAC 
GCATCCGCAG 
TCGTCAATGT 
TAGAGGACCA 
CCAGTGTGGA 

20 TGACGGTGAT 
TGTCCATCTG 
CCACCTACCG 
TCGTGCCTGT 
ATGTGGACAA 

25 ACCTCACCCC 
AGAGCCGAGA 
ATGCCTCCCG 
CCAGGCCTCT 
AGGAAGCCCC 

30 ATGG CAAAGT 
GGCTGTCTGT 
CTCCTGTTCA 
AACCAAACCA 
CAGACACAGA 

35 GGCCTTTGGA 
GTCTGCTGGA 
GGATGGCGAG 
ATGCTGCAGC 
AGCTGAGCCC 

40 TGGAGATGCT 
GGCGGCTCTC 
GCATGAAAGT 
GCAGCAGCAG 
GGGCCTGAGG 

45 GCGGCCTGAG 
ACTAACAGCT 
GGGACGGTTT 
CAGTCCTCTG 
GGTAGCAGGA 

50 ATAAAGGAAA 



CAAGGTTCTG 
ATCACTGGTG 
CTTGGATTCA 
CAGGCTGAAA 
AGAGCAGTGG 
ATCAGCCAAG 
TGAGAAAAGC 
TACCATCAAG 
GGACTCCCCA 
GAGGTCACTG 
CGGGCAACCC 
TAATGCCCCA 
TGTGAATGCC 
AGCGGGCACT 
CATTGTGGCA 
TGGAAATGAA 
CACCAATGCC 
GGGAAGCCCC 
CCACCTGAGG 
CTGCCTGGCT 
CCGGACAGAA 
CCAGCAGCCC 
GCTCAGGGGT 
GGAGGCGATG 
GACCCTGTAC 
GGTGCTGCAA 
GGAGAACCTG 
GAAGGTTGCA 
ACAGAGGCCA 
GTCCCCTGAG 
GGCTGCCTTC 
GCAAATCTCC 
CCGAGGAAAT 
TGGCCCAAGT 
TCCTGAAGAG 
CCCCAGCACA 
ACTCTCTTTG 
CACGGAGGAG 
AACAGGCACG 
GCTGGAACAG 
GGTCTGCGGG 
GCAAGGGGAC 
CAGCAGGTGC 
ATCTGTGGAC 
AACTTTAGGG 
GACTGACCAA 
GTGGCTGAGA 
GCAGAACAGA 
GTCAGGGGGC 
AGCAGTGATT 



GATGTCAATG 
TCAGAAGCTC 
GGACACAATG 
AGAACTAATG 
CCCAAATATA 
AAACAGCTCA 
AGGTATGAAG 
GCTCATGATG 
GTTGCTCACT 
AACTATGAAG 
ATGCTTGCAT 
GAGGTGGTCC 
TCCACAGGCC 
GACACACCTC 
AGAGATGCAG 
GCCCACCTCT 
AGCAGCCTCA 
CCCTTACAGA 
GACTCAGCCC 
GTACTGTTGG 
AAGAAGGACA 
AAGAGGCCCC 
CAGGCAGGTG 
ATGGAAG CAG 
AGGACGCTGC 
GACACGGTCA 
AACCTTCCCG 
GGCAGCCCCA 
CCAGCCTCCT 
AAAGAATCAG 
GCCGAGCGGA 
CAGCTGCTGT 
AAGTACTTGG 
GCAAGGGCTG 
GACCTCTCTG 
GGTCTGGCCC 
CCCCTCACCA 
CCAAGGACCT 
AGGCTGGCCA 
CGCTCCAGCA 
AGGACCCTCA 
CCAGGTGGAA 
CTG TGAA CAT 
AAG AG CTGGT 
TGACTGATGC 
AGCAGCCCCT 
TAAGTGTTTC 
TGCCACGGAG 
TGTACCCTGG 
CAAAAAAAAA 



ACAACATCCC 
TTCCCAAGGA 
GTTTGGTCCA 
GCAACACATA 
CCCTCACTCT 
GCATTCAGAT 
TCTCCACGCG 
CAGACTTGGG 
TAGTAGCTAT 
AGATGGCCGG 
CCAGTGTCTC 
AGCCTGTGCT 
ACCTGCTGGT 
CACTGGCCAC 
ACTCGGGGGC 
TCATCCTCAA 
TTGGGAGTGA 
CCCGAGCCCT 
GCAAGCCTGG 
GCATCTTCGG 
ACAGGGCCTA 
AGAAACACAT 
AGCCTTGTGA 
GCTGGGACCC 
GTAATCAAGG 
ACCTCCTTTT 
AGCCCCAGCC 
CAGGGAGGCT 
CTGCAACCCT 
GGCCCCGTCA 
ACCCCGTGGA 
CCTTGCTGCA 
CCAAGCCAGG 
GAGGCCAGAC 
TGAAGCAACT 
TGGACCGGCT 
CCAACTACCG 
TCCAGACGTT 
GCACCTTTGT 
TGCCCGTGGA 
GTTTAGACTT 
AGACGGGGAC 
ACCTCAGACG 
TTCTAAAATC 
TACCCCCACA 
TGTAAGCAGC 
CTGGCAAAAC 
TATCACAGGC 
GGGTGCCAGG 
AAAAAAAAAA 



AAGCATCCAC 
CAGTTTTATT 
CTGCTGGCTG 
CATGTTGCTA 
GTTAGCCCAA 
CAGTGACATC 
GGAAAACAAC 
CATTAATGGA 
TGACTCCAAC 
CTTTGAGTTC 
TGTGTGGGTC 
CAGCGATGGA 
GCCCATCGAG 
TCACAGCTCC 
AAATGGAGAG 
CCCTCATACG 
GTGGGAGCTG 
GTTGAGGGTC 
GGCCTTGAGC 
GTTGATCCTG 
CAACTGTCGG 
TCAGAAGGCA 
AGTCGGGCAG 
CTGCCTGCAG 
CAACCAGGGA 
CAACCATCCC 
TGCCACAGGC 
GGCTGGAGAC 
GAGACGGCAG 
GATCCTGCGG 
GGAGCTCACT 
TCAGGGCCAA 
AGGCAGCAGG 
AGACCCAGAA 
GCTAGAAGAA 
GAGCGCCCCT 
TGACAATGTG 
CGGCAAGGCA 
CTCGGAGATG 
GGCCGCCTCC 
GGCCACCAGT 
TGAGGGCAAG 
CCTCTGGATC 
TTGTAACTCA 
GAGGAGGCAA 
TCTGAGTCTT 
ATATGTGGAG 
AGGAAAGGGT 
AAATGCTCTC 
AAAAAAAAAA 



GTCACATGGG 2280 

GCTCTTGTCA 2340 

AGCCAAGAGC 24 00 

ACCAATGCCA 2460 

GACCAAGGAC 2520 

AACGACAATG 2580 

TTACCCTCTC 2640 

AAAGTCTCAT 2700 

ACAGGAGAGG 2760 

CAGGTGATCG 2820 

AGCCTCTTGG 2880 

AAAGCCAGCC 2940 

ACTCCCAATG 3000 

CGGCCATTCC 3060 

CCCCTCTACA 3120 

GGGCAGCTGT 3180 

GAGATAGTAG 3240 

ATGTTTGTCA 3300 

ATGTCGATGC 3360 

GCTTTGTTCA 3420 

GAGGCCGAGT 3480 

GACATCCACC 3540 

TCCCACAAAG 3600 

GCCCCCTTCC 3660 

GCACCGGCGG 3720 

AGGCAGAGGA 3780 

CAGCCACGTT 3840 

CAGGGCAGTG 3 900 

CGACATCTCA 3 960 

AGCCTGGTCC 4020 

GTGGATTCTC 4080 

TTCCAGCCCA 4140 

AGTGCAATCC 4200 

CAGGAGGAAG 4260 

GAGCTGTCAA 4320 

GACCCGGCCT 4380 

ATCTCCCCGG 4440 

GAGGCACCAG 4500 

AGCTCACTGC 4560 

GAGGCGCTGC 4620 

GCAGCCTCAG 4680 

AGCAGAGGCA 4740 

CAAGAACCAG 4 800 

CTAGCTAGCG 4 860 

GAGCCCCAGG 4920 

TTGGAGGACA 4980 

CACAAAGGGT 504 0 

GGCCTTCTTG 5100 

TGACCTATCA 5160 



Seq ID No: 175 Protein secnience : 
Protein Accession #: NP_057664.1 



55 



60 



65 



70 



75 



i 
i 

MMQLLQLLLG 
QAGAAFQVLQ 
VEIQVLDIND 
ALDVIVGPDE 
SPAFAESSLA 
TGQVILRRPL 
SLVSEALPKD 
EQWPKYTLTL 
TIKAHDADLG 
GQPMLASSVS 
AGTDTPPLAT 
TNASSLIGSE 
CLAVLLGIFG 
LRGQAGEPCE 
VLQDTVNLLF 
QRPPASSATL 
QISQLLSLLH 
PEEDLSVKQL 
TEEPRTFQTF 



11 
I 

LLGPGGYLFL 
LPQALPIQVD 
HQPRFPKGEQ 
TKHAELIWK 
LEIQEDAAPG 
DYEKNPAYEV 
SFIALVMADD 
LAQDCGLQPL 
INGKVSYRIQ 
VWVSLLDAND 
HSSRPPLLTT 
WELEIWEDQ 
LILALFMSIC 
VGQSHKDVDK 
NHPRQRNASR 
RRQRKLNGKV 
QGQFQ PKPNH 
LEEELSSLLD 
GKAEAPELSP 



21 



LGDCQEVTTL 
SEEGLLSTGR 
ELEISESASL 
ELDREIHSFF 
TLLIKLTATD 
DVQARDLGPN 
LDSGHNGLVH 
SAKKQLSIQI 
DSPVAHLVAI 
NAPBWQPVL 
IVARDADSGA 
GSPPLQTRAL 
RTEKKDNRAY 
EAMMEAGWDP 
ENLNLPEPQP 
SPEKESGPRQ 
RGNKYLAKPG 
PSTGIiALDRL 
TGTRLASTFV 



31 
I 

TVKYQVSEEV 
RLDREQLCRQ 
RTRIPLDRAL 
DLVLTAYDNG 
PDQGPNGEVE 
PIPAHCKVLI 
CWLSQELGHF 
SDINDNAPVF 
DSNTGEVTAQ 
SDGKASLSVL 
NGEPLYSIRS 
LRVMFVTSVD 
NCREAESTYR 
CLQAPFHLTP 
ATGQPRSRPL 
ILRSLVRLSV 
GSRSAIPDTD 
SAPDPAWMAR 
SEMSSLLEML 



41 
I 

PSGTVIGKLS 
WDPCLVSFDV 
DPDTGPNTLH 
NPPKSGTSLV 
FFLSKHMPPE 
KVLDVNDNIP 
RLKRTNGNTY 
EKSRYEVSTR 
RSLNYEEMAG 
VNASTGHLLV 
GNEAHLFILN 
HLRDSARKPG 
QQPKRPQKHI 
TLYRTLRNQG 
KVAGSPTGRL 
AAFAERNPVE 
GPSARAGGQT 
LSLPLTTNYR 
LEQRSSMPVE 



51 
I 

QELGREERRR 60 

tiATGDIiALIH 120 

TYTLSPSEHF 180 

KVNVLDSNDN 240 

VLDTFSIDAK 300 

SIHVTWASQP 360 

MLLTNATLDR 420 

ENNLPSLHLI 480 

FEFQVIAEDS 540 

PIETPNGLGP 600 

PHTGQLFVNV 660 

ALSMSMLTVI 720 

QKADIHLVPV 78 0 

NQGAPAESRE 84 0 

AGDQGSEEAP 900 

ELTVDSPPVQ 960 

DPEQEEGPLD 1020 

DNVISPDAAA 1080 

AASEALRRLS 1140 



263 



WO 02/079492 



PCT/US02/04915 



VCGRTLSLDL ATSAASGMKV QGDPGGKTGT EGKSRGSSSS SRCL 

Seq ID NO: 176 DNA sequence 
Nucleic Acid Accession #: AL109712.1 

Coding sequence: 2-128 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

GAGTCTCTTT GGGCCAGCCG GGCTGCTGCA GACAGACAGG AAGCACGCCT GACGCTCCTC 60 

TACCCTCGGG CAGCACAGCG GGGCTGGGAC TCACTCTAGC TTGCCCAGCA ACTTGCTTTC 120 

CTGT GTGA AC TCTGGCAGGC TGCCCTCTCT GTGCAAAGCT GCCACTGGGG CCTGCTCAGG 180 

GTGGCCTGGA ACTTGGAGGT GGGCAGTCAG GGCCTAGGAT GGGCCTGTGT CACCAGGGCA 240 

TGTGCCCTTG GGCCAGTTAC TTCCTCTCAG AGCCTTGGGC TCCTCCTCTG AGGATGGGGC 300 

TTGTTGGTGT GAAATGAGGT GAGCATGTTG AGTTGGGGAG CAGCAGGACA CGCACCTGCA 360 

GGCAGCCGCC CTGGCCACGC TCCCTCCCTA CCTTCCGAGT CCTGGGACAG ACACAGTAGA 420 

GCACAGCGGG CCAGCCTGCT CTCTTCTCTG TCTACTTTTT GCAGAAGAGT CAACAGATAC 480 

AACAGGCCCA GGGAGGTGCC CCTGGGGGCC CCAGTCCCCA TCACTCCAAG GGGCAGTCCT 540 

GCAAGTGACA AGGTGGGCCC AATCCCTGTG GAACAGGTCT CTGAGGACCA CAGAGTGGGG 600 

CCCCAGGGAA AGCTGGGAGC CGAGCTAGAG GCAGGCAGCA AGTAAGGGCA AAGCTGTGCC 660 

CCTGCCCGGA AGACCTTCCT GCCCCCAGAA CCCGACCCTC CGCAGATAGC CCTCCCTGGG 720 

CAGCAGCCCC CCAGCTTCCA AGGCCCGTGC CTCACCAGAC GCCATGCTCT CACGGACTTG 780 

TTTGCTGCTC TGTACCCTGC AGATCTGCCC CAGAGGAGCA GGTGAAAAGC CGCGCCTGCC 840 

GAGGTGCTGT GGCGGTGGAG TTTTGGGCAG AGGAGTGGGG GGAAGAGTTT CTCACTTTTA 900 

AGATTCTCCA AATCCAAGAT GAAGTCATGC TGTGCTTTGG AATGGTAGAT GCTCATTTAT 960 

GTAAAATCAT AATAAATGTT ACACAAACTG TTAAAAAAAA AAAAAAAAAA AAAAAA 

Seq ID No: 177 Protein sequence: 
Protein Accession #: AIrl09712.1 

1 11 21 31 41 51 

I I I I I I 

VSLGQPGCCR QTGSTPDAPL PSGSTAGLGL TLACPATCFP V 

Seq ID NO: 178 DNA sequence 

Nucleic Acid Accession ff: none found 

Coding sequence: 3-107 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

AATGGAGCAC TCCAAAGAAC GATTTGACCA ATAGCATTTC TTCTCTGGGG GTTGTATTTC 

AAAGCATGCA ACTCTCCAGG GAACCAGAAC TAAATTGCTT AAAATGAAGT CATTCCTCAG 

ATTAACTTCC TCAGATAAAG TGTCAGCGGT CTGCAGAAAC GAAGAAGACA AAACTGAGAT 

TATCACTCAT AATTCTCTTA CTTACTATGT CAGTGAAACA ATGAGTTTGC ATTTTTGCAA 

TCCTAGAACA TTCTTCATTA GCCCTGGGTC ATGACCTCTT CCAGTTAATT CTCTTTCACA 

CCTTTAGGAA AGATTTAAGA TGAACCTTCA ATAGGATATT AACATAACTC ATAGCCAATA 

CCACAGCTGC CTTTCAAATT AATGAGGTTA ATTGTTCTCC AGCAAACATG AGTTTGTCTT 

TGG CATTTTA AATGCTTCCC ATTGATCTGA CATTTTGCTG TTTCAAGTTT TAAAGGGCTC 

AAATCAAAGA CTATTGATAA CTGAGCAAAG AG CG AAGATC CAGAAATACG AAAACATTGT 

CTTTTTTTTT CCATGAAAAA CAATCATAGC CTTTTGAATT CAATCGAAGT TTCTACATTA 

GCCATCTAAG ACTTATTTAA TTATTTCTGT TCTCAGTCAA GCTAATTCAA GTGAATGAAC 

AGTATTGACT TTTAAAATCT TTTTTAAATT TTTTTAAATC TTTAGTTTAT TAAGTTTGTA 

GAAAAGCTCT GGGGCCATGA CCACTTACGT AAATGTTT C A GTTTAAAAAC AAAAGATTGA 

GGCCTCTAAT TTGAGCCAAA TCCAGGTGAT CTTGTTTGAA ATTTTTGATG AATTTGAAAA 

GATGAAAGTG GAACTTTTAA CATTCATGTT CCCCAAATTT TTCACTGGGA AGGGATGCTA 

ATTGCCTACT TAAGATATAA GTTCAAGAAT AACATTTTCA TAGAAAATTC AGAAAACTGC 

TTGACACAGC AGTGACATAG TTAGATGTGG CTCAGATGCC TTCCAAACCT GAGGGTCCCC 

AAAGATTTCT TTACCAGTTG TTTTTAACTA TGAATCTTAA TCTTGTTCAT TCCCCTGCCA 

AAACAAATTT AAAAG 

Seq ID No: 179 Protein sequence: 
protein Accession #j none found 



1 11 21 31 41 51 

I I I I I I 

WSTPKNDLTN SISSLGWFQ SMQLSREPEL NCLK 

Seq ID NO: 180 DNA sequence 
Nucleic Acid Accession #: none found 
Coding sequence: 2-176 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 
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264 



WO 02/079492 



CCGGGTGGGG CCTCGGGATG CAGGCGCCGG TGCCCGGGCC CCTGGGCCTG CTGGACCCCG 60 

CAGAAGGGCT TTCGAGGAGG AAGAAGACGT CGCTCTGGTT TGTGGGGTCT CTGCTGCTGG 120 

TGTCCGTCCT CATAGTCACC GTCGGGCTGG CTGCATCAGC AGGACGGAGA ATGTGACCGT 180 

TGGGGGCTAC TACCCAGGGA TCATTCTCGG CTTTGGATCT TTCTTAGGAA TTATTGGCAT 240 

CAACTTGGTG GAGAATAGAA GGCAAATGCT GGTGGCAGCG ATCGTGTTTA TCAGTTTTGG 300 

CGTGGTGGCC GCCTTCTGCT GCGCCATCGT GGACGGCGTA TTTGCAGCAC AGCACATTGA 360 

ACCGAGGCCC CTCACCACGG GAAGATGCCA GTTTTACTCC AGTGGGGTGG GGTACTTGTA 420 

CGATGTCTAC CAGACAGAGG TGAGCAGGAG CACTGAGATT CATGTGGGTT TTGCTCAGCT 4 80 

AACCCCGCCG ACCCCACGCG GTTTTCCCTG CACATAGGCG TGGTCTGAAT ATTTGGATTC 540 

TAATAGTTCC TGGGGGTCAC CCCTGCAGCT GGTGAACCGT TGATGCCCCC TGTGTAAGGG 600 

ACCTTGACAT TTCGATGTGC TGTATTTCAC TCTGGAGTCA GAGTTCTGGA CTTGCTTCAT 660 

TAAATCACAA CAGTCTCAGA AAACAACCGC ACCACCCCGC AATCCCACCA AAGGGGCGCG 720 
CCGTCCCTAA GAGTTATCCC 

Seq ID No: 181 Protein sequence : 
Protein Accession #: none found 

1 11 21 31 41 51 

I I I I I I 

RVGPRDAGAG ARAPGPAGPR RRAFEEEEDV AIiVCGVSAAG VRPHSHRRAG CISRTENV 

Seq ID NO: 182 DNA sequence 

Nucleic Acid Accession #: AK001579.1 

Coding sequence: 1150-2637 (underlined sequences correspond to start and 



1 11 21 31 41 51 

I I I I I. I 

TTTTCTCTGC TTTTCGCTAC CCCGGTCACT CTCATTTCTC TCCCCTATTC CTTGTCTCTT 60 

CCCCCATCCC CCTTTCTCCT GTCCTCCCCC TGCCTCTACA GTGGTTCTCC CCGCTGAGCT 120 

GCCACCAGCT GCTGGGCCCC GGGCTGCTGC GGCTGGGCCG CCTATGGCTG CGGTCCCCCT 180 

CCCATACAGC CCCGGCCCCT GGTCTCTGGC TGTCAGGGTT TGGCCTCCTT CGTGGTGACC 240 

ACCTCTTCCT GTGCTCAGCG CCGGGCCCAG GCCCCCCAGC CCCTGAGGAC ATGGTGCATC 300 

TGCGGCGGCT ACAGGAGATC AGTGTGGTTT CTGCAGCTGA CACCCCAGAT AAGAAAGAGC 360 

ATTTGGTCCT GGTGGAGACA GGAAGGACCC TGTATCTGCA AGGAGAGGGC CGGCTGGACT 420 

TCACGGCATG GAACG CAGCC ATTGGGGGCG CGGCTGGTGG GGGCGGCACA GGGCTGCAGG 4 80 

AGCAGCAGAT GAGCCGGGGT GACATCCCCA TCATCGTGGA TGCCTGCATC AGTTTTGTTA 54 0 

CCCAGCATGG GCTCCGGCTG GAAGGTGTAT ACCGGAAAGG GGGCGCTCGT GCCCGCAGCC 600 

TGAGACTCCT GGCTGAGTTC CGTCGGGATG CCCGGTCGGT GAAGCTCCGA CCAGGGGAGC 660 

ACTTTGTGGA GGATGTCACT GACACACTCA AACGCTTCTT TCGTGAGCTC GATGACCCTG 720 

TGACCTCTGC ACGGTTGCTG CCTCGCTGGA GGGAGGCTGC TGGTATTCCT AAGATCCCTG 780 

AGAGCCAAGG CCCAACCAGG ATCTCTG CCT TCCCCCACCA GAATCCATGG TTTGGCAGCC 840 

CTCCGCCCCA TCACTTCCCA CCCTGGGGGA TCATCCAGAG ACTTGGCTCA GGGGGAGGTG 900 

GGAAGGGGGC AGAGACACAT CCATCCTGCA TTTGTGCCTA AAAATCCCTC CCTCTGTACC 960 

AGCTGCCACT CTTTCTTCCC GGGTCCTCCC CAACCCTCCT CCATTCCATC CCCAGAGCTG 1020 

CCCCAGAAGA ATCAGCGCCT GGAGAAATAT AAAGATGTGA TTGGCTGCCT GCCGCGGGTC 1080 

AACCGCCGCA CACTGGCCAC CCTCATTGGG CATCTCTATC GGGTGCAGAA ATGTGCGGCT 114 0 

• CTAAACCAG A TG TGCACGCG GAACTTGGCT CTGCTGTTTG CACCCAGCGT GTTCCAGACG 1200 

GATGGGCGAG GGGAGCACGA GGTGCGAGTG CTGCAAGAGC TCATTGATGG CTACATCTCT 1260 

GTCTTTGATA TCGATTCTGA CCAGGTAGCT CAGATTGACT TGGAGGTCAG TCTTATCACC 1320 

ACCTGGAAGG ACGTGCAGCT GTCTCAGGCT GGAGACCTCA TCATGGAAGT TTATATAGAG 1380 

CAGCAGCTCC CAGACAACTG TGTCACCCTG AAGGTGTCCC CAACCCTGAC TGCTGAGGAG 1440 

CTGACTAACC AGGTACTGGA GATGCGGGGG ACAGCAGCTG GGATGGACTT GTGGGTGACT 1500 

TTTGAGATTC GCGAGCATGG GGAGCTGGAG CGGCCACTGC ATCCCAAGGA AAAGGTCTTA 1560 

GAGCAGGCTT TACAATGGTG CCAGCTCCCA GAGCCCTGCT CAGCTTCCCT GCTCTTGAAA 1620 

AAAGTCCCCC TGGCCCAAGC TGGCTGCCTC TTCACAGGTA TCCGACGTGA GAGCCCACGG 1680 

GTGGGGCTGT TGCGGTGTCG TGAGGAGCCA CCTCGCTTGC TGGGAAG C CG CTTCCAGGAG 174 0 

AGGTTCTTTC TGCTGCGTGG CCGCTGCCTG CTGCTGCTCA AGGAGAAGAA AAGCTCTAAA 180 0 

CCAGAACGGG AGTGGCCTTT GGAAGGTGCC AAGGTCTACC TGGGAATCCG CAAGAAGTTA 1860 

AAGCCCCCAA CACCGTGGGG CTTCACATTG ATACTAGAGA AGATGCACCT CTACTTGTCC 1920 

TGCACTGACG AGGATGAAAT GTGGGATTGG ACCACCAGCA TCCTTAAAGC CCAGCACGAT 1980 

GACCAGCAGC CAGTGGTCTT ACGACGCCAT TCCTCCTCTG ACCTTGCCCG TCAGAAGTTT 2040 

GGCACTATGC CTTTGCTGCC TATCCGTGGG GATGACAGTG GAGCCACCCT CCTCTCTGCC 2100 

AATCAGACCC TGCGGCGACT ACACAACCGG AGGACCCTGT CCATGTTCTT TCCAATGAAG 2160 

TCATCCCAGG GGTCTGTGGA GGAGCAAGAG GAGCTGGAGG AGCCTGTGTA CGAGGAGCCA 2220 

GTGTATGAGG AAGTAGGGGC CTTCCCTGAG TTGATCCAGG ACACTTCTAC CTCCTTCTCC 2280 

ACCACACGGG AGTGGACAGT GAAGCCAGAG AACCCCCTCA CCAGCCAGAA GTCATTGGAT 2340 

CAACCCTTTC TCTCCAAGTC AAGCACCCTT GGCCAGGAGG AGAGGCCACC TGAGCCCCCT 2400 

CCAGGCCCCC CTTCAAAGAG CAGTCCCCAG GCACGGGGGT CCCTAGAGGA ACAGCTGCTC 2460 

CAGGAGCTCA GCAGCCTCAT CCTGAGGAAA GGAGAGACCA CTGCAGGCCT GGGAAGTCCT 2520 

TCCCAGCCAT CCAGCCCCCA ATCCCCCAGC CCCACTGGCC TTCCAACACA GACACCTGGC 2580 

TTCCCCACCC AACCCCCATG CACTTCCAGT CCACCCTCCA GCCAGCCCCT CACATGACCC 2640 

TAGGACCAGC AGTCTGAGAG GGTAGGTACC AGAAGACCCA GAAACTCTTA TCGTGGCACT 2700 

GTTGCAGCTT CCTCTGCCCT GGCTGGAAAG ACTCCAGAAT CCAGTGTGGT GCTGTGGAAG 2760 

GAGCACTGGA CTAAAGGCTT CAGTGGCTGC GTGTCCCAGG ACAGGTCATG GCCCCTCTCT 2820 

GGGCCCAGCC CATTTATCTA TACCATGAGG TAACTGAAGT AAGGAGAGCA GTGAATGTCA 2880 
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WO 02/079492 
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10 



15 



20 



25 



AACTGTGTTT 
TTTATATATT 
ACTGCATTAA 
CCTGTCCCAC 
GGGAGCTGGG 



CTTAGAGCCA 
TGATACGTAG 
AGAAAAAAAA 
TTCCTACATT 
ACTCGGGGAG 



TAAGCCCCAC 
GGGTTCCATG 
TGCTTTGGAA 
CTGGGACTAC 
TCAAAAATAG 



ATATTATCCC 
AGAGATTTTG 
ACCAGAGGCC 
CGTGAAGCCT 
ATGAGTAATT 



TGAACAAGGG 
GGTTTTAAAG 
TGGGTGATGT 
GGAGTAGGGA 
GTCAATAAAC 



CAGCTCCTGC 
GAATGGTTTT 
TAAAGTCTAT 
GAGCGAGTTT 
CTGGGAACC 



Seq ID No: 183 Protein sequence: 
Protein Accession #: AK001579.1 



MSLTHSNASF 
SHPGGSSRDL 
RLEKYKDVIG 
HEVRVLQELI 
NCVTLKVSPT 
WCQLPEPCSA 
RGRCLLLLKE 
EMWDWTTSIL 
RLHNRRTLSM 
TVKPENPLTS 
LILRKGETTA 



11 

I 

VSSMTLPLHG 
AQGEVGRGQR 
CLPRVNRRTL 
DGYISVFDID 
LTAEELTNQV 
SLIiLKKVPLA 
KKSSKPEREW 
KAQHDDQQPV 
FFPMKSSQGS 
QKSLDQPFLS 
GLGSPSQPSS 



21 
I 

CCLAGGRLLV 
HIHPAFVPKN 
ATLIGHLYRV 
SDQVAQIDLE 
LEMRGTAAGM 
QAGCLFTGIR 
PLEGAKVYLG 
VLRRHSSSDL 
VEEQEELEEP 
KSSTLGQEER 
PQSPSPTGLP 



31 



FLRSLRAKAQ 
PSLCTSCHSF 
QKCAALNQMC 
VSLITTWKDV 
DLWVTFEIRE 
RESPRVGLLR 
IRKKLKPPTP 
ARQKFGTMPL 
VYEEPVYEEV 
PPEPPPGPPS 
TQTPGFPTQP 



41 
I 

PGSLPSPTRI 
FPGPPQPSSI 
TRNLALLFAP 
QLSQAGDLIM 
HGELERPLHP 
CREEPPRIiLG 
WGFTLILEKM 
LPIRGDDSGA 
GAFPELIQDT 
KSSPQARGSL 
PCTSSPPSSQ 



51 

1 

HGLAALRPIT 
PSPELPQKNQ 
SVFQTDGRGE 
EVYIEQQLPD 
KEKVLEQALQ 
SRFQERFFLL 
HLYLSCTDED 
TLLSANQTLR 
STSFSTTREW 
EEQLLQELSS 
PLT 



2940 
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Seq ID NO: 184 DNA sequence 

Nucleic Acid Accession #: none found 

Coding sequence: 1-81 (underlined sequences correspond to start and stop codons) 
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I • 

GTAGAGTTAG 
GTATTAAGCT 
AATCTCAACA 
CTTTCTCTAT 
AACCCTAACT 
CACTGCACCA 
ACTGAAGATG 
AACATGTAAT 
TGTTCAGCTG 
AAGTTTTGTT 
ACAAAGGCAT 
GGATGGCAGT 
TGTGCCTCTT 
CCATTAGCTA 
GTCATGTGCT 
TTGCTTGGTT 
ACTTGGATGC 
TTCTTTTGTC 
CAGGATGGTT 
AGTAAGCTTT 
TGGCTTTTCA 
GCTTATTCTT 
CACCTCCTTG 
GCACTTAAAA 
TGCAGGTTAG 
CACCGTTCTA 
GACACAGTCA 
TTTTGCAGAA 



11 
I 

TGTCAATGTG 
TAAAAAGTTA 



TAAGAAGTCA 
AATTTCATCA 
CTGCTAATTA 
GCTTTGTTAT 
AGAGAACATG 
GAATGTAGTA 
TAACAGAATA 
ACGAATTCAG 
CTTTCCTGAT 
TCCAGCCCTG 
CACTTTAATC 
ATGCTTGCTT 
CAATTAATAT 
GCATTCTTCT 
CTCAGTTGTC 
ATCAGCACCA 
GCAACAACCA 
TGAAAATGTA 
TATTACTCAA 
CCTCTTACTT 
CTGCTTGTCC 
TAGAAAAAAA 
AAGTCATGGA 
TTCAACCCCA 
GCTCTTTCTG 
GAAAAC 



21 

I 

CTTAGAATAT 
ATTCAGTTTA 

AAATGTAATG 
GTATGTCCTC 
TAAGCTAGGC 
CTGTAAAATG 
ATATGTGTAA 
ATAGTAATTA 
CCCAAAATAA 
ACAATCCAGG 
TTCTGCCAGT 
GTTACGCCCA 
ATAGCTCCCA 
ACATGGTCAC 
CCAAGTGTCC 
TAGCATAAGC 
CTTTCATTTA 
TCACTACCAC 
CCATAGGGAC 
GGTCAGATCA 
AAGAAAACCT 
TATCTCTGTA 
TATACTCCTA 
AAAAAAAAAA 
GCTGGGATCT 
TTGCCTAGAG 
AGAAAAGGCA 



31 
I 

ACCAAATTCA 
AGGAATATAA 
CTGCCAGATA 
TCCCTTTTCT 
AAGTAATCTT 
ATGATAATAC 
AGTGCCTTCC 
TTTTATTTTC 
CAGTTTTAAA 
GCTTTTATAG 
CTCAATGCAT 
TATTAGCACA 
CTAGATGCAC 
ACTTAGTTTC 
AATTACTGAG 
CACATTCTTT 
GAAATGCTCC 
TGCCTTCTTC 
TTTTTGCTTC 
TGTCTCTCTC 
AAAACTTTGC 
TTGCTCTTCC 
AAAGAAGTTC 
AGCTCAGAGA 
AAATCCATGT 
GTGCTTGATT 
GCTCAGCATT 



41 
I 

TAAACATTTT 
ACCAAATTAT 
ACAATATCAA 
CCTATTTGTC 
GGACAAGTTA 
CAACACCTTC 
ACAATACCCA 
TTTTGATTCA 
CAAATTAAAG 
ATGCACCAGG 
GGGTTGCAAT 
CAGAAAGAAA 
CCACTACTTC 
CAGAGAGACA 
AAAAAAAGAA 
TTATGAAGTT 
TTGGACATCC 
AAAGCCACCA 
TACTTCCACA 
TTCTCTTCAA 
TGTGAGATCT 
TCACTCTACT 
AGTCTTCCCT 
GGCTGAGTTG 
CAGTCTGACT 
GCTCAATAAT 
TCCATGAGAT 



51 
I 

CTCTAAAAAA 60 

TTTATATTTG 120 

AGGTATTTTT 180 

AAATTTTAGC 240 

TTTGACCTCT 300 

TTCTTGGGGT 360 

GAACATAGCA 420 

GTTGGGACTA 4 80 

TTTTGTTGTG 540 

ATCAGCAGGT 600 

CCAGAGTCCA 6 60 

GAGAAAGGGA 720 

TGCTGATACT 780 

TGTCTGGACA 840 

ACTAGCACCT 900 

GTCCTCAGTT 960 

TGAATCTGAC 1020 

CGTTCTGTCC 1080 

CAATAGCCAG 1140 

AACCCTCCGA 1200 

ATGTGACCCG 1260 

CCAGCCATCC 1320 

TATGATATTT 13 80 

TCCAAGGTCA 1440 

ATGAGTTCTG 1500 

AGATTCCATG 1560 

CCGCACATCC 1620 



Seq ID No: 185 Protein sequence: 
Protein Accession #: none found 



65 



1 11 21 

I I I 

VELVSMCLEY TKFINIFSKK VLSLKS 



31 



41 



51 



70 



75 



Seq ID NO: 186 DNA sequence 

Nucleic Acid Accession #: NM_002203.2 

Coding sequence: 43-3588 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

I I I I I I 

CTGCAAACCC AGCG CAACTA CGGTCCCCCG GTCAGACCCA GGATGGGGCC AGAACGGACA 

GGGGCCGCGC CGCTGCCGCT GCTGCTGGTG TTAGCGCTCA GTCAAGGCAT TTTAAATTGT 



60 
120 



266 



WO 02/079492 



PCT/US02/04915 



TGTTTGGCCT ACAATGTTGG TCTCCCAGAA 
CAGTTTGGGT ATGCAGTGCA GCAGTTTATA 
TCACCCTGGA GTGGCTTTCC TGAGAACCGA 
CTATCCACTG CCACATGTGA AAAACTAAAT 
5 ACTGAGATGA AAACCAACAT GAGCCTCGGC 
GGTTTTCTCA CATGTGGTCC TCTGTGGGCA 
GGTGTGTGTT CTGACATCAG TCCTGATTTT 
CAGCCCTGCC CTTCCCTCAT AGATGTTGTG 
CCTTGGGATG CAGTAAAGAA TTTTTTGGAA 

10 ACAAAGACAC AGGTGGGGTT AATTCAGTAT 
AACACATATA AAACCAAAGA AGAAATGATT 
GGGGACCTCA CAAACACATT CGGAGCAATT 
GCTTCTGGTG GGCGACGAAG TGCTACGAAA 
CATGATGGTT CAATGTTGAA AGCTGTGATT 

15 TTTGGCATAG CAGTTCTTGG GTACTTAAAC 
AAAGAAATAA AAGCGATCGC TAGTATTCCA 
GAAGCAGCTC TACTAGAAAA GGCTGGGACA 
ACTGTTCAAG GAGGAGACAA CTTTCAGATG 
TACTCTTCTC AAAATGATAT TCTGATGCTG 

20 ACCATTGTCC AGAAGACATC TCATGGCCAT 
ATTCTGCAGG ACAGAAATCA CAGTTCATAT 
GGAGAAAGCA CTCACTTTGT TGCTGGTGCT 
CTATATAGTG TGAATGAGAA TGGCAATATC 
ATTGGCTCCT ATTTTGGTAG TGTGCTGTGT 

25 GACGTGCTCT TGGTAGGTGC ACCAATGTAC 
GTCTACCTGT TTACTATCAA AAAGGGCATT 
GAGGGCATTG AAAACACTCG ATTTGGTTCA 
GATGGCTTTA ATGATGTGAT TGTTGGTTCA 
TACATTTACA ATGGTCATCA GGGCACTATC 

30 TCCGATGGAG CCTTTAGGAG CCATCTCCAG 
GATTTAAATG GGGATTCCAT CACCGATGTG 
CTCTGGTCAC AAAGTATTGC TGATGTAGCT 
ACTTTGGTCA ACAAGAATGC TCAGATAATT 
CCTACTAAGC AAAACAATCA AGTGGC CATT 

35 TTTTCATCCA GAGTAACCTC CAGGGGGTTA 
AAGAATATGG TAGTAAATCA AGCACAGAGT 
CCCTCTGATG TTGTCAACTC TTTGGATTTG 
ACTAGCCCTG CCCTTGAAGC CTATTCTGAG 
AAAGACTGTG GTGAGGATGG ACTTTG CATT 

40 CCAGCTGCTC AAGAACAACC CTTTATTGTC 
GTAACACTGA AAAATAAAAG GGAAAGTGCA 
GAAAACTTGT TTTTTGCATC ATTCTCCCTA 
GTGGCTGCAT CTCAGAAGTC TGTTGCCTGC 
CAACAGGTGA CTTTTACTAT TAACTTTGAC 

45 TCTCTCAGTT TCCAAGCCTT AAGTGAAAGC 
AACCTCAAAA TTCCTCTCCT GTATGATGCT 
AATTTTTATG AAATCTCTTC GGATGGGAAT 
GTTGGTCCAA AATTCATCTT CTCCCTGAAG 
GCAACTGTAA TCATCCACAT CCCTCAGTAT 

50 ACTGGGGTGC AAACAGACAA GGCTGGTGAC 
AAAATAGGAC AAACATCTTC TTCTGTATCT 
GAATTGAACT GCAGAACTGC TTCCTGTAGT 
ATGAAAGGAG AATACTTTGT TAATGTGACT 
TCAACGTTCC AGACAGTACA GCTAACGGCA 

55 ATATATGTGA TTGAAGATAA CACTGTTACG 
AAAGCCGAAG TACCAACAGG AGTTATAATA 
TTAGCTCTGG TTGCAATTTT ATGGAAGCTC 
ACCAAAAATC CAGATGAGAT TGATGAGACC 
ACCTGCAGTG GGAACCGGCA GCATCCCAGC 

60 TTTTTAAATC CCATATTTTT TTTATCATGT 
AAAACTGCAG GTCAGTTTGG ATGAAGAAAT 
GTAGGGAAAT AATAGGGAAA ATACCTATTT 
AACTGGCTGG CCCAGAGTTT ACATTCTAAT 
CAAGCATGAC AACTTTTAAA GAAAAATATG 

65 TCTCTTTAAA ATATTTGTCT TTAAACAGCA 
GTACTTCCAC TTGTGTATAT TTTAATGAAT 
ACAGGTTTTT TCAATTTATG CTGCTCATCC 
TAATTTTATT TATAAACTAG GTAAAATT'TG 
CTTCCACACC CCATCTTGCT CTAATGATCA 

70 TACCTCCTAT ATGTCCATTT AAGTTAGGAG 
TTTTGTTTAA AACTCAGAAT ATAACATTTA 
TGTGCCAGAG GAAGGAAAAG GAGGAAATTT 
CTTCTAGGAT TTGTTTGGCT GACTGGCAGT 
TTCTTTGGCA ACCTTCCTCC TCCCTTACTG 

75 TATTATAGAA GCCCTCTACA GCCTGACTTT 
TTACCCCTCA TCCAAAGTTC CCACTCCTTC 



GCAAAAATAT TTTCCGGTCC TTCAAGTGAA 180 

AATCCAAAAG GCAACTGGTT ACTGGTTGGT 240 

ATGGGAGATG TGTATAAATG TCCTGTTGAC 300 

TTGCAAACTT CAACAAGCAT TCCAAATGTT 360 

TTGATCCTCA CCAGGAACAT GGGAACTGGA 420 

CAGCAATGTG GGAATCAGTA TTACACAACG 480 

CAGCTCTCAG CCAGCTTCTC ACCTGCAACT 540 

GTTGTGTGTG ATGAATCAAA TAGTATTTAT 600 

AAATTTGTAC AAGGCCTTGA TATAGGCCCC 660 

GCCAATAATC CAAGAGTTGT GTTTAACTTG 720 

GTAGCAACAT CCCAGACATC CCAATATGGT 780 

CAATATGCAA GAAAATATGC CTATTCAGCA 84 0 

GTAATGGTAG TTGTAACTGA CGGTGAATCA 900 

GATCAATGCA ACCATGACAA TATACTGAGG 960 

AGAAACGCCC TTGATACTAA AAATTTAATA 1020 

ACAGAAAGAT ACTTTTTCAA TGTGTCTGAT 1080 

TTAGGAGAAC AAATTTTCAG CATTGAAGGT 114 0 

GAAATGTCAC AAGTGGGATT CAGTGCAGAT 1200 

GGTGCAGTGG GAGCTTTTGG CTGGAGTGGG 1260 

TTGATCTTTC CTAAACAAGC CTTTGACCAA 1320 

TTAGGTTACT CTGTGGCTGC AATTTCTACT 1380 

CCTCGGGCAA ATTATACCGG CCAGATAGTG 144 0 

ACGGTTATTC AGGCTCACCG AGGTGACCAG 1500 

TCAGTTGATG TGGATAAAGA CACCATTACA 1560 

ATGAGTGACC TAAAGAAAGA GGAAGGAAGA 1620 

TTGGGTCAGC ACCAATTTCT TGAAGGCCCC 1680 

GCAATTGCAG CTCTTTCAGA CATCAACATG 174 0 

CCACTAGAAA ATCAGAATTC TGGAGCTGTA 1800 

CGCACAAAGT ATTCCCAGAA AATCTTGGGA 1860 

TACTTTGGGA GGTCCTTGGA TGGCTATGGA 1920 

TCTATTGGTG CCTTTGGACA AGTGGTTCAA 1980 

ATAGAAGCTT CATTCACACC AGAAAAAATC 204 0 

CTCAAACTCT GCTTCAGTGC AAAGTTCAGA 2100 

GTATATAACA TCACACTTGA TGCAGATGGA 2160 

TTTAAAGAAA ACAATGAAAG GTGCCTGCAG 222 0 

TGCCCCGAGC ACATCATTTA TATACAGGAG 22 80 

CGTGTGGACA TCAGTCTGGA AAACCCTGGC 2340 

ACTGCCAAGG TCTTCAGTAT TCCTTTCCAC 2400 

TCTGATCTAG TCCTAGATGT CCGACAAATA 2460 

AGCAACCAAA ACAAAAGGTT AACATTTTCA 2520 

TACAACACTG GAATTGTTGT TGATTTTTCA 25 80 

CCGGTTGATG GGACAGAAGT AACATGC C AG 264 0 

GATGTAGGCT ACCCTGCTTT AAAGAGAGAA 2700 

TTCAATCTTC AAAACCTTCA GAATCAGGCG 2760 

CAAGAAGAAA ACAAGGCTGA TAATTTGGTC 2820 

G AAATT C ACT TAACAAGATC TACCAACATA 28 80 

GTTCCTTCAA TCGTGCACAG TTTTGAAGAT 294 0 

GTAACAACAG GAAGTGTTCC AGTAAGCATG 3000 

AC C AAAG AAA AGAACCCACT GATGTACCTA 3060 

ATCAGTTGTA ATGCAGATAT CAATCCACTG 3120 

TTCAAAAGTG AAAATTTCAG GCACACCAAA 3180 

AATGTTACCT GCTGGTTGAA AGACGTTCAC 324 0 

AC C AG AATTT GGAACGGGAC TTTCGCATCA 3300 

G C TG CAG AAA TCAACACCTA TAACCCTGAG 3360 

ATTCCCCTGA TGATAATGAA ACCTGATGAG 34 20 

GGAAGTATAA TTGCTGGAAT CCTTTTGCTG 3480 

GG CTTCTTCA AAAGAAAATA TGAAAAGATG 354 0 

ACAGAGCTCA GTAGCTGAAC CAGCAGACCT 3600 

CAGGGTTTGC TGTTTGCGTG CATGGATTTC 3660 

CGTAGGTAAA CTAACCTGGT ATTTTAAGAG 3720 

TGTGGGGGGT GGGGGAGGTG CGGGGGGCAG 3780 

TATATGATGG GGGAAAAAAA GTAATCTTTA 384 0 

TTGCATTGTG TCAGAAACAT GAAATGCTTC 3900 

ATACTCTCAG ATTTTAAGGG GGAAAACTGT 3960 

ACTACAGAAG TGGAAGTGCT TGATATGTAA 4020 

ATTGATGTTA ACAAGAGGGG AAAACAAAAC 4080 

AAAGTTGCCA CAGATGATAC TTCCAAGTGA 4140 

TTGTTGGTTC CTTTTATACC ACGGCTGCCC 4200 

AAACATGCTT GAATAACTGA GCTTAGAGTA 4260 

AGGGGGCGAT ATAGAGACTA AGGCACAAAA 432 0 

TGT AAAATC C CATCTGCTAG AAGCCCATCC 43 8 0 

CCTTTCTCTT TTAGGAGGCA CAACAGTTCT 4440 

AACCTAGTGA ATTTTTGAAA GATGAGTAAT 4500 

AACCACTCTC CCACCTCCTG GTGGTACCAT 4560 

CTCTCCAGCG GTCCAAAGTT ATCCCCTCCT 4620 

AGGACAGCTG CTGTGCATTA GATATTAGGG 4680 



267 



WO 02/079492 



PCT/US02/04915 



10 



GGGAAAGTCA 
TTCAGGGAGC 
CTAAATGTTG 
CCAGAAGTTA 
GCTGTCTTGT 
GTTCAAAAGG 
TTTTAACAAG 
ATTTCTACTT 
CTTAGATTAA 
AAACTTCAGA 
TTTAGTTTTA 
CAAGAATTTG 



TCTGTTTAAT 
TATTTTCATT 
GAATGTTATG 
CAGATGAGGC 
TTCTGAAGTA 
TAGATCCTGA 
CACCCCAGTC 
TTTGCACCTT 
AATTCACAGA 
GTCCTCATTA 
AAAGTCTATG 
ACTTGGAAAA 



TTACACACTT 
TAGTGCTAAA 
GGATGTAAAC 
ACTGGAAACC 
CTTTTTCTTC 
GATGATTTGG 
ACTAGGATGC 
ATTTTCTCTG 
CACTACATAT 
TAAAATGGGA 
ATCTGATCTG 
G 



GCATGAATTA 
CAAGTAAGAA 
AATGTAAAGT 
ACCACCAAAT 
CACAAGAGTG 
TCAGATTGGG 
AGATGGACCA 
TTCCTGAGCC 
CTAAAGCTTT 
AGACTGAGCT 
GACTTCCTAT 



CTGTATATAA 
AAATAAGCTA 
AAAACACTCT 
TAGCAGGTGC 
AATTTGACCT 
ATAAGGCCCA 
CACTTTGAGA 
CCCACATTCT 
GACAAGTCCT 
GGAGTTCAGC 
AATACAAATA 



ACTCCTTAAC 4740 

GAGTGAATTT 4800 

CAGGATTTCA 4860 

ACCTTCTGTG 4920 

AGGCAAGTTT 4980 

GCAATCTGCA 5040 

AACACCACCC 5100 

CTAGGAGAAA 5160 

TGACCTCTAT 5220 

AGTGATGCTT 5280 

CACAATCCTC 5340 



Seq ID NO: 187 Protein sequence; 
15 Protein Accession #: NP 002194.1 
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25 



30 



35 



40 



MGPERTGAAP 
NWLLVGSPWS 
RNMGTGGFLT 
ESNSIYPWDA 
QTSQYGGDLT 
HDNILRFGIA 
IFSIEGTVQG 
KQAFDQILQD 
AHRGDQIGSY 
QFLEGPEGIE 
SQKILGSDGA 
FTPEKITLVN 
NERCLQKNMV 
FSIPFHKDCG 
IWDFSENLF 
NLQNQASLSF 
VHSFEDVGPK 
ADINPLKIGQ 
NGTFASSTFQ 
AGILLLLALV 



11 

i 

LPLLLVLALS 
GFPENRMGDV 
CGPLWAQQCG 
VKNFLEKFVQ 
NTFGAIQYAR 
VLGYLNRNAL 
GDNFQMEMSQ 
RNHSSYLGYS 
FGSVLCSVDV 
NTRFGSAIAA 
FRSHLQYFGR 
KNAQIILKLC 
VNQAQSCPEH 
EDGLCISDLV 
FASFSLPVDG 
QALSESQEEN 
FIFSLKVTTG 
TSSSVSFKSE 
TVQLTAAAEI 
AILWKLGFFK 



21 
I 

QGILNCCLAY 
YKCPVDLSTA 
NQYYTTGVCS 
GLDIGPTKTQ 
KYAYSAASGG 
DTKNLIKEIK 
VGFSADYSSQ 
VAAISTGEST 
DKDTITDVLL 
LSDINMDGFN 
SLDGYGDLNG 
FSAKFRPTKQ 
IIYIQEPSDV 
LDVRQIPAAQ 
TEVTCQVAAS 
KADNLVNLKI 
SVPVSMATVI 
NFRHTKELNC 
NTYNPEIYVI 
RKYEKMTKNP 



31 



NVGLPEAKIF 
TCEKLNLQTS 
DISPDFQLSA 
VGLIQYANNP 
RRSATKVMW 
AIASIPTERY 
NDILMLGAVG 
HFVAGAPRAN 
VGAPMYMSDL 
DVIVGSPLEN 
DSITDVSIGA 
NNQVAIVYNI 
VNSLDLRVDI 
EQPFIVSNQN 
QKSVACDVGY 
PLLYDAEIHL 
IHIPQYTKEK 
RTASCSNVTC 
EDNTVTIPLM 
DEIDETTELS 



41 
I 

SGPSSEQFGY 
TSIPNVTEMK 
SFSPATQPCP 
RWFNLNTYK 
VTDGESHDGS 
FFNVSDEAAL 
AFGWSGTIVQ 
YTGQIVLYSV 
KKEEGRVYLF 
QNSGAVYIYN 
FGQWQLWSQ 
TLDADGFSSR 
SLENPGTSPA 
KRLTFSVTLK 
PALKREQQVT 
TRSTNINFYE 
NPLMYLTGVQ 
WLKDVHMKGE 
IMKPDEKAEV 
S 



51 

I 

AVQQFINPKG 60 

TNMSLGLILT 120 

SLIDWWCD 180 

TKEEMIVATS 240 

MLKAVIDQCN 300 

LEKAGTLGEQ 360 

KTSHGHLIFP 420 

NENGNITVIQ 4 80 

TIKKGILGQH 540 

GHQGTIRTKY 600 

SIADVAIEAS 660 

VTSRGLFKEN 720 

LEAYSETAKV 780 

NKRESAYNTG 840 

FTINFDFNLQ 900 

ISSDGNVPSI 960 

TDKAGDISCN 1020 

YFVNVTTRIW 1080 

PTGVIIGSII 114 0 



Seq ID NO: 188 DNA sequence 

Nucleic Acid Accession #: NM_002210.1 

Coding sequence: 42-3188 (underlined sequences correspond to start and stop codons) 



45 | 



11 



21 



31 



41 



51 



I 



I 



I 



GGCTACCGCT 
GACGGCTGCG 
TGTGCCGCGC 
GTTACTTCGG 

50 TCGTGGGAGC 
TCAAATGTGA 
ATAGAGATTA 
CTGTGAGGTC 
AGATGAAACA 

55 TTGAGTATGC 
GAGGATTCAG 
TTTATTGGCA 
CCAATGTTTA 
TTGATGACAG 

60 ATGACTTTGT 
ATGGGAAGAA 
GATTTTCTGT 
CACCTCTCTT 
TGTCTCTACA 

65 TTGCACGGTT 
ATATTGCAAT 
ATGGAAGATC 
CTCGAAGCAT 
ATGGATATCC 

70 CCAGACCAGT 

ACAATAAAAC 
TCTGCTTAAA 

TTCTTTTGGA 
GGTCCCCAAG 
75 AATTGATAGC 
TTTTTATGGA 



CCCGGCTTGG 
CCTCGGTCCC 
CTTCAACCTA 
CTTCGCCGTG 
TCCCAAAGCA 
CTGGTCTTCT 
TGCCAAGGAT 
GAAACAGGAT 
GGAGCGAGAG 
TCCATGTAGA 
CATTGATTTT 
AGGTCAGCTT 
CAGCATCAAG 
CTATTTGGGT 
TTCAGGAGTT 
CATGTCCTCC 
AGCTGCCACT 
CATGGATCGT 
GAGAGCTTCA 
TGGCAGTGCC 
TGCTGCTCCA 
AACAGGCTTG 
GCCACCAAGC 
AGACTTAATT 
TATCACTGTA 
CTGCTCACTG 
GGCAGATGGC 
TAAACTCAAG 
TCACTCCAAG 
GTATCTGCGG 
ATATCGGTTG 



CGTCCCGCGC 
CGCGGCCTCC 
GACGTGGACA 
GATTTCTTCG 
AACACCACCC 
ACCCGCCGGT 
GATCCATTGG 
AAAATTTTGG 
CCTGTTGGAA 
TCACAAGATA 
ACTAAAGCTG 
ATTTCGGATC 
TATAATAACC 
TATTCTGTGG 
CCAAGAGCAG 
TTATACAATT 
GACATTAATG 
GGCTCTGATG 
GGAGACTTCC 
ATAGCTCCTT 
TATGGGGGTG 
AACGCAGTCC 
TTTGGCTATT 
GTAGGAGCTT 
AATGCTGGTC 
CCTGGAACAG 
AAAGGAGTAC 
CAAAAGGGAG 
AACATGACTA 
GATGAATCTG 
GATTATAGAA 



GCACTTCGGC 
CGCTTCTTCT 
GTCCTGCCGA 
TGCCCAGCGC 
AGCCTGGGAT 
GCCAGCCAAT 
AATTTAAGTC 
CCTGTGCCCC 
CATGCTTTCT 
TTGATGCTGA 
ACAGAGTACT 
AAGTGGCAGA 
AATTAGCAAC 
CTGTCGGAGA 
CAAGGACTTT 
TTACTGGCGA 
GAGATGATTA 
GCAAACTCCA 
AGACGACAAA 
TGGGAGATCT 
AAGATAAAAA 
CATCTCAAAT 
CAATGAAAGG 
TTGGTGTAGA 
TTGAAGTGTA 
CTCTCAAAGT 
TTCCCAGGAA 
CAATTCGACG 
TTTCAAGGGG 
AATTTAGAGA 
CAGCTGCTGA 



G ATG GCTTTT 
CTCGGGACTC 
GTACTCTGGC 
GTCTTCCCGG 
TGTGGAAGGA 
TGAATTTGAT 
CCATCAGTGG 
ATTGTACCAT 
TCAAGATGGA 
TGGACAGGGA 
TCTTGGTGGT 
AATCGTATCT 
TCGGACTGCA 
TTTCAATGGT 
GGGAATGGTT 
GCAGATGGCT 
TGCAGATGTG 
AGAGGTGGGG 
GCTGAATGGA 
GGACCAGGAT 
AGGAATTGTT 
CCTTGAAGGG 
AGCCACAGAT 
TCGAGCTATC 
CCCTAGCATT 
TTCCTGTTTT 
ACTTAATTTC 
AGCACTGTTT 
GGGACTGATG 
CAAACTCACT 
TACAACAGGC 



CCGCCGCGGC 
CTGCTACCTC 
CCCGAGGGAA 
ATGTTTCTTC 
GGGCAGGTCC 
GCAACAGGCA 
TTTGGAGCAT 
TGGAGAACTG 
ACAAAGACTG 
TTTTGTCAAG 
CCTGGTAGCT 
AAATACGACC 
CAAGCTATTT 
GATGGCATAG 
TATATTTATG 
GCATATTTCG 
TTTATTGGAG 
CAGGTCTCAG 
TTTGAGGTCT 
GGTTTCAATG 
TATATCTTCA 
CAGTGGGCTG 
ATAGACAAAA 
TTATACAGGG 
TTAAATCAAG 
AATGTTAGGT 
CAGGTGGAAC 
CTCTACAGCA 
CAGTGTGAGG 
CCAATTACTA 
TTGCAACCCA 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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TTCTTAACCA GTTCACGCCT GCTAACATTA GTCGACAGGC TCACATTCTA CTTGACTGTG 1920 

GTGAAGACAA TGTCTGTAAA CCCAAGCTGG AAGTTTCTGT AGATAGTGAT CAAAAGAAGA 1980 

TCTATATTGG GGATGACAAC CCTCTGACAT TGATTGTTAA GGCTCAGAAT CAAGGAGAAG 204 0 

GTGCCTACGA AGCTGAGCTC ATCGTTTCCA TTCCACTGCA GGCTGATTTC ATCGGGGTTG 2100 

5 TCCGAAACAA TGAAGCCTTA GCAAGACTTT CCTGTGCATT TAAGACAGAA AACCAAACTC 2160 

GCCAGGTGGT ATGTGACCTT GGAAACCCAA TGAAGGCTGG AACTCAACTC TTAGCTGGTC 2220 

TTCGTTTCAG TGTGCACCAG CAGTCAGAGA TGGATACTTC TGTGAAATTT GACTTACAAA 2280 

TCCAAAGCTC AAATCTATTT GACAAAGTAA GCCCAGTTGT ATCTCACAAA GTTGATCTTG 2340 

CTGTTTTAGC TGCAGTTGAG ATAAGAGGAG TCTCGAGTCC TGATCATATC TTTCTTCCGA 2400 

10 TTCCAAACTG GGAGCACAAG GAGAACCCTG AGACTGAAGA AGATGTTGGG CCAGTTGTTC 2460 

AGCACATCTA TGAG CTGAGA AACAATGGTC CAAGTTCATT CAGCAAGGCA ATGCTCCATC 2520 

TTCAGTGGCC TTACAAATAT AATAATAACA CTCTGTTGTA TATCCTTCAT TATGATATTG 2580 

ATGGACCAAT GAACTG CACT TCAGATATGG AGATCAACCC TTTGAGAATT AAGATCTCAT 2640 

CTTTGCAAAC AACTGAAAAG AATGACACGG TTGCCGGGCA AGGTGAGCGG GACCATCTCA 2700 

15 TCACTAAGCG GGATCTTGCC CTCAGTGAAG GAGATATTCA CACTTTGGGT TGTGGAGTTG 2760 

CTCAGTGCTT GAAGATTGTC TGCCAAGTTG GGAGATTAGA CAGAGGAAAG AGTGCAATCT 2820 

TGTACGTAAA GTCATTACTG TGGACTGAGA CTTTTATGAA TAAAGAAAAT CAGAATCATT 2880 

CCTATTCTCT GAAGTCGTCT GCTTCATTTA ATGTCATAGA GTTTCCTTAT AAGAATCTTC 2940 

CAATTGAGGA TATCACCAAC TCCACATTGG TTACCACTAA TGTCACCTGG GGCATTCAGC 3000 

20 CAGCGCCCAT GCCTGTGCCT GTGTGGGTGA TCATTTTAGC AGTTCTAGCA GGATTGTTGC 3060 

TACTGGCTGT TTTGGTATTT GTAATGTACA GGATGGGCTT TTTTAAACGG GTCCGGCCAC 3120 

CTCAAGAAGA ACAAGAAAGG GAGCAGCTTC AACCTCATGA AAATGGTGAA GGAAACTCAG 3180 

AAACT TAA CT GCAGTTTTTA AGTTATGCTA CATCTTGACC CACTAGAATT AGCAACTTTA 3240 

TTATAGATTT AAACTTTCTT CATGAGGAGT AAAAATCCAA GGCTTTACTG CTGATAGTGC 3300 

25 TAATTGGCAT TAACCACAAA ATGAGAATTA TATTTGTCAA CCTTCTCCTT ATAAATAAGT 3360 

TCAGACATAC ATTTAATAAC ATAGGGTGAC TTGTGTTTTT AGGTATTTAA ATAATAAAAT 3420 

TTCAAGGGAT AGTTTTTATT CAATGTATAT AAGACAGGTA GTGCCTGATT TACTACTTTA 3480 

TATAAAATAG TACCTCCTTC AGTTACTGTT TCTGATTTAA TGTACGGAAC TTTATTTGTT 3540 

GTTGTTGTTG TTGTTGTTGT TGTTGTTTTA AAGCAGTCCA AATTTGGACC TTAGCAATCA 3600 

30 TGTCTTTTGT ATAGGTACTT AATGTTAATA CATATTACAC TACAGTTTAC TTTTCAGAAT 3660 

ACTAAAGACT TTATAACTGC ATGAACTTGG ATTTTTTTAA TCACTCATAT GGTAGAATTT 3720 

TATAAACACA TACATGATAC CATCCAAATT CTTGCTTTTA ATAACAAAGG TACAATATTT 3780 

TGTTTTAGTA TGAAAATCTG GTAGATCCTA TTACACTTCT GTTTATATTA AATCCACAAT 384 0 

ATTTTATTAC ATTTTTAACT TGTATAAATT TTAGGTCAAA TCCTTCAAGC CAACCTATAC 3900 

35 TAAAAATTAG TTCCATAATC ACAAATGGCT CTTTTGTGTA ATTGTTTAAT TTCACCTGAA 3960 

TATCATAATG CTTAAAGCCA TATGGAGTTG GAAATTATTT CCAAAGCATA TTTATTCCAT 4020 

TGTTTTAGTC TGGCTATTTA CAGTATAAAA AAAGCATTTT ATTAAAATAC TGTGTAGTTC 4080 

TTTGAGATAG TTGCTTATGC ATATAGTAAG TATTACATTC TTAGAGTAGA GCAGAGTTTT 414 0 

TAGTTAGTAT TAATTTATTT TCCTCCATTC ATGTACTTTT CCTTATATTT CCAAAACTGT 4 200 

40 TACTGAGAAT GGGTCAAGAT CAGTGAGAAA TCTTTACAGT TGACAGGAAC CTGGACCCCT 42 60 

TACCCCAACT TTATGAGTAA TGCTTGGAAT AAAAAACTCT TAAGGCAACT CACTGATTTA 4320 

CTTCTAGCAA TAGCATGATG TTACAGGAAT ATTACCTCTG TTTAAGCAAG GTAATGTGTA 4380 

AAATCAGTCT CGGCTGTCAG AATAACTTCT AAAAGGTATT TTTATAAGCA GTTCAAGTTA 444 0 

CTGAAAACCT TTTAAACCTT TCTGAAGTTC GTTAGTATAA ATTACTTTTC TAGGATTATT 4 500 

45 AATAAAAGCC ACATAGGTGG CAAGTTGTAG TTTTATATGG CTCTGTAGAG TGGTGAACCT 4 560 

TCTAGAGGAA TATATGATTT ATTCACAGTT CCTCAAGGCC TGGGGATGAT GATCAGTTAT 4 620 

ACCTATTTTT GTGCAATTAC ATCATGTTGT ACATTAGAAA TGGAGAGTTT AATAGCTCTT 4 680 

TAACTGCTGT CCTCATTAGG TAATGATAAA TATTTCCCTT AAATAATTGA CTATTTTGCT 4 740 

GTGTTTTAAA AATGATTGAA ATTTATCTTG CCATATCTCA TAATTTCATG CACAAGTTGA 4 800 

50 CTGAGCTAAT CTTGAGAATA TATTCGTAAA ATAGGAGCAC ATTTAGTTGA GGTATACAAG 4 860 

GTAGGACTCT AGACAAAACC TTCTATTTTA GCTTTAGTGA ATTTCAAAAG TAATGGGTCT 4920 

TGGAGTATAG ATTTTTATTA GTAGCTTGAA AGAGCTTAAT CAT ATG C AGT AAGTATTTTT 4980 

ATTACCAATA AATTTAAAAT TTTTTAAGAA AAATATTTTT ATCCTAGGGC CAAGTGTTGC 504 0 

CTGCCACCAA TCAGTAAGTT AGTCTATAAC AAATTTTACC CTAACAGTTT TACCACCTAG 5100 

55 CAACAGTCAT TTCTGAAAAT ATGTTGGATA GAAAGTCACT CTTTGGCAAA AGTGTTAGAA 5160 

TTTGCTTTTG TGCCATCTAT TCCTTTTATG GCATCTATCT TGAAAGTAAT CTTGTATTGG 5220 

AGATTGAAAG ATGCTGTAAT TTAGAAATTA ACATGATATC TTAAATTACC TTTATGAAAT 5280 

ATAGTTTTGT ATAATAGCAT AGATTTTCCT TCAAAAAATG AACATTTATA TATCTACAAA 5340 

AATATGGAGA AGAGCAATTT GAAAGCCTAC TTTCTGAAGA AAATGGTGGG ATTTTTTTTT 5400 

60 ATCATGATTA AATATCAAAA AATTGCCCTA TGAAAACTTT AAATCTCTAA AACATTTGAA 54 60 

ATACTACCAT ATTTGTGATT TATTGAGAAT AAAAATCCAT TTTGAAATGT AAAATTTTTA 5520 

TGATCTGATT CAGTTTTAAG AAAACATGAA TGAACTAGAA GATATTAAAA ACATTTGACA 5580 

TTGGTAAGAA ATATTGATAC TGATATTGAT TTTTATATAG GTATTTATTT CAGAATTGAT 5640 

ATTTTGAGAA AAATACATGT GAGTCATTTT TTCTGTTTCT CTTTTCTCTT AACGATTATC 5700 
65 ACTGTAATTC TGAATCT 

Seq 10 N° : 18^ Protein sequence; 
Protein Accession #: NP_002201.1 

70 1 11 21 31 41 51 

| I I I I I 

MAFPPRRRLR LGPRGLPLLL SGLLLPLCRA FNLDVDSPAE YSGPEGSYFG FAVDFFVPSA 60 

SSRMFLLVGA PKANTTQPGI VEGGQVL.KCD WSSTRRCQPI EFDATGNRDY AKDDPLEFKS 120 

HQWFGASVRS KQDKILACAP LYHWRTEMKQ EREPVGTCFL QDGTKTVEYA PCRSQDIDAD 180 

75 GQGFCQGGFS IDFTKADRVL LGGPGSFYWQ GQLISDQVAE IVSKYDPNVY SIKYNNQLAT 240 

RTAQAIFDDS YLGYSVAVGD FNGDGIDDFV SGVPRAARTL GMVYIYDGKN MSSLYNFTGE 300 
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QMAAYFGFSV AATDINGDDY ADVFIGAPLF MDRGSDGKLQ EVGQVSVSLQ RASGDFQTTK 360 

LNGFEVFARF GSAIAPLGDL DQDGFNDIAI AAPYGGEDKK GIVYIFNGRS TGLNAVPSQI 420 

LEGQWAARSM PPSFGYSMKG ATDIDKNGYP DLIVGAFGVD RAILYRARPV ITVNAGLEVY 480 

PSILNQDNKT CSLPGTALKV SCFNVRFCLK ADGKGVkPRK LNFQVELLLD KLKQKGAIRR 540 

5 ALFLYSRSPS HSKNMTISRG GLMQCEELIA YLRDESEFRD KLTPITIFME YRLDYRTAAD 600 

TTGLQPILNQ FTPANISRQA HILJjDCGEDN VCKPKLEVSV DSDQKKIYIG DDNPLTI*IVK 660 

AQNQGEGAYE AELIVSIPLQ ADFIGWRNN EALARLSCAF KTENQTRQW CDLGNPMKAG 720 

TQLLAGLRFS VHQQSEMDTS VKFDLQIQSS NLFDKVSPW SHKVDLAVLA AVEIRGVSSP 780 

DHIFLPIPNW EHKENPETEE DVGPWQHIY ELRNNGPSSF SKAMIiHLQWP YKYNNNTLfcY 840 

10 ILHYDIDGPM NCTSDMEINP LRIKISSLQT TEKNDTVAGQ GERDHLITKR DLALSEGDIH 900 

TLGCGVAQCL KIVCQVGRLD RGKSAILYVK SLLWTETFMN KENQNHSYSL KSSASFNVIE 960 

FPYKNLPIED ITNSTLVTTN VTWGIQPAPM PVPVWVIIIA VLAGIiLLLAV LVFVMYRMGF 1020 
FKRVRPPQEE QEREQLQPHE NGEGNSET 

15 Seq ID NO: 190 DNA sequence 

Nucleic Acid Accession ft: NM_004864 

Coding sequence: 26-952 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

20 | | | | | | 

CGGAACGAGG GCAACCTGCA CAGCCATGCC CGGGCAAGAA CTCAGGACGG TGAATGGCTC 60 

TCAGATGCTC CTGGTGTTGC TGGTGCTCTC GTGGCTGCCG CATGGGGGCG CCCTGTCTCT 120 

GGCCGAGGCG AGCCGCGCAA GTTTCCCGGG ACCCTCAGAG TTGCACTCCG AAGACTCCAG 180 

ATTCCGAGAG TTGCGGAAAC GCTACGAGGA CCTGCTAACC AGGCTGCGGG CCAACCAGAG 240 

25 CTGGGAAGAT TCGAACACCG ACCTCGTCCC GGCCCCTGCA GTCCGGATAC TCACGCCAGA 300 

AGTGCGGCTG GGATCCGGCG GCCACCTGCA CCTGCGTATC TCTCGGGCCG CCCTTCCCGA 360 

GGGGCTCCCC GAGGCCTCCC GCCTTCACCG GGCTCTGTTC CGGCTGTCCC CGACGGCGTC 420 

AAGGTCGTGG GACGTGACAC GACCGCTGCG GCGTCAGCTC AGCCTTGCAA GACCCCAAGC 480 

GCCCGCGCTG CACCTGCGAC TGTCGCCGCC GCCGTCGCAG TCGGACCAAC TGCTGGCAGA 540 

30 ATCTTCGTCC GCACGGCCCC AGCTGGAGTT GCACTTGCGG CCGCAAGCCG CCAGGGGGCG 600 

CCGCAGAGCG CGTGCGCGCA ACGGGGACGA CTGTCCGCTC GGGCCCGGGC GTTGCTGCCG 660 

TCTGCACACG GTCCGCGCGT CGCTGGAAGA CCTGGGCTGG GCCGATTGGG TGCTGTCGCC 720 

ACGGGAGGTG CAAGTGACCA TGTGCATCGG CGCGTGCCCG AGCCAGTTCC GGGCGGCAAA 780 

CATGCACGCG CAGATCAAGA CGAGCCTGCA CCGCCTGAAG CCCGACACGG AGCCAGCGCC 840 

35 CTGCTGCGTG CCCGCCAGCT ACAATCCCAT GGTGCTCATT CAAAAGACCG ACACCGGGGT 900 

GTCGCTCCAG ACCTATGATG ACTTGTTAGC CAAAGACTGC CACTGCAT AT GAG CAGTCCT 960 

GGTCCTTCCA CTGTGCACCT GCGCGGGGGA GGCGACCTCA GTTGTCCTGC CCTGTGGAAT 1020 

GGGCTCAAGG TTCCTGAGAC ACCCGATTCC TGCCCAAACA GCTGTATTTA TATAAGTCTG 1080 

TTATTTATTA TTAATTTATT GGGGTGACCT TCTTGGGGAC TCGGGGGCTG GTCTGATGGA 1140 

40 ACTGTGTATT TATTTAAAAC TCTGGTGATA AAAATAAAGC TGTCTGAACT GTTAAAAAAA 1200 
AAAA 



Seq ID NO: 191 Protein sequence : 
Protein Accession #: NP 004855 

45 

1 11 21 31 41 51 

I ! I I I I 

MPGQELRTVN GSQMLLVLLV LSWLPHGGAL SLAEASRASF PGPSELHSED SRFRELRKRY 60 
EDLLTRLRAN QSWEDSNTDL VPAPAVRILT PEVRLGSGGH LHLRISRAAL PEGLPEASRL 120 
50 HRALFRLSPT ASRSWDVTRP LRRQLSLARP QAPALHLRLS PPPSQSDQLL AESSSARPQL 180 
ELHLRPQAAR GRRRARARNG DDCPLGPGRC CRLHTVRASL EDLGWADWVL SPREVQVTMC 240 
IGACPSQFRA ANMHAQIKTS LHRLKPDTEP APCCVPASYN PMVLIQKTDT GVSLQTYDDL 300 
LAKDCHCI 

55 Seq ID NO: 192 DNA sequence 

Nucleic Acid Accession #: XM_061731.1 

Coding sequence: 1-567 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

60 | | I I I I 

ATGAGAAAAG GAAATGAGGG AGAGAACACA GAAGAGGGCA GGCTTGCTCA GCTTGCTCAA 60 

AGAAAGTTTC TCAAAGAAGA TGGCATTACA TTGCACATCT CTCTGTGTCT CTCTATTGCT 120 

GTAAAAGAAC CTTTCTCTCT GATTGGACTT GACACACAGA AGGATCTCAG TAAAGATTTG 180 

CTGTTGTTGA TGTCCACAGA CACTGGCAAG GACAGGTTTA CCAACATACT GCTGTCACAC 240 

65 TCCCCTCCAA TGTGCACCAA ATCACGTAAA AATGGGGATA ATGACTCCCC TGCCTTCACA 300 

TGGGGTGGCA AAGACACCAG GAGCAATACT GATCTTCCTA TCAGAGACCC TGGGGGCAAG 360 

AGTCTTTCAC TCACCAAACA TTCCCACAAG CCTGTCCCTG AGCATCAGTG TGACCAGAGA 420 

GAGGTCTTCC AGCCACTTTC AGAGCCAGGT GTAGAAGCAG AGATGGAAGT GTTCGCTGAT 4 80 

GCTGGATGGT GGATTTATCA GAGCTGTCAG GTTCCTTCCT CAACCCTTGC AAGAAAGAAG 540 

70 ATGGTTTATT CTAAAGAAAC TGAGTGA 



Seq ID NO: 193 Protein sequence: 
Protein Accession #: XP_061731.1 



75 1 11 21 31 41 51 
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MRKGNEGENT EEGRLAQLAQ RKFLKEDGIT LHISLCLSIA VKEPPSLIGL DTQKDLSKDL 60 

LLLMSTDTGK DRFTNILLSH SPPMCTKSRK NGDNDSPAFT WGGKDTRSNT DLPIRDPGGK 120 

SLSLTKHSHK PVPEHQCDQR EVFQPLSEPG VEAEMEVFAD AGWWIYQSCQ VPSSTLARKK 18 0 
MVYSKETE 

Seq ID NO: 194 DNA sequence 

Nucleic Acid Accession #: NM_005415.2 

Coding sequence: 371-2410 (underlined sequences correspond to start and 

1 11 21 31 41 51 

I I I I I I 

GAGCTGTCCC CGGTGCCGCC GACCCGGGCC GTGCCGTGTG CCCGTGGCTC CAGCCGCTGC 60 

CGCCTCGATC TCCTCGTCTC CCGCTCCGCC CTCCCTTTTC CCTGGATGAA CTTGCGTCCT 120 

TTCTCTTCTC CGCCATGGAA TTCTGCTCCG TGCTTTTAGC CCTCCTGAGC CAAAGAAACC 180 

CCAGACAACA GATGCCCATA CGCAGCGTAT AGCAGTAACT CCCCAGCTCG GTTTCTGTGC 240 

CGTAGTTTAC AGTATTTAAT TTTATATAAT ATATATTATT TATTATAGCA TTTTTGATAC 300 

CTCATATTCT GTTTACACAT CTTGAAAGGC GCTCAGTAGT TCTCTTACTA AACAACCACT 360 

ACTCCAGAGA ATGGCAACGC TGATTACCAG TACTACAGCT GCTACCGCCG CTTCTGGTCC 420* 

TTTGGTGGAC TACCTATGGA TGCTCATCCT GGGCTTCATT ATTGCATTTG TCTTGGCATT 480 

CTCCGTGGGA GCCAATGATG TAGCAAATTC TTTTGGTACA GCTGTGGGCT CAGGTGTAGT 540 

GACCCTGAAG CAAGCCTGCA TCCTAGCTAG CAT CTTTGAA ACAGTGGGCT CTGTCTTACT 600 

GGGGGCCAAA GTGAGCGAAA CCATCCGGAA GGGCTTGATT GACGTGGAGA TGTACAACTC 660 

GACTCAAGGG CTACTGATGG CCGGCTCAGT CAGTGCTATG TTTGGTTCTG CTGTGTGGCA 720 

ACTCGTGGCT TCGTTTTTGA AGCTCCCTAT TTCTGGAACC CATTGTATTG TTGGTGCAAC 780 

TATTGGTTTC TCCCTCGTGG CAAAGGGGCA GGAGGGTGTC AAGTGGTCTG AACTGATAAA 840 

AATTGTGATG TCTTGGTTCG TGTCCCCACT GCTTTCTGGA ATTATGTCTG GAATTTTATT 900 

CTTCCTGGTT CGTGCATTCA TCCTCCATAA GGCAGATCCA GTTCCTAATG GTTTGCGAGC 960 

TTTGCCAGTT TTCTATGCCT GCACAGTTGG AATAAACCTC TTTTCCATCA TGTATACTGG 1020 

AGCACCGTTG CTGGGCTTTG ACAAACTTCC TCTGTGGGGT ACCATCCTCA TCTCGGTGGG 1080 

ATGTGCAGTT TTCTGTGCCC TTATCGTCTG GTTCTTTGTA TGTCCCAGGA TGAAGAGAAA 1140 

AATTGAACGA GAAATAAAGT GTAGTCCTTC TGAAAGCCCC TTAATGGAAA AAAAGAATAG 1200 

CTTGAAAGAA GACCATGAAG AAACAAAGTT GTCTGTTGGT GATATTGAAA ACAAGCATCC 1260 

TGTTTCTGAG GTAGGGCCTG CCACTGTGCC CCTCCAGGCT GTGGTGGAGG AGAGAACAGT 1320 

CTCATTCAAA CTTGGAGATT TGGAGGAAGC TCCAGAGAGA GAGAGGCTTC CCAGCGTGGA 1380 

CTTGAAAGAG GAAACCAGCA TAGATAGCAC CGTGAATGGT GCAGTGCAGT TGCCTAATGG 1440 

GAACCTTGTC CAGTTCAGTC AAGCCGTCAG CAACCAAATA AACTCCAGTG GCCACTCCCA 1500 

GTATCACACC GTGCATAAGG ATTCCGGCCT GTACAAAGAG CTACTCCATA AATTACATCT 1560 

TGCCAAGGTG GGAGATTGCA TGGGAGACTC CGGTGACAAA CCCTTAAGGC GCAATAATAG 1620 

CTATACTTCC TATACCATGG CAATATGTGG CATGCCTCTG GATTCATTCC GTGCCAAAGA 1680 

AGGTGAACAG AAGGGCGAAG AAATGGAGAA GCTGACATGG CCTAATGCAG ACTCCAAGAA 1740 

GCGAATTCGA ATGGACAGTT ACACCAGTTA CTGCAATGCT GTGTCTGACC TTCACTCAGC 1800 

ATCTGAGATA GACATGAGTG TCAAGGCAGC GATGGGTCTA GGTGACAGAA AAGGAAGTAA 1860 

TGGCTCTCTA GAAGAATGGT ATGACCAGGA TAAGCCTGAA GTCTCTCTCC TCTTCCAGTT 1920 

CCTGCAGATC CTTACAGCCT GCTTTGGGTC ATTCGCCCAT GGTGGCAATG ACGTAAGCAA 1980 

TGCCATTGGG CCTCTGGTTG CTTTATATTT GGTTTATGAC ACAGGAGATG TTTCTTCAAA 2040 

AGTGGCAACA CCAATATGGC TTCTACTCTA TGGTGGTGTT GGTATCTGTG TTGGTCTGTG 2100 

GGTTTGGGGA AGAAGAGTTA TCCAGACCAT GGGGAAGGAT CTGACACCGA TCACACCCTC 2160 

TAGTGGCTTC AGTATTGAAC TGGCATCTGC CCTCACTGTG GTGATTGCAT CAAATATTGG 2220 

CCTTCCCATC AGTACAACAC ATTGTAAAGT GGGCTCTGTT GTGTCTGTTG GCTGGCTCCG 22 80 

GTCCAAGAAG GCTGTTGACT GGCGTCTCTT TCGTAACATT TTTATGGCCT GGTTTGTCAC 2340 

AGTCCCCATT TCTGGAGTTA TCAGTGCTGC CATCATGGCA ATCTTCAGAT ATGTCATCCT 24 00 

CAGAATGTGA AGCTGTTTGA GATTAAAATT TGTGTCAATG TTTGGGACCA TCTTAGGTAT 24 60 

TCCTGCTCCC CTGAAGAATG ATTACAGTGT TAACAGAAGA CTGACAAGAG TCTTTTTATT 2520 

TGGGAGCAGA GGAGGGAAGT GTTACTTGTG CTATAACTGC TTTTGTGCTA AATATGAATT 2580 

GTCTCAAAAT TAGCTGTGTA AAATAGCCCG GGTTCCACTG GCTCCTGCTG AGGTCCCCTT 264 0 

TCCTTCTGGG CTGTGAATTC CTGTACATAT TTCTCTACTT TTTGTATCAG GCTTCAATTC 2 700 

CATTATGTTT TAATGTTGTC TCTGAAGATG ACTTGTGATT TTTTTTTCTT TTTTTTAAAC 2760 

CATGAAGAGC CGTTTGACAG AGCATGCTCT GCGTTGTTGG TTTCACCAGC TTCTGCCCTC 2820 

ACATGCACAG GGATTTAACA ACAAAAATAT AACTACAACT TCCCTTGTAG TCTCTTATAT 2880 

AAGTAGAGTC CTTGGTACTC TGCCCTCCTG TCAGTAGTGG CAGGATCTAT TGGCATATTC 2940 

GGGAGCTTCT TAGAGGGATG AGGTTCTTTG AACACAGTGA AAATTTAAAT TAGTAACTTT 3000 

TTTGCAAGCA GTTTATTGAC TGTTATTGCT AAGAAGAAGT AAGAAAGAAA AAGCCTGTTG 3060 

GCAATCTTGG TTATTTCTTT AAGATTTCTG GCAGTGTGGG ATGGATGAAT GAAGTGGAAT 3120 

GTGAACTTTG GGCAAGTTAA ATGGGACAGC CTTCCATGTT CATTTGTCTA CCTCTTAACT 3180 
GAATAAAAAA GCCTACAGTT TTTAGAAAAA ACCCGAATTC 

Seq ID NO: 195 Protein sequence: 
Protein Accession #: NP_005406.2 

1 11 21 31 41 51 

| I I I I I 

MATLITSTTA ATAASGPLVD YLWMLILGFI IAFVLAFSVG ANDVANSFGT AVGSGWTLK 60 

QACILASIFE TVGSVLLGAK VSETIRKGLI DVEMYNSTQG LLMAGSVSAM FGSAVWQLVA 120 

SFLKLPISGT HCIVGATIGF SLVAKGQEGV KWSELIKIVM SWFVSPLLSG IMSGILFFLV 180 

RAFILHKADP VPNGLRALPV FYACTVGINL FSIMYTGAPL. LGFDKLPLWG TILISVGCAV 240 

FCALIVWFFV CPRMKRKIER EIKCSPSESP LMEKKNSLKE DHEETKLSVG DIENKHPVSE 300 
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VGPATVPLQA WEERTVSFK LGDLEEAPER ERLPSVDLKE ETSIDSTVNG AVQLPNGNLV 360 

QFSQAVSNQI NSSGHSQYHT VHKDSGLYKE LLHKLHLAKV GDCMGDSGDK PLRRNNSYTS 420 

YTMAICGMPL DSFRAKEGEQ KGEEMEKLTW PNADSKKRIR MDSYTSYCNA VSDLHSASEI 480 

DMSVKAAMGL GDRKGSNGSL EEWYDQDKPE VSLLFQFLQI LTACFGSFAH GGNDVSNAIG 540 

PLVALYLVYD TGDVSSKVAT PIWLLLYGGV GICVGLWVWG RRVIQTMGKD LTPITPSSGF 600 

SIELASALTV VIASNIGLPI STTHCKVGSV VSVGWLRSKK AVDWRLFRNI FMAWFVTVPI 660 
SGVISAAIMA IFRYVIBRM 



Seq ID NO: 196 DNA sequence 

Nucleic Acid Accession #: NM_000020.1 

Coding sequence: 2B3-1794 (underlined sequences correspond to start and stop codons) 



AGGAAACGGT 
AGAAACATTT 
GAGCGAGCCC 
CCAGCGCTGG 
AGGCTAGCGC 
AGGAAAGGCC 
TCTCGGGGCC 
CGGGGGGCCT 
CGGGGCTGCG 
CACTACTGCT 
CAACCTCCTT 
CTGGCCTTGC 
CAGGAGAAGC 
TCTGAGCAGG 
GGCTCAGGGC 
TGTGTGGGAA 
GCCGTCAAGA 
AACACAGTAT 
CGCAACTCGA 
GACTTTCTGC 
GCATGCGGCC 
GCCCACCGCG 
GCCGACCTGG 
AACCCGAGAG 
ACGGACTGCT 
GAGATTGCCC 
GATGTGGTGC 
CAGACCCCCA 
ATGATGCGGG 
AAGACACTAC 
AGCACCTGAT 
CTATCTGGGT 
TGCTCGGCCC 



11 
I 

TTATTAGGAG 
TTGCTCCAGC 
CTCCCCGGCT 
CGGTGCAACT 
CCCGCCACCC 
TTCTGATGCT 
CGCTGGTGAC 
GGTGCACAGT 
GGAACTTGCA 
GCGACAGCCA 
CGGAGCAGCC 
TGGCCCTGGT 
AGCGTGGCCT 
GCGACACGAT 
TCCCCTTCCT 
AAGGCCGCTA 
TCTTCTCCTC 
TGCTCAGACA 
GCACGCAGCT 
AGAGACAGAC 
TGGCGCACCT 
ACTTCAAGAG 
GCCTGGCTGT 
TGGGCACCAA 
TTGAGTCCTA 
GCCGGACCAT 
CCAATGACCC 
CCATCCCTAA 
AGTGCTGGTA 
AAAAAATTAG 
TCCTTTCTGC 
AGAGGTAGTG 
CCAGCCCACC 



21 
I 

GGAGTGGTGG 
CCCCATCCCA 
CCAGCCCGGT 
GCGGCCGCGC 
GCAGAGCGGG 
GCTGATGGCC 
CTGCACGTGT 
AGTGCTGGTG 
CAGGGAGCTC 
CCTCTGCAAC 
GGGAACAGAT 
GGCCCTGGGT 
GCACAGCGAG 
GTTGGGGGAC 
GGTGCAGAGG 
TGGCGAAGTG 
GAGGGATGAA 
CGACAACATC 
GTGGCTCATC 
GCTGGAGCCC 
GCACGTGGAG 
CCGCAATGTG 
GATGCACTCA 
GCGGTACATG 
CAAGTGGACT 
CGTGAATGGC 
CAGCTTTGAG 
CCGGCTGGCT 
CCCAAACCCC 
CAACAGTCCA 
CTGCAGGGGG 
TGAGTGTGGT 
CAGCCAAAAA 



31 



AGCTGGGCCA 
GTCCCGGGAG 
CCGGGGCCGC 
GGTGGAGGGG 
CCCAGAGGGA 
TTGGTGACCC 
GAGAGCCCAC 
CGGGAGGAGG 
TGCAGGGGGC 
CACAACGTGT 
GGCCAGCTGG 
GTCCTGGGCC 
CTGGGAGAGT 
CTCCTGGACA 
ACAGTGGCAC 
TGGCGGGGCT 
CAGTCCTGGT 
CTAGGCTTCA 
ACGCACTACC 
CATCTGGCTC 
ATCTTCGGTA 
CTGGTCAAGA 
CAGGGCAGCG 
GCACCCGAGG 
GACATCTGGG 
ATCGTGGAGG 
GACATGAAGA 
GCAGACCCGG 
TCTGCCCGAC 
GAGAAGCCTA 
CTGGGGGGGT 
GTGTGCTGGG 
TACAGCTGGG 



41 
I 

GGCAGGAAGA 
GCTGCCGCGC 
GCCGGACCCC 
AGGTGGCCCC 
CCATGACCTT 
AGGGAGACCC 
ATTGCAAGGG 
GGAGGCACCC 
GCCCCACCGA 
CCCTGGTGCT 
CCCTGATCCT 
TGTGGCATGT 
CCAGTCTCAT 
GTGACTGCAC 
GGCAGGTTGC 
TGTGGCACGG 
TCCGGGAGAC 
TCGCCTCAGA 
ACGAGCACGG 
TGAGGCTAGC 
CACAGGGCAA 
GCAACCTGCA 
ATTACCTGGA 
TGCTGGACGA 
CCTTTGGCCT 
ACTATAG AC C 
AGGTGGTGTG 
TCCTCTCAGG 
TCACCGCGCT 
AAGTGATTCA 
GGGGGGCAGT 
GATGGGCAGC 
CTGAAACCTG 



51 
I 

CGCTGGAATA 60 

CAGCTGCGCC 120 

AGCCCGCCGT 180 

GGTCCGCCGA 240 

GGGCTCCCCC 300 

TGTGAAGCCG 360 

GCCTACCTGC 420 

CCAGGAACAT 480 

GTTCGTCAAC 540 

GGAGGCCACC 600 

GGGCCCCGTG 660 

CCGACGGAGG 720 

CCTGAAAGCA 780 

CACAGGGAGT 840 

CTTGGTGGAG 900 

TGAGAGTGTG 960 

TGAGATCTAT 1020 

CATGACCTCC 1080 

CTCCCTCTAC 1140 

TGTGTCCGCG 1200 

ACCAGCCATT 1260 

GTGTTGCATC 1320 

CATCGGCAAC 1380 

GCAGATCCGC 1440 

GGTGCTGTGG 1500 

ACCCTTCTAT 1560 

TGTGGATCAG 1620 

CCTAGCTCAG 1680 

GCGGATCAAG 1740 

ATAGCCCAGG 1800 

GGATGGTGCC 1860 

TGCGCCTGCC 1920 



Seq ID NO: 197 Protein sequence: 
Protein Accession ft: NP 000011.1 



1 11 21 31 41 51 

I I I I I I 

MTLGSPRKGL LMLLMALVTQ GDPVKPSRGP LVTCTCESPH CKGPTCRGAW CTWLVREEG 60 

RHPQEHRGCG NLHRELCRGR PTEFVNHYCC DSHLCNHNVS LVLEATQPPS EQPGTDGQLA 120 

LILGPVLALL ALVALGVLGL WHVRRRQEKQ RGLHSELGES SLILKASEQG DTMLGDIiLDS 180 

DCTTGSGSGL PFLVQRTVAR QVALVECVGK GRYGEVWRGL WHGESVAVKI FSSRDEQSWF 240 

RETEIYNTVL LRHDNILGFI ASDMTSRNSS TQLWLITHYH EHGSLYDFLQ RQTLEPHLAL 300 

RLAVSAACGL AHLHVEIFGT QGKPAIAHRD FKSRNVLVKS NLQCCIADLG LAVMHSQGSD 360 

YLDIGNNPRV GTKRYMAPEV LDEQIRTDCF ESYKWTDIWA FGLVLWEIAR RTIVNGIVED 420 

YRPPFYDWP NDPSFEDMKK WCVDQQTPT IPNRLAADPV LSGLAQMMRE CWYPNPSARL 480 
TALRIKKTLQ KISNSPEKPK VIQ 



Seq ID NO: 198 DNA sequence 

Nucleic Acid Accession #: NM_003 199.1 

Coding sequence: 200-2203 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

l I I I I I 

CGGGGGGATC TTGGCTGTGT GTCTGCGGAT CTGTAGTGGC GGCGGCGGCG GCGGCGGCGG 60 

GGAGGCAGCA GGCGCGGGAG CGGGCGCAGG AGCAGGCGGC GGCGGTGGCG GCGGCGGTTA 120 

GACATGAACG CCGCCTCGGC GCCGGCGGTG CACGGAGAGC CCCTTCTCGC GCGCGGGCGG 180 

TTTGTGTGAT TTTGCTAA AA TG CATCACCA ACAGCGAATG GCTGCCTTAG GGACGGACAA 240 

AGAGCTGAGT GATTTACTGG ATTTCAGTGC GATGTTTTCA CCTCCTGTGA GCAGTGGGAA 300 

AAATGGACCA ACTTCTTTGG CAAGTGGACA TTTTACTGGC TCAAATGTAG AAGACAGAAG 360 

TAGCTCAGGG TCCTGGGGGA ATGGAGGACA TCCAAGCCCG TCCAGGAACT ATGGAGATGG 420 



272 



WO 02/079492 



PCT/US02/04915 



GACTCCCTAT 
TTTTGTCAAT 
AGAATCAAAC 
CAACCCAGGA 
5 TAATCCCCGA 
AAAAGTTCCT 
CAATAGGGAC 
CTTCTTCATG 
TCAGCCTGGC 

10 CTGTAGCCTG 
TTCCAGTCTT 
TTCCTGTACG 
AGCCGGCAGC 
AGATCACACT 

15 TCTCTCAGCA 
TTATGAAGGA 
TGATGCTATT 
TCATGGGGAC 
CTCAGGGTAT 

20 TCGTGAAGAT 
TCCACAGCTT 
CAGAGGCATG 
ATCCGATGAC 
AGATGACGAC 

25 ACCAGAGCAG 
TCTGCGGGTC 
CCTCAAGAGT 
CCTCAGTCTG 
AAGAAGGGAG 

30 CCCTGGAATG 
CATTGCTTCA 
AAACACTTCT 
GCAGACGCAA 
AGAAAAAAGT 

35 AGTATGTACA 



GACCACATGA 
TCCAGAATAC 
TTACAGGGTT 
ACCCTTTCGC 
AGGAGGCCTC 
CCAGGTTTGC 
TCGCCAGGCT 
CAAGATGGCC 
TATGCAGGAA 
CATCCACATG 
CCTCCGATGT 
CCTCCTGCCA 
TCCCAGACTG 
AACAACAGCT 
GGCACAGCTG 
CCCTTACACT 
CATGTTCTCC 
ATGCATGGAA 
GGAACCGGCC 
GGCGTGGCCC 
CCTGTCCAGT 
CCACCAGGAC 
GAGGGTGATG 
AAGAAGGATA 
AAGGCAGAGC 
CGTGACATCA 
GACAAGCCCC 
GAGCAG CAAG 
GAAGAGAAGG 
GGAGACGCAT 
TTAAAACAAG 
CCTTAACCCC 
GAGGTTTCAG 
GCAACTTGAG 
GGCTGAGACA 



CCAGCAGGGA 
AAAGTAAAAC 
GCCACCAGCA 
CCACCAAACC 
TTCACAGTAG 
CATCTTCAGT 
ATCCTTCCTC 
ATCACAGCAG 
TGTTGGGCAA 
AACGTTTGAG 
CCACTTTCCA 
ACGGGACAGA 
GAGATGCTCT 
TTTCATCAAA 
TTTGGTCTAG 
CTTTGCAAAG 
GGAACCATGC 
TCATTGGACC 
TTCTTTCAGC 
TGAGAGGCAG 
CTGCGACTTC 
TACAGGGGCA 
AGAACCTGCA 
TCAAATCAAT 
GTGAGAAGGA 
ACGAGGCTTT 
AGACCAAGCT 
TCCGAGAAAG 
TGTCCTCGGA 
CGAATCACAT 
AGACCACTTC 
CATTTTTGTA 
CATTCCCAAT 
GGACGACTTT 
CAGCCCAGAG 



CCTTGGGTCA 
AGAAAGGGGC 
GAGTCTCCTT 
TGGTTCCCAG 
TGCCATGGAG 
CTATGCTCCA 
CAAACCAGCA 
TGACCCTTGG 
CTCTTCTCAT 
CTATCCATCA 
TCGTAGTGGT 
CAGTATAATG 
GGGGAAAGCA 
CCCTTCAACT 
AAATGGAGGA 
CCGAATTGAA 
AGTGGGCCCA 
TTCTCATAAT 
CAACAGACAT 
CCATTCTCTT 
CCCTGACCTG 
GAGTGTCTCC 
AGACACGAAA 
TACTAGCAAT 
GCGGAGGATG 
CAAAGAGCTC 
CCTGATCCTC 
GAATCTGAAT 
GCCTCCCCCT 
GGGACAGATG 
CTTAACAGCT 
ATATAAGACA 
TATCAAAAAA 
CTTTAACATA 
ACTGAACGGC 



CATGACAATC 
TCATACTCAT 
GGAGGTGACA 
TACTATCAGT 
GTACAGACAA 
TCAGCAAGCA 
ACCAGCACTT 
AGCTCCTCCA 
ATTCCACAGT 
CACTCCTCAG 
ACAAACCATT 
GCAAATAGAG 
CTTGCTTCGA 
CCTGTTGGCT 
CAGGCCTCAT 
GATCGTTTAG 
TCCACAGCTA 
GGAGCCATGG 
TCACTCATGG 
CTGCCAAACC 
AACCCACCCC 
TCTGGCAGCT 
TCTTCGGAGG 
AATGACGATG 
GCCAACAATG 
GGCCGCATGG 
CACCAGGCGG 
CCGAAAGCTG 
CTCTCCTTGG 
TAAAAGGGTC 
GTATTATCTT 
AGTCTGAGTA 
CAGAAAAACA 
TCATTCAGAA 



TCTCTCCACC 480 

CTTATGGGAG 540 

TGGATATGGG 600 

ATTCTAGCAA 660 

AGAAAGTTCG 72 0 

CTGCCGACTA 780 

TCCCTAGCTC 84 0 

GTGGGATGAA 900 

CCAGCAGCTA 960 

CAGACATCAA 1020 

ACAGCACCTC 1080 

GAAGCGGGGC 1140 

TCTATTCTCC 1200 

CTCCTCCATC 1260 

CGTCTCCTAA 1320 

AAAGACTGGA 1380 

TGCCTGGTGG 1440 

GTGGTCTGGG 1500 

TGGGGACCCA 1560 

AGGTTCCGGT 1620 

AGGACCCTTA 1680 

CTGAGATCAA 1740 

ACAAGAAATT 1800 

AGGACCTGAC 1860 

CCCGAGAGCG 1920 

TGCAGCTCCA 1980 

TGGCCGTCAT 2040 

CGTGTCTGAA 2100 

CCGGCCCACA 2160 

CAAGTTGCCA 2220 

AAACCCACAT 2280 

GTTATGAATC 2340 

AAAAAAAGAA 2400 

TGTGCAAAGC 2460 



Seq ID NO: 199 Protein sequence; 
Protein Accession #: NP 003190.1 



40 



45 



50 



55 



ii 



I 

MHHQQRMAAL 
NGGHPSPSRN 
CHQQSLLGGD 
PSSVYAPSAS 
MLGNSSHIPQ 
NGTDSIMANR 
VWSRNGGQAS 
IIGPSHNGAM 
SATSPDLNPP 
IKSITSNNDD 
QTKLLILHQA 
SNHMGQM 



GTDKELSDLL 
YGDGTPYDHM 
MDMGNPGTLS 
TADYNRDSPG 
SSSYCSLHPH 
GSGAAGSSQT 
SSPNYEGPLH 
GGLGSGYGTG 
QDPYRGMPPG 
EDLTPEQKAE 
VAVILSLEQQ 



21 
I 

DFSAMFSPPV 
TSRDLGSHDN 
PTKPGSQYYQ 
YPSSKPATST 
ERLSYPSHSS 
GDALGKALAS 
SIiQSRIEDRL 
LLSANRHSLM 
LQGQSVSSGS 
REKERRMANN 
VRERNLNPKA 



31 
I 

SSGKNGPTSL 
LSPPFVNSRI 
YSSNNPRRRP 
FPSSFFMQDG 
ADINSSLPPM 
IYSPDHTNNS 
ERLDDAIHVL 
VGTHREDGVA 
SEIKSDDEGD 
ARERLRVRDI 
ACLKRREEEK 



41 
I 

ASGHFTGSNV 
QSKTERGSYS 
LHSSAMEVQT 
HHSSDPWSSS 
STFHRSGTNH 
FSSNPSTPVG 
RNHAVGPSTA 
LRGSHSLLPN 
ENLQDTKSSE 
NEAFKELGRM 
VSSEPPPLSL 



51 
I 

EDRSSSGSWG 60 

SYGRESNLQG 120 

KKVRKVPPGL 180 

SGMNQPGYAG 240 

YSTSSCTPPA 300 

SPPSLSAGTA 360 

MPGGHGDMHG 420 

QVPVPQLPVQ 4 80 

DKKLDDDKKD 54 0 

VQLHLKSDKP 600 

AGPHPGMGDA 660 



Seq ID NO: 2 00 DNA sequence 

Nucleic Acid Accession #: BC005987 (1-1286), BE888744 (1287-1756) 

Coding sequence: 124-525 (underlined sequences correspond to start and stop codons) 



60 | 



GGCAGAAGAG 
GAGAATTGCA 
CAACTAAAAT 
GAAGACAAAG 

65 AACCTACTGG 
CGTAAAGCTG 
GTCACCTGGG 
ATTTATGTAG 
AGTCCAGAGC 

70 GAAAGAGCGA 
ACCTCTGGAC 
ATTGACCCTC 
CTGGCTCTGA 
TTAGTTGAAG 

75 AAGTTTTATC 
GAATACATAC 



11 
I 

GAAGATTTCT 
CTGCAACCAT 
GCCATTTCAC 
TATTTTACCG 
CCTATCTAAA 
AAGAGTTAAT 
GAAACTATGC 
ACAAGGTGAA 
TTGACTGTGA 
AGGTGTGCTT 
TGGCAATAGC 
TGAGGCAAGC 
AGCTTCATAA 
AAGCCTTGGA 
GAAGAAAAGA 
CAAACAATGC 



21 

I 

GAAGAGTGCA 
GAGTGAGAAC 
CTGGAACTTG 
GACTGAGTTT 
GCACCTCAAA 
CCAGCAAGAG 
CTGGGTCTAC 
ACATGTCTGT 
GGAAGGGTGG 
TGAGAAGGCT 
AAG CTACCGT 
CATTCGGCTG 
GATGCGTGAA 
GAAAGCCCCA 
TGAGCCAGAC 
CTACCTGCAT 



31 
I 

GCTGCCTGAA 
AATAAGAATT 
ATGGAGGGAG 
CAGAATCGTG 
GGGCAAAACG 
CATGCTGACC 
TATCACATGG 
GAGAAGTTTT 
ACACGGTTAA 
CTGGAAAAGA 
CTGGACAACT 
AATCCTGACA 
GAAGGTGAAG 
GGTGTAACAG 
AAAGCGATTG 
TGCCAAATTG 



41 
I 

CCGAGCCCTG 
CCTTGGAGAG 
AAAACTCCTT 
AATTCAAAGC 
AGGCAGCCCT 
AGGCAGAAAT 
GCCGACTCTC 
CCAGTCCCTA 
AGTGTGGARG 
AGCCAAAGAA 
GGCCACCATC 
ACCAGTACCT 
AGGAAGGTGA 
ATGTACTTCG 
AACTGCTTAA 
GGTGCTGCTA 



51 
I 

CCGAACAGCT 60 

CAGCCTACGG 120 

GGATGATTTT 180 

CACAATGTGC 240 

GGAATGCTTA 300 

CAGAAGTCTG 360 

AGACGTTCAG 420 

TAGAATTGAG 480 

AAACCAAAAT 540 

CCCAGAATTC 600 

TCAGAACGCC 660 

TAAAGTCCTC 720 

AGGAGAGAAG 780 

CAGTGCAGCC 840 

AAAGGCTTTA 900 

TAGGGCAAAA 960 
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GTCTTCCAAG TAATGAATCT AAGAGAGAAT GGAATGTATG GGAAAAGAAA GTTACTGGAA 1020 

CTAATAGGAC ACGCTGTGGC TCATCTGAAG AAAGCTGATG AGGCCAATGA TAATCTCTTC 1080 

CGTGTCTGTT CCATTCTTGC CAGCCTCCAT GCTCTAGCAG ATCAGTATGA AGAAGCAGAG 1140 

TATTACTTCC AAAAGGAATT CAGTAAAGAG CTTACTCCTG TAGCGAAACA ACTGCTCCAT 1200 

5 CTGCGGTATG GCAACTTTCA GCTGTACCAA ATGAAGTGTG AAGACAAGGC CATCCACCAC 1260 

TTTATAGAGG GTGTAAAAAT AAACCAGAAA TCAAGGGAGA AAGAAAAGAT GAAAGACAAA 1320 

CTGCAAAAAA TTGCCAAAAT GCGACTTTCT AAAAATGGAG CAGATTCTGA GGCTTTGCAT 1380 

GTCTTGGCAT TCCTTCAGGA GCTGAATGAA AAAATGCAAC AAGCAGATGA AGACTCTGAG 1440 

AGGGGTTTGG AGTCTGGAAG CCTCATCCCT TCAGCATCAA GCTGGAATGG GGAATGAAGA 1500 

10 ATAGAGATGT GGTGCCCACT AGGCTACTGC TGAAAGGGAG CTGAAATTCC TCCACAAGTT 1560 

GGTATTCAAA ATATGTAATG ACTGGTATGG CAAAAGATTG GACTAAGACA CTGGCCATAC 1620 

CACTGGACAG GGTTATGTTA AACCTGAATT GCTGGGTCTT AAAAGAGCCC AAGGAGTTCT 1680 

GGGAGAGGGA CAGATTGGGG GGTCGTCCAG GGCTGCGCTA AATTATTCTC AATGATTTGT 1740 
CTCTTTGCGG AACTTC 

15 

Seq ID NO: 201 Protein sequence: 
Protein Accession #: AAA59191 

1 11 21 31 41 51 

20 | | | | | | 

MSENNKNSLE SSLRQLKCHF TWNLMEGENS LDDFBDKVFY RTEFQNREFK ATMCNIjLAYL 60 

KHLKGQNEAA LECLRKAEEL IQQEHADQAE IRSLVTWGNY AWVYYHMGRL SDVQIYVDKV 120 

KHVCEKFSSP YRIESPELDC EEGWTRLKCG GNQNERAKVC FEKALEKKPK NPEFTSGLiAI 180 

ASYRLDNWPP SQNAIDPLRQ AIRLNPDNQY LKVLLALKLH KMREEGEEEG EGEKLVEEAL 240 

25 EKAPGVTDVL RSAAKFYRRK DEPDKAIELL KKALEYIPNN AYLHCQIGCC YRAKVFQVMN 300 

LRENGMYGKR KLLELIGHAV AHLKKADEAN DNLFRVCSIL ASLHAIiADQY EDAEYYFQKE 360 

FSKELTPVAK QLLHLRYGNF QLYQMKCEDK AIHHFIEGVK INQKSREKEK MKDKLQKIAK 420 
MRLSKNGADS EAI1HVLAFI1Q ELNEKMQQAD EDSERGLESG SLIPSASSWN GE 

30 

Seq ID NO: 202 DNA sequence 
Nucleic Acid Accession #: NM_003090 

Coding sequence: 57-824 {underlined sequences correspond to start and stop codons) 

35 1 11 21 31 41 51 

I I I I I I 

GAATTCCGCG GGAGGCCACG GGCTTTCCAC AGCGCGGGGG AACGGGAGGC TGCAGG ATG G 60 

TCAAGCTGAC GGCGGAGCTG ATCGAGCAGG CGGCGCAGTA CACCAACGCG GTGCGCGACC 120 

GGGAGCTGGA CCTCCGGGGG TATAAAATTC CCGTCATTGA AAATCTAGGT GCTACGTTAG 180 

40 ACCAGTTTGA TGCTATTGAT TTTTCTGACA ATGAGATCAG GAAACTGGAT GGTTTTCCTT 24 0 

TGTTGAGAAG ACTGAAAACA TTGTTAGTGA ACAACAACAG AATATGCCGT ATAGGTGAGG 300 

GACTTGATCA GGCTCTGCCC TGTCTGACAG AACTCATTCT CACCAATAAT AGTCTCGTGG 360 

AACTGGGTGA TCTGGACCCT CTGGCATCTC TCAAATCGCT GACTTACCTA AGTATCCTAA 420 

GAAATCCGGT AACCAATAAG AAGCATTACA GATTGTATGT GATTTATAAA GTTCCGCAAG 4 80 

45 TCAGAGTACT GGATTTCCAG AAAGTGAAAC TAAAAGAGCG TCAGGAAGCA GAGAAAATGT 54 0 

TCAAGGGCAA ACGGGGTGCA CAGCTTGCAA AGGATATTGC CAGGAGAAGC AAAACTTTTA 600 

ATCCAGGTGC TGGTTTGCCA ACTGACAAAA AGAGAGGTGG GCCATCTCCA GGGGATGTAG 660 

AAGCAATCAA GAATGCCATA GCAAATGCTT CAACTCTGGC TGAAGTGGAG AGGCTGAAGG 720 

GGTTGCTGCA GTCTGGTCAG ATCCCTGGCA GAGAACGCAG ATCAGGGCCC ACTGATGATG 780 

50 GTGAAGAAGA GATGGAAGAA GACACAGTCA CAAACGGGTC C TGA GCAGTG AGGCAGATGT 840 

ATAATAATAG GCCCTCTTGG AACAAGTCTT GCTTTTCGAA CATGGTATAA TAGCCTTGTT 900 

TGTGTTAGCA AAGTGGAATC TATCAGCATT GTTGAAATGC TTAAGACTGC TGCTGATAAT 960 

TTTGTAATAT AAGTTTTGAA ATCTAAATGT CAATTTTCTA CAAATTATAA AAATAAACTC 1020 
CACTCTCTAT GCTAAAAAAA AAAAAAAGGA ATTC 

55 

Seq ID NO: 203 Protein sequence: 
Protein Accession #: NP_003081.1 

1 11 21 31 41 51 

60. | | | | | | 

MVKLTAELIE QAAQYTNAVR DRELDLRGYK IPVIENLGAT LDQFDAIDFS DNEIRKLDGF 60 

PLLRRLKTLL VNNNRICRIG EGLDQALPCL TELILTNNSL VELGDLDPLA SLKSLTYLSI 120 

LRNPVTNKKH YRLYVIYKVP QVRVLDFQKV KLKERQEAEK MFKGKRGAQLt AKDIARRSKT 180 

FNPGAGLPTD KKRGGPSPGD VEAIKNAIAN ASTLABVERL KGLLQSGQIP GRERRSGPTD 240 
65 DGEEEMEEDT VTNGS 

Seq ID NO: 204 DNA sequence 

Nucleic Acid Accession #: NM_017643.1 

Coding sequence: 169-1401 (underlined sequences correspond to start and stop codons} 

70 

1 11 21 31 41 51 

I I I I I I 

AATAGCAATA GCTTTATAGC AGCTCCGGTT ACCTGTTTTA AACATGGAAG GAGAGTCGCT 60 
CCCAGATAGC CCTCACGAGT GGCCCTGGAG CAGGGAGTGG TGGAGCAGAT CTTCCTTGTT 120 
75 TGGGAGGAGC CTGAGGTGGA CCTCGCGTCC TGAGTCTGGA AGGCACCTAT_GGGGACCTGC 180 
TGGGGTGATA TCTCAGAAAA TGTGAGAGTA GAAGTTCCCA ATACAGACTG CAGCCTACCT 240 
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10 



15 



20 



25 



30 



35 



40 



ACCAAAGTCT 
TATGAAGGAT 
ATCCATCCAG 
CAGCATAAAT 
CTGCCTCCTG 
ATGAGAGTAG 
AGTGTAATTG 
TTCTGGTGCC 
CATCGATTCA 
CATTTATTTG 
AAATTGGAAG 
GTGCTAGCTG 
GACTGGTTCT 
AACATGATTG 
TACCTCAGGG 
AATCACGGAT 
ATATGTGTAG 
TGGGAAGAAG 
TGGTGTCAGT 
AGAAAAGGTG 
TCTTATGAGC 
AAGACTCAAC 
AGTCAATTAC 
TTTAGTGAGT 
AAATAGTGGA 
TCAGAGCATC 
CTAGACAACT 
TTTTCAGCAT 
CAGCAATACA 
ATTGTTAACT 
TGATGATTGT 
TGCAATAATG 
AGTATACGTG 
TGAAAGTTAA 
GGCTCACGCC 
AGATCGAGAC 
CTGCCGGGCG 
ATCACTTGAA 
CTGGGCAACA 



TCTGGATTGC 
TTGAAAATGA 
TTGGTTGGTG 
ATACAAACTG 
ATTTCTCCCA 
AAGTGGTTGA 
GAGGAAGATT 
ATATGCACAG 
AAAGATCTGA 
CTAAGGTAAA 
CTATAGACCC 
ACGGATTCCT 
GTTACCATGC 
AACTTACTCC 
AAACTGGCTC 
TTCGTGTAGG 
CCACAGTAAC 
AGTATGATCA 
TAACTGGATA 
TCCTTTTGTA 



TCACAGGACA 
AACAATATCA 
ATAATGACTA 
TAAAAATTAC 
AAATGTCTCA 
ATGTACTTAA 
GTATCGTCTA 
CAAGAGAAAA 
AAGGACATAA 
AATTGGAGTC 
GCATTATGTA 
AGAAACACTG 
GTAAAGAATA 
TCCTTTTTAA 
TGTAATCCCA 
CATCCTGGCT 
TGGTGGCACA 
CCCAGGAGGC 
CAGCAAGACT 



TGGAATTGTA 
CTCTGGTCTG 
TGCAGCCAGC 
GAAAGCTTTT 
AAAGGTTTCA 
CAAGAGGCAT 
AAGACTAGTG 
CCCATTAATA 
TATTACAAAG 
AGAAGTAGAC 
ATTAAATCTT 
GATGATTGGG 
AACCTCTCCT 
ACCCAGAGGT 
CATTGCAGCA 
AATGAAATTA 
TCGAATTATT 
GTGGGTAGAC 
TCAACTACAG 
AAAATCAGCA 
AGAATATACC 
CAGAATCAGA 
TAGAAACACA 
ATACTAAAAG 
TGTTGAGGCT 
GTATAATGGT 
AATTGTAAAC 
CCAATCAGCT 
GAAAAGTGGG 
ACAGTATTCT 
TTATGCTTAA 
ATATTTTACT 
GAGTCTGTGA 
AAACTTTATT 
GCACTTTAGG 
AACACGGTGA 
CGCCTGAAGT 
AGAGGTTGCA 
CTGTCTCAAA 



AAATTAGCAG 
GACTTCTGGT 
GGAAAACCTC 
CTAGTGAAAC 
GAGAGTATGC 
TTGTGTCGAA 
TATGAAGAAA 
CATCATATTG 
AAACAGGATG 
CAGAGTGGGG 
TCTACAATAT 
ATCGATGGCT 
TCTATTTTCC 
TACACAAAAC 
CCAGTAAAAC 
GAAGCAGTAG 
CATCGTCTCT 
TGTGAGTCAC 
CCTCCAGCAT 
ATTCTCCAGA 
TATGTCTGAT 
CCATGTGTCC 
ACAGTCACCA 
TTTATTGGTA 
ATGGTTTTGT 
CTTGGTAAAG 
AATTATCTAG 
TCATCAAAAC 
TCACCACGTG 
TGGACAGAAA 
AGGTGCAGTA 
AACAGGAGAA 
ATGATTCTTG 
TAAAAAGAAA 
AGGCCGAGGC 
AACCCTGTCT 
CCCAGCTACT 
GTGGGCCAAG 
AAAAAAAAAA 



GTTACAATGC 


CCTTTTAAGA 


300 


GCAATATATG 


TGGTTCTGAT 


360 


TTGTTCCTCC 


TAGAACTATT 


420 


GACTTACTGG 


TGCCAAAACA 


4B0 


AGTATCCTTT 


CAAACCTTGC 


540 


CACGAGTAGC 


AGTGGTGGAA 


600 


GCGAAGATAG 


AACAGATGAC 


660 


GTTGGTCTCG 


AAGCATAGGT 


720 


GACATTTTGA 


TACACCACCA 


780 


AATGGTTCAA 


GGAAGGAATG 


640 


GTGTCGCAAC 


CATTAGAAAG 


900 


CAGAAGCAGC 


AGACGGATCT 


960 


CTGTCGGTTT 


CTGTGAAATT 


1020 


TTCCTTTTAA 


ATGGTTTGAC 


1080 


TATTTAATAA 


GGATGTTCCA 


1140 


ATCTCATGGA 


GCCACGTTTA 


1200 


TGAGGATACA 


TTTTGATGGA 


1260 


CTGACCTCTA 


TCCTGTAGGG 


1320 


CACAGTGTAA 


GTTGGTATAC 


1380 


GGACTATCTC 


ACATAAGTCA 


1440 


TGGTTGCCAG 


GTAAGACATT 


1500 


CATGGCAATG 


TGAATCCAAT 


1560 


AATTAAACTA 


GACTTACTAT 


1620 


GGTAATAAAT 


G CTTTTG AGT 


1680 


AGGAACAAGT 


ACCCTTATTT 


1740 


ATAGTTCATA 


TAAGTTGTAT 


1800 


TACCAATTTT 


CCCTTTTTAT 


1860 


AGAAGAAAAA 


GGCTAAGTCC 

^^P^^ ^b# ^b b ^b* 


1920 






*i- > O V 


ATGATATATC 


TTGTGAGAAC 


2040 


TGCCATAAAA 


GGCAAACCCT 


2100 


ATGATTACCA 


CAGTATTTAA 


2160 


AAATAATATG 


TAAAACCTAC 


2220 


AATTAGCAGC 


CAGGTGCAGT 


2280 


TGGCAGATCA 


CAAGGTCAGG 


2340 


CCACCAAAAA 


TACAAAAAAT 


2400 


CAGGAGGCTG 


AGGCAAGAGA 


2460 


ATCACGCCAC 


TACATTCCAG 


2520 


AAAA 







Seq ID NO: 205 Protein sequence : 
Protein Accession #: NP 060113.1 



45 | 



50 



MGTCWGDISE 
CGSDIHPVGW 
FKPCMRVEW 
RSIGHRFKRS 
TIRKVLADGF 
KWFDYLRETG 
HFDGV/EEEYD 



11 
I 

NVRVEVPNTD 
CAASGKPLVP 
DKRHLCRTRV 
DITKKQDGHF 

lmigidgsea 
siaapvklfn 
q;«jvdcespdl 



21 



CSLPTKVFWI 
PRTIQHKYTN 
AWESVIGGR 
DTPPHLFAKV 
ADGSDWFCYH 
KDVPNHGFRV 
YPVGWCQLTG 



31 
I 

AGIVKLAGYN 
VJKAFLVKRL.T 
LRLVYEESED 
KEVDQSGEWF 
ATSPSIFPVG 
GMKLEAVDLiM 
YQLQPPASQC 



41 


51 




1 

ALLRYEGFEN 


1 

DSGLDFWCNI 


60 


GAKTLPPDFS 


QKVSESMQYP 


120 


RTDDFWCHMH 


SPLIHHIGWS 


180 


KEGMKLEAID 


PLNLSTICVA 


240 


FCEINMIELT 


PPRGYTKLPF 


300 


EPRLICVATV 


TRIIHRLLRI 


360 


KLVYRKGVLL 







55 



60 



65 



70 



Seq ID NO: 206 DNA sequence 
Nucleic Acid Accession #: NM_012334 

Coding sequence: 223-6399 (underlined sequences correspond to start and stop codons) 



75 



I 

GAGACAAAGG 
TGAGAAGGAC 
CGGGAGTGGC 
AGTCGGAGCG 
GAGGGAACAC 
TGTGCAGAAG 
AGCACAATTA 
GACATGGCGT 
TATAAGAGAA 
CAGCCCATCG 
GGCGAGCTGC 
CGCTACGACA 
ACTAAATTGA 
AAGGAGAAGA 
TTCGGCAATG 



11 
I 

CTGCCGTCGG 
AAGAAGGGAC 
GCCGTGACAC 
GCACTCGGCG 
GGGTCTGGCT 
GCATCGTCGT 
CCCACCAGAA 
CCTTGACAGA 
ATCAAATATA 
CCGGGCTGTA 
CCCCGCACAT 
ACCAGTGCAT 
TCCTCAAGTT 
CATCCTGTGT 
CGAAGACCGT 



21 
I 

GACGGGCGAG 
CGGGCGATGG 
GCATGGTTTC 
AGTCCGGGAC 
GAGAGAAAAT 
CTTCCGGACA 
GGTGACTGCT 
GCTCCATGGC 
TACCTACATC 
CGAGCCTGCC 
CTTCGCCATC 
CCTCATCAGT 
TCTGTCAGTC 
TGAACGAGCT 
GTACAACAAC 



31 
I 

TTAGGGACTT 
CAGCAGGGGA 
CCCGGACCCG 
TGCGCTGGAA 
GGCCAGCATT 
GACTATGGTC 
ATGCACCCCA 
GGCTCCATCA 
GGCTCCATCC 
ACCATGGAGC 
GCCAACGAGT 
GGTGAAAGTG 
ATCAGTCAAC 
ATTCTTGAAA 
AACTCTAGTC 



41 

I 

GGGTTTGGGC 
GCCCCGCGGG 
CGGCGGCGCT 
C AATG GATAA 
TTCCAAGTAC 
AGGTATTCAC 
CGAACGAGGA 
TGTATAACTT 
TGGCCTCCGT 
AGTACAGCCG 
GCTACCGCTG 
GGGCAGGTAA 
AGTCTTTGGA 
GCAGCCCCAT 
GCTTTGGGAA 



51 



GAACAAAAGG 
CGCGCGTCCT 
GACTTCCGCG 
CTTCTTCACC 
TGTAAATTCC 
TTACAAGCAG 
GGGCGTGGAT 
ATTCCAGCGG 
GAACCCCTAC 
GCGCCACCTG 
CCTGTGGAAG 
AACCGAAAGC 
ATTGTCCTTA 
CATGGAAGCT 
GTTTGTTCAG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
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CTGAACATCT GTCAGAAAGG AAATATTCAG 
AAAAACCGAG TAGTAAGGCA AAATCCCGGG 
CTGGCAGGGC TGGAACATGA AGAAAGAGAA 
CACTACTTGA ATCAGTCTGG ATGTGTAGAA 
5 AGGGAAGTTA TTACGGCAAT GGACGTGATG 
TCGAGGCTGC TTGCTGGTAT ACTGCATCTT 
GCACAGGTTT CCTTCAAAAC AGCTTTGGGC 
ACACAGCTCA CAGATGCTTT GACCCAGAGA 
ACGCCTCTCA ATGTTCAACA GGCAGTAGAC 

10 GCGTGCTGCT TTGAGTGGGT AATCAAGAAG 
TTCAAGTCTA TTGGCATCCT CGACATCTTT 
GAACAGTTCA ATATAAACTA TGCAAACGAG 
TTTTCTTTAG AACAACTAGA ATATAGCCGG 
ATAGACAATG GAGAATGCCT GGACTTGATT 

15 AATGAAGAAA GCCATTTTCC TCAAGCCACA 
CAGCATGCGA ATAACCACTT TTATGTGAAG 
AAGCACTATG CTGGAGAGGT GCAATATGAT 
ACATTTCGAG ATGACCTTCT CAATTTGCTA 
CTTTTTGAAC ATGTTTCAAG CCGCAACAAC 

20 CGGCGGCCTA CAGTCAGCTC ACAGTTCAAG 
AGCTCCTCTA ATCCTTTCTT TGTTCGCTGT 
CAGTTTGACC AGGCGGTTGT GCTGAACCAG 
AGAATCCGCA AAGCTGGGTA TGCGGTCCGA 
AAAGTGCTGA TGAGGAATCT GGCTCTGCCT 

25 CTGCAGCTCT ATGATGCCTC CAACAGCGAG 
CGAGAATCCT TGGAACAGAA ACTGGAGAAG 
ATGGTGATTC GGGCCCATGT CTTGGGCTTC 
TATTGTGTGG TGATAATACA GAAGAATTAC 
CACCTGAAAA AGGCAGCCAT AGTTTTCCAG 

30 GTTTACAGAC AATTGCTGGC AGAGAAAAGG 
GAAGAAAAGA AGAAACGGGA GGAAGAAGAA 
GAGCTCCGCG CCCAGCAGGA AGAAGAAACG 
AAGAGCCAGA AGGAAGCTGA ACTGACCCGT 
GTGGAAGAGA TCCTCCGTCT GGAGAAAGAA 

35 CAGGAGCTGT CGCTGACCGA GGCTTCCCTG 
CTCCGCAGGC TGGAGGAGGA AGCGTGCAGG 
TTCGACGAGA TCGACGAGTG TGTCCGGAAT 
TTTTCCAGCG AGCTGGCTGA GAGCGCATGC 
CCCTACCCAG AGGAGGAGGT CGATGAGGGC 

40 TCCCCCAACC CCAGCGAGCA CGGCCACTCA 
GATGACTCTT CAGAGGAGGA CCCATACATG 
GCGGACAGCA CGGTGCTGCT CGCCCCATCA 
TCCAGCGGCG AGTCCACCTA CTGCATGCCC 
GGCGACTACG ACTACGACCA GGATGACTAT 

45 GTGACCTTCT CCAACTCCTA CGGCAGCCAG 
ACCTACAACA GCTCGGGTGC CTACCGGTTC 
GATAGTGAAG AGGACTTTGA TTCCAGGTTT 
GACTCTGTGT ACAGCTGTGT CACTCTGCCG 
GGCCTGATGA ACTCTTGGAA ACGCCGCTGG 

50 TTCCGCTCCA AGCAGGAGGC CCTCAAGCAA 
TCCACGCTGT CC AG GAG AAA TTGGAAGAAG 
ATGTACTTTG AAAACGACAG CGAGGAGAAG 
AAAGAGATCA TAGATAACAC CACCAAGGAG 
ACTTTCCACC TGATTGCAGA GTCCCCAGAA 

55 CAGGTCCACG CGTCCACGGA CCAGGAGATC 
CAGAATGCTG TGGGCACCTT GGATGTGGGG 
CCTGATAGAC CCAACTCGTT TGTGATCATC 
GACACGCCGG AGGAGATGCA CCACTGGATA 
AGAGTGGAGG GCCAGGAATT CATCGTGAGA 

60 CCGAAGATGT CTTCACTGAA ACTGAAGAAA 
GATTACTACA AGAGTTCAGA GAAGAACGCG 
CTCTGCTCTG TCGTCCCCCC AGATGAGAAG 
ACCGTGTACG GGCGCAAGCA CTGTTACCGG 
. CGGTGGTCCA GTGCCATTCA AAACGTGACT 

65 CAGCAGCTGA TTCAAGATAT CAAGGAGAAC 
TACAAGCGGA ACCCGATCCT TCGATACACC 
CTTCCGTATG GGGACATAAA TCTCAACTTG 
GATGAGGCCA TCAAGATATT CAATTCCCTG 
CCAATAATCC AGGGCATCCT ACAGACAGGG 

70 TACTGCCAGC TTATCAAACA GACCAACAAA 
TACAGCTGGC AGATCCTGAC ATGCCTGAGC 
AAGTATCTCA AGTTCCATCT GAAAAGGATA 
AAATACGCTC TCTTGACTTA CGAATCTCTT 
TCCCGAGATG AAATAGAAGC TCTGATCCAC 

75 CATGGCGGCG GCTCCTGCAA GATCACCATC 
GAGAAGCTGA TCCGAGGCCT GGCCATGGAG 



GGCGGGAGAA TTGTAGATTA TTTATTAGAA 960 

GAAAGGAATT ATCACATATT TTATGCACTG 1020 

GAATTTTATT TATCTACGCC AGAAAACTAC 1080 

GACAAGACAA TCAGTGACCA GGAATCCTTT 1140 

CAGTTCAGCA AGGAGGAAGT TCGGGAAGTG 1200 

GGGAACATAG AATTTATCAC TGCTGGTGGG 1260 

AGATCTGCGG AGTTACTTGG GCTGGACCCA 1320 

TCAATGTTCC TCAGGGGAGA AGAGATCCTC 1380 

AGCAGGGACT CCCTGGCCAT GGCTCTGTAT 1440 

ATCAACAGCA GGATCAAAGG CAATGAGGAC 1500 

GGATTTGAAA ACTTTGAGGT TAATCACTTT 1560 

AAACTTCAGG AGTACTTCAA CAAGCATATT 1620 

GAAGGATTAG TGTGGGAAGA TATTGACTGG 1680 

GAGAAGAAAC TTGGCCTCCT AGCCCTTATC 1740 

GACAGCACCT TATTGGAGAA GCTACACAGT 1800 

CCCAGAGTTG CAGTTAACAA TTTTGGAGTG 1860 

GTCCGAGGTA TCTTGGAGAA GAACAGAGAT 1920 

AGAGAAAGCC GATTTGACTT TATCTACGAT 1980 

CAGGATACCT TGAAATGTGG AAGCAAACAT 2040 

GACTCACTGC ATTCCTTAAT GGCAACGCTA 2100 

ATCAAGCCAA ACATGCAGAA GATGCCAGAC 2160 

CTGCGGTACT CAGGGATGCT GGAGACTGTG 2220 

AGACCCTTTC AGGACTTTTA CAAAAGGTAT 2280 

GAGGACGTCC GAGGGAAGTG CACGAGCCTG 2340 

TGGCAGCTGG GGAAGACCAA GGTCTTTCTT 2400 

CGGAGGGAAG AGGAAGTGAG CCACGCGGCC 2460 

TTAGCACGAA AACAATACAG AAAGGTCCTT 2520 

AGAGCATTCC TTCTGAGGAG GAGATTTTTG 2580 

AAGCAACTCA GAGGTCAGAT TGCTCGGAGA 2640 

GAGCAAGAAG AAAAGAAGAA ACAGGAAGAG 2700 

AGAGAAAGAG AGAGAGAGCG AAGAGAAGCC 2760 

AGGAAGCAGC AAGAACTCGA AGCCTTGCAG 2820 

GAACTGGAGA AACAGAAGGA AAATAAGCAG 2880 

ATCGAGGACC TGCAGCGCAT GAAGGAGCAG 2940 

CAGAAGCTGC AGGAGCGGCG GGACCAGGAG 3000 

GCGGCCCAGG AGTTCCTCGA GTCCCTCAAT 3060 

ATCGAGCGGT CCCTGTCGGT GGGAAGCGAA 3120 

GAGGAGAAGC CCAACTTCAA CTTCAGCCAG 3180 

TTCGAAGCCG ACGACGACGC CTTCAAGGAC 324 0 

GACCAGCGAA CAAGTGGCAT CCGGACCAGC 3300 

AACGACACGG TGGTGCCCAC CAGCCCCAGT 3360 

GTGCAGGACT CCGGGAGCCT ACACAACTCC 3420 

CAGAACGCTG GGGACTTGCC CTCCCCAGAC 34 80 

GAGGACGGTG CCATCACTTC CGGCAGCAGC 354 0 

TGGTCCCCCG ACTACCGCTG CTCTGTGGGG 3600 

AGCTCTGAGG GGGCGCAGTC CTCGTTTGAA 3660 

GATACAGATG ATGAGCTTTC ATACCGGCGT 3720 

TATTTCCACA GCTTTCTGTA - CATGAAAGGT 3780 

TGCGTCCTCA AGGATGAAAC CTTCTTGTGG 3840 

GGCTGGCTCC ACAAAAAAGG GGGGGGCTCC 3900 

CGCTGGTTTG TCCTCCGCCA GTCCAAGCTG 3960 

CTCAAGGGCA CCGTAGAAGT GCGAACGGCA 4020 

AATGGGATCG ACATCATTAT GGCCGATAGG 4 080 

GATGCCAGCC AGTGGTTCAG CGTGCTGAGT 414 0 

CAGGAGATGC ATGATGAGCA GGCAAACCCA 42 00 

CTGATTGATT CTGTGTGTGC CTCTGACAGC 4260 

ACGGCCAACC GGGTGCTGCA CTGCAACGCC 4320 

ACCCTGCTGC AGAGGTCCAA AGGGGACACC 43 80 

GGATGGTTGC ACAAAGAGGT GAAGAACAGT 4440 

CGGTGGTTTG TACTCACCCA CAATTCCCTG 4500 

CTCAAACTGG GGACCCTGGT CCTCAACAGC 4560 

ATATTCAAAG AGACAGGCTA CTGGAACGTC 4620 

CTCTACACCA AGCTGCTCAA CGAGGCCACC 4680 

GACACCAAGG CCCCGATCGA CACCCCCACC 474 0 

TGCCTGAACT CGGATGTGGT GGAACAGATT 4 800 

CATCACCCCT TGCACTCCCC GCTCCTGCCC 4860 

CTCAAAGACA AAGGCTATAC CACCCTTCAG 4 920 

CAGCAACTGG AGTCCATGTC TGACCCAATT 4 980 

CATGACCTGC GACCTCTGCG GGACGAGCTG 5040 

GTGCCCCACC CCGGCAGTGT GGGCAACCTG 5100 

TGCACCTTCC TGCCGAGTCG AGGGATTCTC 516 0 

CGGGAACAGT TTCCAGGAAC CGAGATGGAA 5220 

AAGAAAACCA AATGCCGAGA GTTTGTGCCT 5280 

AGGCAGGAAA TGACATCCAC GGTCTATTGC 5340 

AACTCCCACA CCACTGCTGG GGAGGTGGTG 54 00 

GACAGCAGGA ACATGTTTGC TTTGTTTGAA 5460 
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TACAACGGCC ACGTCGACAA AGCCATTGAA AGTCGAACCG TCGTAGCTGA TGTCTTAGCC 5520 

AAGTTTGAAA AGCTGGCTGC CACATCCGAG GTTGGGGACC TGCCATGGAA ATTCTACTTC 5580 

AAACTTTACT GCTTCCTGGA CACAGACAAC GTGCCAAAAG ACAGTGTGGA GTTTGCATTT 5640 

ATGTTTGAAC AGGCCCACGA AGCGGTTATC CATGGCCACC ATCCAGCCCC GGAAGAAAAC 5700 

5 CTCCAGGTTC TTGCTGCCCT GCGACTCCAG TATCTGCAGG GGGATTATAC TCTGCACGCT 5760 

GCCATCCCAC CTCTCGAAGA GGTTTATTCC CTGCAGAGAC TCAAGGCCCG CATCAGCCAG 5820 

TCAACCAAAA CCTTCACCCC TTGTGAACGG CTGGAGAAGA GGCGGACGAG CTTCCTAGAG 5880 

GGGACCCTGA GGCGGAGCTT CCGGACAGGA TCCGTGGTCC GGCAGAAGGT CGAGGAGGAG 5940 

CAGATGCTGG ACATGTGGAT TAAGGAAGAA GTCTCCTCTG CTCGAGCCAG TATCATTGAC 6000 

10 AAGTGGAGGA AATTTCAGGG AATGAACCAG GAACAGGCCA TGGCCAAGTA CATGGCCTTG 6060 

ATCAAGGAGT GGCCTGGCTA TGGCTCGACG CTGTTTGATG TGGAGTGCAA GGAAGGTGGC 6120 

TTCCCTCAGG AACTCTGGTT GGGTGTCAGC GCGGACGCCG TCTCCGTCTA CAAGCGTGGA 6180 

GAGGGAAGAC CACTGGAAGT CTTCCAGTAT GAACACATCC TCTCTTTTGG GGCACCCCTG 6240 

GCGAATACGT ATAAGATCGT GGTCGATGAG AGGGAGCTGC TCTTTGAAAC CAGTGAGGTG 6300 

15 GTGGATGTGG CCAAGCTCAT GAAAGCCTAC ATCAGCATGA TCGTGAAGAA GCGCTACAGC 6360 

ACGACACGCT CCGCCAGCAG CCAGGGCAGC TCCAGGTGAA GGCGGGACAG AGCCCACCTG 6420 

TCTTTGCTAC CTGAACGCAC CACCCTCTGG CCTAGGCTGG CTCCAGTGTG CCATGCCCAG 6480 

CCAAAACAAA CACAGAGCTG CCCAGGCTTT CTGGAAGCTT CTGGTCTGAG GGAGGTGTCT 6540 

CCGAGGATCC TTTTGCCTGC CGCCTTCATT GATCCTGTAT TAAGCTGTCA ACTTTAACAG 6600 

20 TCTGCACAGT TTCCAAAGCT TTACTACTCT TAGAGGACAC ATGCCTTAAA AAAGGAGGGG 6660 

AGGAACCACG CTGCCACCAA AGCAGCCGGA AGTGCCTTAA CTTGTGGAAC CAACACTAAT 6720 

CGACCGTAAC TGTGCTACTG AAGGGAACTG CCTTTCCCCC TTCTGGGGGA GACTTAACAG 6780 

AGCGTGGAAG GGGGGCATTC TCTGTCAATG ATGCACTAAC CTCCCAACCT GATTTCCCCG 6840 

AATCTGAGGG AAGGTGAGGG AGTGGGAAGG GGGATGGAGA GCTCGAGGGG ACAGTGTGTT 6900 

25 TGAGCTGGAG TGCTGCGGGC AGCCTTTCTC ATGGAATGAC ATGAATCAAC TTTTTTCTTT 6960 

GTTTCATCTT TTAAGTGTAC GTGCTTGCCT GTTCGTGCAT GTGTTCATAA ACTCAACACT 7020 

TTAATCATGG TTTCATGAGC ATTAAAAAGC AAAGGGAAAA AGGATGTGTA ATGGTGTACA 7080 

CAGTCTGTAT ATTTTAATAA TGCAGAGCTA TAGTCTCAAT TGTTACTTTA TAAGGTGGTT 7140 

TTATTAACAA ACCCAAATCC TGGATTTTCC TGTCTTTGCT GTATTTTGAA AAACACGTGT 7200 

30 TGACTCCATT GTTTTACATG TAGCAAAGTC TGCCATCTGT GTCTGCTGTA TTATAAACAG 7260 

ATAAGCAGCC TACAAGATAA CTGTATTTAT AAACCACTCT TCAACAGCTG GCTCCAGTGC 7320 

TGGTTTTAGA ACAAGAATGA AGTCATTTTG GAGTCTTTCA TGTCTAAAAG ATTTAAGTTA 7380 

AAAACAAAGT GTTACTTGGA AGGTTAGCTT CTATCATTCT GGATAGATTA CAGATATAAT 744 0 

AACCATGTTG ACTATGGGGG AGAGACGCTG CATTCCAGAA ACGTCTTAAC ACTTGAGTGA 7500 

35 ATCTTCAAAG GACCCTGACA TTAAATGCTG AGGCTTTAAT ACACACATAT TTTATCCCAA 7560 

GTTTATAATG GTGGTCTGAA CAAGGCACCT GTAAATAAAT CAGCATTTAT GACCAGAAGA 7620 

AAAATAATCT GGTCTTGGAC TTTTTATTTT TATATGGAAA AGTTTTAAGG ACTTGGGCCA 7680 

ACTAAGTCTA CCCACACGAA AAAAGAAATT TGCCTTGTCC CTTTGTGTAC AACCATGCAA 7740 
AACTGTTTGT TGGCTCACAG AAGTTCTGAC AATAAAAGAT ACTAGCT 

40 

Seq ID NO: 207 Protein sequence: 
Protein Accession #: NP_036466 

1 11 21 31 41 51 

45 | I I I I I 

MDNFFTEGTR VWLRENGQHF PSTVNSCAEG IWFRTDYGQ VFTYKQSTIT HQKVTAMHPT 60 

NEEGVDDMAS LTELHGGSIM YNLFQRYKRN QIYTYIGSIL ASVNPYQPIA GLYEPATMEQ 120 

YSRRHLGELP PHIFAIANEC YRCLWKRYDN QCILISGESG AGKTESTKLI LKFLSVISQQ 180 

SLELSLKEKT SCVERAILES SPIMBAFGNA KTVYNNNSSR FGKFVQLNIC QKGNIQGGRI 24 0 

50 VDYLLEKNRV VRQNPGERNY HI FYALLAGL EHEEREEFYL STPENYHYLN QSGCVEDKTI 300 

SDQESFREVI TAMDVMQFSK EEVREVSRLL AGILHLGNIE FITAGGAQVS FKTALGRSAE 360 

LLGLDPTQLT DALTQRSMFL RGEEILTPIjN VQQAVDSRDS LAMALYACCF EWVIKKINSR 4 20 

IKGNEDFKSI GILDIFGFEN FEVNHFEQFN INYANEKLQE YFNKHIFSLE QLEYSREGLV 4 80 

WEDIDWIDNG ECLDLIEKKL GLLALINEES HFPQATDSTL IjEKLHSQHAN NHFYVKPRVA 54 0 

55 .VNNFGVKHYA GEVQYDVRGI LEKNRDTFRD DLLNLLRESR FDFIYDLFEH VSSRNNQDTL 600 

KCGSKHRRPT VSSQFKDSLH SLMATLSSSN PFFVRCIKPN MQKMPDQFDQ AWLNQLRYS 660 

GMLETVRIRK AGYAVRRPFQ DFYKRYKVLM RNIiALPEDVR GKCTSLLQLY DASNSEWQLG 720 

KTKVFLRESL EQKLEKRREE EVSHAAMVIR AHVLGFLARK QYRKVLYCW IIQKNYRAFL 780 

LRRRFLHLKK AAIVFQKQLR GQIARRVYRQ LLAEKREQEE KKKQEEEEKK KREEEERERE 840 

60 RERREAELRA QQEEETRKQQ ELEALQKSQK EAELTRELEK QKENKQVEEI LRLEKEIEDL 900 

QRMKEQQELS LTEASLQKLQ ERRDQELRRL EEEACRAAQE FLESLNFDEI DECVRNIERS 960 

LSVGSEFSSE LAESACEEKP NFNFSQPYPE EEVDEGFEAD DDAFKDSPNP SEHGHSDQRT 1020 

SGIRTSDDSS EEDPYMNDTV VPTSPSADST VLLAPSVQDS GSLHNSSSGE STYCMPQNAG 1080 

DLPSPDGDYD YDQDDYEDGA ITSGSSVTFS NSYGSQWSPD YRCSVGTYNS SGAYRFSSEG 1140 

65 AQSSFEDSEE DFDSRFDTDD ELSYRRDSVY SCVTLPYFHS FLYMKGGLMN SWKRRWCVLK 1200 

DETFLWFRSK QEALKQGWLH KKGGGSSTLS RRNWKKRWFV LRQSKLMYFE NDSEEKLKGT 1260 

VEVRTAKEII DNTTKENGID IIMADRTFHL IAESPEDASQ WFSVLSQVHA STDQEIQEMH 1320 

DEQANPQNAV GTLDVGLIDS VCASDSPDRP NSFVIITANR VLHCNADTPE EMHHWITLLQ 1380 

RSKGDTRVEG QEFIVRGWLH KEVKNSPKMS SLKLKKRWFV LTHNSLDYYK SSEKNAL.KLG 1440 

70 TLVLNSLCSV VPPDEKIFKE TGYWNVTVYG RKHCYRLYTK LLNEATRWSS AIQNVTDTKA 1500 

PIDTPTQQLI QDIKENCLNS DWEQIYKRN PILRYTHHPL HSPLLPLPYG DINIjNLLKDK 1560 

GYTTLQDEAI KIFNSLQQLE SMSDPIPIIQ GILQTGHDLR PLRDELYCQL IKQTNFCVPHP 1620 

GSVGNLYSWQ ILTCLSCTFL PSRGILKYLK FHLKRIREQF PGTEMEKYAL FTYESLKKTK 1680 

CREFVPSRDE IEALIHRQEM TSTVYCHGGG SCKITINSHT TAGEWEKLI RGLAMEDSRN 1740 

75 MFALFEYNGH VDKAIESRTV VADVLAKFEK LAATSEVGDL PWKFYFKLYC FLDTDNVPKD 1800 

SVEFAFMFEQ AHEAVIHGHH PAPEENLQVL AALRLQYLQG DYTLHAAIPP LEEVYSLQRL 1860 
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KARISQSTKT FTPCERLEKR RTSFliEGTLR RSFRTGSWR QKVEEEQMLD MWIKEEVSSA 1920 

RASIIDKWRK FQGMNQEQAM AKYMALIKEW PGYGSTLFDV ECKEGGFPQE LWLGVSADAV 198 0 

SVYKRGEGRP LEVFQYEHIIi SFGA PLANT Y KIWDERELL FETSEWDVA KLMKAYISMI 2040 
VKKRYSTTRS ASSQGSSR 



Seg ID NO: 208 DNA sequence 

Nucleic Acid Accession #: XMJ359761.1 

Coding sequence: 124-525 (underlined sequences correspond to start and stop codons) 



1 11 21 31 41 51 

1 I I I * I I 

CGAAGATCTA TCCAAAATCA AGAAGCCTTT GATTTAGATG TTGCTGTAAA AGAAAATAAA 60 

GATGATCTCA ATCATGTGGA TTTGAATGTG TGTACAAGCT TTTCGGGCCC GGGTAGGAGT 120 

GGC ATGG CTC TTATGGAAGT TAACCTATTA AGTGGCTTTA TGGTGCCTTC AGAAGCAATT 180 

TCTCTGAGCG AGACAGTGAA GAAAGTGGAA TATGATCATG GAAAACTCAA CCTCTATTTA 240 

GATTCTGTAA ATGAAACCCA GTTTTGTGTT AATATTCCTG CTGTGAGAAA CTTTAAAGTT 300 

TCAAATACCC AAGATGCTTC AGTGTCCATA GTGGATTACT ATGAGCCAAG GAGACAGGCG 360 

GTGAGAAGTT ACAACTCTGA AGTGAAGCTG TCCTCCTGTG ACCTTTGCAG TGATGTCCAG 420 

GGCTGCCGTC CTTGTGAGGA TGGAGCTTCA GGCTCCCATC ATCACTCTTC AGTCATTTTT 4 80 

ATTTTCTGTT TCAAGCTTCT GTACTTTATG GAACTTTGGC TGTGATTTAT TTTTAAAGGA 540 

CTCTGTGTAA CACTAACATT TCCAGTAGTC ACATGTGATT GTTTTGTTTT CGTAGAAGAA 600 

TACTGCTTCT ATTTTGAAAA AAGAGTTTTT TTTCTTTCTA TGGGGTTGCA GGGATGGTGT 660 

ACAACAGGTC CTAGCATGTA TAGCTGCATA GATTTCTTCA CCTGATCTTT GTGTGGAAGA 720 

TCAGAATGAA TGCAGTTGTG TGTCTATATT TTCCCCTCTC AAAATCTTTT AGAATTTTTT 780 
TGGAGGTGTT TGTTTTCTCC AGAATAAAGG TATTACTTTA G 



Seq ID NO: 209 Protein sequence: 
Protein Accession #: XP 059761.1 



1 11 21 31 41 51 

I I I I I I 

MALMEVNLLS GFMVPSEAIS LSETVKKVEY DHGKLNLYLD SVNETQFCVN IPAVRNFKVS 60 
NTQDASVSIV DYYEPRRQAV RSYNSEVKLS SCDLCSDVQG CRPCEDGASG SHHHSSVIFI 120 
FCFKLLYFME LWL 



Seq ID NO: 210 DNA sequence 
Nucleic Acid Accession #: NM_015472 

Coding sequence: 258-1460 (underlined sequences correspond to start and stop codons) 



I 11 21 31 41 51 

II I 111 

GACACACTCC TCTACAACAC CAGAGACTCC CAAACACAAG GCCTTATATT GACTCATTTC 60 

AGCTCACATC CTGGCGACTC TCAAGAGAGA AACCTCAGAG TGACTAAAAT CTCCATAATG 120 

AGAAGACATG TACATTCAGT ATCTATTTTG GCATTTTCCC CAATACATCT CTGCTCATCT 180 

GACTCTTATC TTGGCATCTG CTTCCTGGTG GATCTGAACT GACCCATAAG CCACGCTTAC 24 0 

TGGTGATTTT CCAGAAGATG AATCCGGCCT CGGCGCCCCC TCCGCTCCCG CCGCCTGGGC 300 

AGCAAGTGAT CCACGTCACG CAGGACCTAG ACACAGACCT CGAAGCCCTC TTCAACTCTG 3 60 

TCATGAATCC GAAGCCTAGC TCGTGGCGGA AGAAGATCCT GCCGGAGTCT TTCTTTAAGG 42 0 

AGCCTGATTC GGGCTCGCAC TCGCGCCAGT CCAGCACCGA CTCGTCGGGC GGCCACCCGG 4 80 

GGCCTCGACT GGCTGGGGGT GCCCAGCATG TCCGCTCGCA CTCGTCGCCC GCGTCCCTGC 54 0 

AGCTGGGCAC CGGCGCGGGT GCTGCGGGTA GCCCCGCGCA GCAGCACGCG CACCTCCGCC 600 

AGCAGTCCTA CGACGTGACC GACGAGCTGC CACTGCCCCC GGGCTGGGAG ATGACCTTCA 660 

CGGCCACTGG CCAGAGGTAC TTCCTCAATC ACATAGAAAA AATCACCACA TGGCAAGACC 720 

CTAGGAAGGC GATGAATCAG CCTCTGAATC ATATGAACCT CCACCCTGCC GTCAGTTCCA 780 

CACCAGTGCC TCAGAGGTCC ATGGCAGTAT CCCAGCCAAA TCTCGTGATG AATCACCAAC 840 

ACCAGCAGCA GATGGCCCCC AGTACCCTGA GCCAGCAGAA CCACCCCACT CAGAACCCAC 900 

CCGCAGGGCT CATGAGTATG CCCAATGCGC TGACCACTCA GCAGCAGCAG CAGCAGAAAC 960 

TGCGGCTTCA GAGAATCCAG ATGGAGAGAG AAAGGATTCG AATGCGCCAA GAGGAGCTCA 1020 

TGAGGCAGGA AGCTGCCCTC TGTCGACAGC TCCCCATGGA AGCTGAGACT CTTGCCCCAG 1080 

TTCAGGCTGC TGTCAACCCA CCCACGATGA CCCCAGACAT GAGATCCATC ACTAATAATA 114 0 

GCTCAGATCC TTTCCTCAAT GGAGGGCCAT ATCATTCGAG GGAGCAGAGC ACTGACAGTG 1200 

GCCTGGGGTT AGGGTGCTAC AGTGTCCCCA CAACTCCGGA GGACTTCCTC AGCAATGTGG 1260 

ATGAGATGGA TACAGGAGAA AACGCAGGAC AAACACCCAT GAACATCAAT CCCCAACAGA 1320 

CCCGTTTCCC TGATTTCCTT GACTGTCTTC CAGGAACAAA CGTTGACTTA GGAACTTTGG 138 0 

AATCTGAAGA CCTGATCCCC CTCTTCAATG ATGTAGAGTC TGCTCTGAAC AAAAGTGAGC 144 0 

CCTTTCTAAC CTGGCTGTAA TCACTACCAT TGTAACTTGG ATGTAGCCAT GACCTTACAT 1500 

TTCCTGGGCC TCTTGGAAAA AGTGATGGAG CAGAGCAAGT CTGCAGGTGC ACCACTTCCC 1560 

GCCTCCATGA CTCGTGCTCC CTCCTTTTTA TGTTGCCAGT TTAATCATTG CCTGGTTTTG 1620 
ATTGAGAGTA ACTTAAGTTA AACATAAATA AATATTCTAT TTTCATTTTC 

■ 

Seq ID NO: 211 Protein sequence: 
Protein Accession #: NP_056287.1 

1 11 21 31 41 51 
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MNPASAPPPL 
HSRQSSTDSS 
TDELPIiPPGW 
SMAVSQPNLV 
QMERERIRMR 
NGGPYHSREQ 
LDCLPGTNVD 



PPPGQQVIHV 
GGHPGPRLAG 
EMTFTATGQR 
MNHQHQQQMA 
QEELMRQEAA 
STDSGLGLGC 
LGTLESEDLI 



TQDLDTDLEA 
GAQHVRSHSS 
.YFLNHIEKIT 
PSTLSQQNHP 
LCRQLPMEAE 
YSVPTTPEDF 
PLFNDVESAL 



LFNSVMNPKP 
PASLQLGTGA 
TWQDPRKAMN 
TQNPPAGLMS 
TLAPVQAAVN 
LSNVDEMDTG 
NKSEPFLTWL 



SSWRKKILPE 
GAAGSPAQQH 
QPLNHMNLHP 
MPNALTTQQQ 
PPTMTPDMRS 
ENAGQTPMNI 



SFFKEPDSGS 60 

AHLRQQSYDV 120 

AVSSTPVP'QR 180 

QQQKLRLQRI 240 

ITNNSSDPFL 300 

NPQQTRFPDF 360 



Seq ID NO: 212 DNA sequence 
10 Nucleic Acid Accession #: NM_018174 

Coding sequence: 176-2194 (underlined sequences correspond to start and stop codons) 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



CATCTCCCCC 
GCGCGGCGAG 
TCTGCCACTC 
CGTGGGCCGG 
GGCCTCTGTG 
CGTGCTGTTC 
GCACTTGAGG 
AGCCGAGAGC 
CACCCACCCT 
TGAGGCCCCA 
CAAACCGAGT 
CAACCTCAAG 
CCACTCTGGC 
AGAAGCCAGC 
CAGCCTGGAG 
CGCCAGCTCA 
GGGCAGCGAG 
ACCCACAGTG 
GCACTCGACC 
CGCCCCCACC 
GGCTTCCCCA 
GGCGGTGCCA 
GTCACAGGAA 
GTCCCTGCCC 
AGACGAAGAC 
GGTCCCCCCA 
CCCCAAGACA 
CAACTCACGC 
TGCTGGTGGG 
AGGCCGGGCA 
GGGGTCAGCC 
GGACCTGGCC 
GCGCGTGCGC 
GCGGGCCGTC 
GACCCTGATC 
CCGGCACCAG 
TGACGCCTTC 
AGCCCAGCCC 



AACCTGGGGG 
GATGAGGCGG 
AGCCGCGGCC 
CTGGACATGT 
TGCGCCCTGC 
CCCGGTTGCA 
TTCCTGCGAG 
AAAGAGAGCG 
AGACCTGGCC 
CGCAAGACTG 
GTCTCCCGGA 
AAGACGAATG 
TTCCCGCCGG 
CCCCCCAGTG 
CTGGGGCCGA 
ATCCCAAGGC 
CGGCTGTCGC 
ACCACACCCA 
GAGGTGGACG 
AGTGAGGCTG 
CACGATGTGG 
ATGGCACCGG 
CGGGCAGGTG 
ACCCTGTCTG 
ACAGAGGGCT 
CCACTGCCTG 
GCACGGCAAA 
GCTGCCGCCC 
GACCGTGCCA 
CCCCTGTCCA 
AGCAGCCGGC 
TACCTGCCCA 
GCGCTCTGCT 
CTGGACGCGC 
CCCACTTTCG 
GCGCTGGGCA 
CCGGCCTGCA 
GCCTGTCCCT 



TCGTGTTCTT 
AGCTGGCGCT 
CCGTGCCAGC 
ATGTGCTGCA 
TGGTGTGGCA 
CCCCGCCCGC 
AGCCCGTGGT 
TGGGCTCCCG 
AGGAGCGCCC 
AGAAAGAAGC 
CCCAGCCGCG 
CCCAGGCGGC 
TGGCAAATGG 
CAGCCTGCGG 
TCCCAGCCGG 
CACGCACACC 
TGAGCCCACT 
CGGTGACCAC 
AGTCCCTGTC 
GG CTGAGCCT 
ACCTGTGCCT 
CACCTGCGTC 
GGCTGGGGGC 
ACTCGGATCC 
TTGGAGTCCC 
ACCCATCCAG 
CGGAGAACGT 
CCAAAGCCAC 
GCCGACCACT 
GAAAGTCCTC 
CCGGGGTGTC 
GCGGGAGCAG 
ACGTCATCAG 
TACTGGCCAG 
ACTCGGTGGC 
TCACGGTGTT 
AGGTGGAGTT 
AGATTCAGCC 



CAACGCCTGC 
GAGCCTCCTG 
CAAACCCACC 
CCCGCCCTCC 
CCCCGCCGGC 
CTGCCTCCTG 
GACGCCCCAG 
GGACAGCTCG 
TGGGGTGGCC 
CAAGACCCCC 
GGAGGTGCGC 
ACCCAAGCCC 
ACCCCGCAGC 
CTCTCCGGCC 
GGAGGAGAAG 
CTCCCCTGAG 
GCGGGGCGGG 
GCCCTCACTA 
GGTGTCCTTT 
CCCGCTGCGT 
GGTGTCACCC 
CCCCGGCAGC 
CGAGGAGACG 
CGTGCCCCTG 
TCGCCACGAC 
CATCTGCATG 
CAGCCGCACC 
TCCAGTGGCT 
CAGTGCCCGG 
AACCCCCAAG 
AGCCACCCCA 
CGCCCACCTG 
TGGCCAGGAC 
CAAGCAGCAT 
CATGCATACG 
GGGCAGCAAC 
CTAGCCCCAT 
ACATCAGAAA 



GAGGCCGCGT 
GCGCAGCTGG 
GTGCTCTTCG 
GCCGGCGCCG 
CCCGGCGAGA 
GACGGCCTGG 
GACCTGGAGG 
AAGAGAGAGG 
CGCAAGGAGC 
CGGGAGTTGA 
CGGGCAGCCT 
CGCAAAGCGC 
CCGCCCAGCC 
TCCCAGCTGG 
GCACTGGAGC 
TCCCACCGGA 
GAGGCCGGGC 
CCCGCAGAGG 
GAGCAGGTGC 
GGCCCCCGGG 
TGTGAATTTG 
TCGAATGACA 
CCACCCACAT 
GCCCCCGGTG 
CCTTTGCCTG 
GTGGACCCCG 
CGGAAGCCCC 
GCTGCCAAAA 
AGTGAGCCCA 
ACTGCCACTC 
CCCAAGTCCC 
GTGGATGAGG 
CAGCGCAAGG 
TGGGACCGTG 
TGGTACGCAG 
GGCATGGTGT 
CGCCGACACG 
TAAACTGTGA 



CGCGGCTGGC 
GCATCACGCC 
AGAAGATGGG 
AGCGCACGCT 
AGGTGGTGCG 
TCCGCCTGCA 
GGCCGGGGCG 
GCCTCCTGGC 
CAGCACGGGC 
AGAAAGACCC 
CTTCTGTGCC 
CCAGCACGTC 
TCCGATGTGG 
TGGCCACGCC 
TGCCTTTGGC 
GCCCCGCAGA 
CAGACGCCTC 
TGGGCTCCCC 
TGCCGCCATC 
CGCGGCGCTC 
AGCATCGCAA 
GCAGTGCCCG 
CGGTCAGCGA 
CGGCAGACTC 
ACCCCCTCAA 
AGATGCTGCC 
TGGCCCGCCC 
CCAAGGGGCT 
GTGAGAAGGG 
GAGGCCCGTC 
CGGTCTACCT 
AGTTCTTCCA 
AGGAAGGCAT 
ACCTGCAGGT 
AGACGCACGC 
CCATGCAGGA 
CCCCCCACTC 
CTACACTTG 



Seq ID NO: 213 Protein sequence: 
Protein Accession #: NP 060644.1 



MGVGRLDMYV 
LQHLRFLREP 
RAEAPRKTEK 
TSHSGFPPVA 
LAASSIPRPR 
SPHSTEVDES 
RKAVPMAPAP 
DSDEDTEGFG 
RPNSRAAAPK 
PSGSASSRPG 
GMRAVLDALL 
QDDAFPACKV 



LHPPSAGAER 
WTPQDLEGP 
EAKTPRELKK 
NGPRSPPSLR 
TPSPESHRSP 
LSVSFEQVLP 
ASPGSSNDSS 
VPRHDPLPDP 
ATPVAAAKTK 
VSATPPKSPV 
ASKQHWDRDL 
EF 



TLASVCALIiV 

GRAESKESVG 

DPKPSVSRTQ 

CGEASPPSAA 

AEGSERliSLS 

PSAPTSEAGL 

ARSQERAGGL 

LKVPPPLPDP 

GLAGGDRASR^ 

YLDLAYLPSG 

QVTLIPTFDS 



WHPAGPGEKV 
SRDSSKREGL 
PREVRRAASS 
CGSPASQLVA 
PLRGGEAGPD 
SLPLRGPRAR 
GAEETPPTSV 
SSICMVDPEM 
PLSARSEPSE 
SSAHLVDEEF 
VAMHTWYAET 



VRVLFPGCTP 
LATHPRPGQE 
VPNLKKTNAQ 
TPSLELGPIP 
ASPTVTTPTV 
RSASPHDVDL 
SESLPTLSDS 
LPPKTARQTE 
KGGRAPLSRK 
FQRVRALCYV 
HARHQALGIT 



PACLLDGLVR 
RPGVARKEPA 
AAPKPRKAPS 
AGEEKAIiELP 
TTPSLPAEVG 
CLVSPCEFEH 
DPVPLAPGAA 
NVSRTRKPLA 
SSTPKTATRG 
ISGQDQRKEE 
VLGSNGMVSM 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 



Seq ID NO: 214 DNA sequence 

Nucleic Acid Accession #: NM_002019.1 

Coding sequence: 250-4266 (underlined sequences correspond to start and stop codons) 



75 



1 11 21 31 41 51 

I I I I I I 

GCGGACACTC CTCTCGGCTC CTCCCCGGCA GCGGCGGCGG CTCGGAGCGG GCTCCGGGGC 

TCGGGTGCAG CGGCC AG CGG GCCTGGCGGC GAGGATTACC CGGGGAAGTG GTTGTCTCCT 



60 
120 
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GGCTGGAGCC GCGAGACGGG CGCTCAGGGC GCGGGGCCGG CGGCGGCGAA CGAGAGGACG 180 

GACTCTGGCG GCCGGGTCGT TGGCCGGGGG AGCGCGGGCA CCGGGCGAGC AGGCCGCGTC 24 0 

GCGCTCACC A TGG TCAGCTA CTGGGACACC GGGGTCCTGC TGTGCGCGCT GCTCAGCTGT 300 

CTGCTTCTCA CAGGATCTAG TTCAGGTTCA AAATTAAAAG ATCCTGAACT GAGTTTAAAA 360 

GGCACCCAGC ACATCATGCA AGCAGGCCAG ACACTGCATC TCCAATGCAG GGGGGAAGCA 420 

GCCCATAAAT GGTCTTTGCC TGAAATGGTG AGTAAGGAAA GCGAAAGGCT GAGCATAACT 4 80 

AAATCTGCCT GTGGAAGAAA TGGCAAACAA TTCTGCAGTA CTTTAACCTT GAACACAGCT 540 

CAAGCAAACC ACACTGGCTT CTACAGCTGC AAATATCTAG CTGTACCTAC TTCAAAGAAG 600 

AAGGAAACAG AATCTGCAAT CTATATATTT ATTAGTGATA CAGGTAGACC TTTCGTAGAG 660 

ATGTACAGTG AAATCCCCGA AATTATACAC ATGACTGAAG GAAGGGAGCT CGTCATTCCC 720 

TGCCGGGTTA CGTCACCTAA CATCACTGTT ACTTTAAAAA AGTTTCCACT TGACACTTTG 780 

ATCCCTGATG GAAAACGCAT AATCTGGGAC AGTAGAAAGG GCTTCATCAT ATCAAATGCA 840 

ACGTACAAAG AAATAGGGCT TCTGACCTGT GAAGCAACAG TCAATGGGCA TTTGTATAAG 900 

ACAAACTATC TCACACATCG ACAAACCAAT ACAATCATAG ATGTCCAAAT AAGCACACCA 960 

CGCCCAGTCA AATTACTTAG AGGCCATACT CTTGTCCTCA ATTGTACTGC TACCACTCCC 1020 

TTGAACACGA GAGTTCAAAT GACCTGGAGT TACCCTGATG AAAAAAATAA GAGAGCTTCC 1080 

GTAAGGCGAC GAATTGACCA AAG C AATTCC CATGCCAACA TATTCTACAG TGTTCTTACT 1140 

ATTGACAAAA TGCAGAACAA AGACAAAGGA CTTTATACTT GTCGTGTAAG GAGTGGACCA 1200 

TCATTCAAAT CTGTTAACAC CTCAGTGCAT ATATATGATA AAGCATTCAT CACTGTGAAA 1260 

CATCGAAAAC AGCAGGTGCT TGAAACCGTA GCTGGCAAGC GGTCTTACCG GCTCTCTATG 1320 

AAAGTGAAGG CATTTCCCTC GCCGGAAGTT GTATGGTTAA AAGATGGGTT ACCTGCGACT 1380 

GAGAAATCTG CTCGCTATTT GACTCGTGGC TACTCGTTAA TTATCAAGGA CGTAACTGAA 1440 

GAGGATGCAG GGAATTATAC AATCTTGCTG AGCATAAAAC AGTCAAATGT GTTTAAAAAC 15 00 

CTCACTGCCA CTCTAATTGT CAATGTGAAA CCCCAGATTT ACGAAAAGGC CGTGTCATCG 1560 

TTTCCAGACC CGGCTCTCTA CCCACTGGGC AGCAGACAAA TCCTGACTTG TACCGCATAT 1620 

GGTATCCCTC AACCTACAAT CAAGTGGTTC TGGCACCCCT GTAACCATAA TCATTCCGAA 1680 

GCAAGGTGTG ACTTTTGTTC CAATAATGAA GAGTCCTTTA TCCTGGATGC TGACAGCAAC 1740 

ATGGGAAACA GAATTGAGAG CATCACTCAG CGCATGGCAA TAATAGAAGG AAAGAATAAG 1800 

ATGGCTAGCA CCTTGGTTGT GGCTGACTCT AGAATTTCTG GAATCTACAT TTGCATAGCT 1860 

TCCAATAAAG TTGGGACTGT GGGAAGAAAC ATAAGCTTTT ATATCACAGA TGTGCCAAAT 1920 

GGGTTTCATG TTAACTTGGA AAAAATGCCG ACGGAAGGAG AGGACCTGAA ACTGTCTTGC 1980 

ACAGTTAACA AGTTCTTATA CAGAGACGTT ACTTGGATTT TACTGCGGAC AGTTAATAAC 204 0 

AGAACAATGC ACTACAGTAT TAGCAAGCAA AAAATGGCCA TCACTAAGGA GCACTCCATC 2100 

ACTCTTAATC TTACCATCAT GAATGTTTCC CTGCAAGATT CAGGCACCTA TGCCTGCAGA 2160 

GCCAGGAATG TATACACAGG GGAAGAAATC CTCCAGAAGA AAGAAATTAC AATCAGAGAT 2220 

CAGGAAGCAC CATACCTCCT GCGAAACCTC AGTGATCACA CAGTGGCCAT CAGCAGTTCC 2280 

ACCACTTTAG ACTGTCATGC TAATGGTGTC CCCGAGCCTC AGATCACTTG GTTTAAAAAC 234 0 

AACCACAAAA TACAACAAGA GCCTGGAATT ATTTTAGGAC CAGGAAGCAG CACGCTGTTT 24 00 

ATTGAAAGAG TCACAGAAGA GGATGAAGGT GTCTATCACT GCAAAGCCAC CAACCAGAAG 2460 

GGCTCTGTGG AAAGTTCAGC ATACCTCACT GTTCAAGGAA CCTCGGACAA GTCTAAT CTG 2520 

GAGCTGATCA CTCTAACATG CACCTGTGTG GCTGCGACTC TCTTCTGGCT CCTATTAACC 25 80 

CTCCTTATCC GAAAAATGAA AAGGTCTTCT TCTGAAATAA AGACTGACTA CCTATCAATT 2640 

ATAATGGACC CAGATGAAGT TCCTTTGGAT GAGCAGTGTG AGCGGCTCCC TTATGATGCC 2 700 

AGCAAGTGGG AGTTTGCCCG GGAGAGACTT AAACTGGGCA AATCACTTGG AAGAGGGGCT 2760 

TTTGGAAAAG TGGTTCAAGC ATCAGCATTT GGCATTAAGA AATCACCTAC GTGCCGGACT 2820 

GTGGCTGTGA AAATGCTGAA AGAGGGGGCC ACGGCCAGCG AGTACAAAGC TCTGATGACT 2880 

GAGCTAAAAA TCTTGACCCA CATTGGCCAC CATCTGAACG TGGTTAACCT GCTGGGAGCC 2940 

TGCACCAAGC AAGGAGGGCC TCTGATGGTG ATTGTTGAAT ACTGCAAATA TGGAAATCTC 3000 

TCCAACTACC TCAAGAGCAA ACGTGACTTA TTTTTTCTCA ACAAGGATGC AGCACTACAC 3060 

ATGGAGCCTA AGAAAGAAAA AATGGAGCCA GGCCTGGAAC AAGGCAAGAA ACCAAGACTA 3120 

GATAGCGTCA CCAGCAGCGA AAGCTTTGCG AGCTCCGGCT TTCAGGAAGA TAAAAGTCTG 3180 

AGTGATGTTG AGGAAGAGGA GGATTCTGAC GGTTTCTACA AGGAGCCCAT CACTATGGAA 324 0 

GATCTGATTT CTTACAGTTT TCAAGTGGCC AGAGGCATGG AGTTCCTGTC TTCCAGAAAG 3300 

TGCATTCATC GGGACCTGGC AGCGAGAAAC ATTCTTTTAT CTGAGAACAA CGTGGTGAAG 3360 

ATTTGTGATT TTGGCCTTGC CCGGGATATT TATAAGAACC CCGATTATGT GAGAAAAGGA 3420 

GATACTCGAC TTCCTCTGAA ATGGATGGCT CCCGAATCTA TCTTTGACAA AATCTACAGC 3480 

ACCAAGAGCG ACGTGTGGTC TTACGGAGTA TTGCTGTGGG AAATCTTCTC CTTAGGTGGG 3540 

TCTCCATACC CAGGAGTACA AATGGATGAG GACTTTTGCA GTCGCCTGAG GGAAGGCATG 3600 

AGGATGAGAG CTCCTGAGTA CTCTACTCCT GAAATCTATC AGATCATGCT GGACTGCTGG 3 660 

CACAGAGACC CAAAAGAAAG GCCAAGATTT GCAGAACTTG TGGAAAAACT AGGTGATTTG 3720 

CTTCAAGCAA ATGTACAACA GGATGGTAAA GACTACATCC CAATCAATGC CATACTGACA 3780 

GGAAATAGTG GGTTTACATA CTCAACTCCT GCCTTCTCTG AGGACTTCTT CAAGGAAAGT 384 0 

ATTTCAGCTC CGAAGTTTAA TTCAGGAAGC TCTGATGATG TCAGATATGT AAATGCTTTC 3900 

AAGTTCATGA GCCTGGAAAG AATCAAAACC TTTGAAGAAC TTTTACCGAA TGCCACCTCC 3960 

ATGTTTGATG ACTACCAGGG CGACAGCAGC ACTCTGTTGG CCTCTCCCAT GCTGAAGCGC 4020 

TTCACCTGGA CTGACAGCAA ACCCAAGGCC TCGCTCAAGA TTGACTTGAG AGTAACCAGT 4 080 

AAAAGTAAGG AGTCGGGGCT GTCTGATGTC AGCAGGCCCA GTTTCTGCCA TTCCAGCTGT 4140 

GGGCACGTCA GCGAAGGCAA GCGCAGGTTC ACCTACGACC ACGCTGAGCT GGAAAGGAAA 4 2 00 

ATCGCGTGCT GCTCCCCGCC CCCAGACTAC AACTCGGTGG TCCTGTACTC CACCCCACCC 4 260 

ATCTAGAGTT TGACACGAAG CCTTATTTCT AG AAG C AC AT GTGTATTTAT ACCCCCAGGA 4320 

AACTAGCTTT TGCCAGTATT ATGCATATAT AAGTTTACAC CTTTATCTTT CCATGGGAGC 43 80 

CAGCTGCTTT TTGTGATTTT TTTAATAGTG CTTTTTTTTT TTGACTAACA AGAATGTAAC 444 0 

TCCAGATAGA GAAATAGTGA CAAGTGAAGA ACACTACTGC TAAATCCTCA TGTTACTCAG 4 5 00 

TGTTAGAGAA ATCCTTCCTA AACCCAATGA CTTCCCTGCT CCAACCCCCG CCACCTCAGG 4560 

GCACGCAGGA CCAGTTTGAT TGAGGAGCTG CACTGATCAC CCAATGCATC ACGTACCCCA 4620 

CTGGGCCAGC CCTGCAGCCC AAAACCCAGG GCAACAAGCC CGTTAGCCCC AGGGGATCAC 4680 
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TGGCTGGCCT GAGCAACATC TCGGGAGTCC TCTAGCAGGC CTAAGACATG TGAGGAGGAA 4740 

AAGGAAAAAA AGCAAAAAGC AAGGGAGAAA AGAGAAACCG GGAGAAGGCA TGAGAAAGAA 4 800 

TTTGAGACGC ACCATGTGGG CACGGAGGGG GACGGGGCTC AGCAATGCCA TTTCAGTGGC 4860 

TTCCCAGCTC TGACCCTTCT ACATTTGAGG GCCCAGCCAG GAGCAGATGG ACAGCGATGA 492 0 

5 GGGGACATTT TCTGGATTCT GGGAGGCAAG AAAAGGACAA ATATCTTTTT TGGAACTAAA 4 980 

GCAAATTTTA GACCTTTACC TATGGAAGTG GTTCTATGTC CATTCTCATT CGTGGCATGT 5040 

TTTGATTTGT AGCACTGAGG GTGGCACTCA ACTCTGAGCC CATACTTTTG GCTCCTCTAG 5100 

TAAGATGCAC TGAAAACTTA GCCAGAGTTA GGTTGTCTCC AGGCCATGAT GGCCTTACAC 5160 

TGAAAATGTC ACATTCTATT TTGGGTATTA ATATATAGTC CAGACACTTA ACTCAATTTC 5220 

10 TTGGTATTAT TCTGTTTTGC ACAGTTAGTT GTGAAAGAAA GCTGAGAAGA ATGAAAATGC 5280 

AGTCCTGAGG AGAGTTTTCT CCATATCAAA ACGAGGGCTG ATGGAGGAAA AAGGTCAATA 5340 

AGGTCAAGGG AAGACCCCGT CTCTATACCA ACCAAACCAA TTCACCAACA CAGTTGGGAC 5400 

CCAAAACACA GGAAGTCAGT CACGTTTCCT TTTCATTTAA TGGGGATTCC ACTATCTCAC 5460 

ACTAATCTGA AAGGATGTGG AAGAGCATTA GCTGGCGCAT ATTAAGCACT TTAAGCTCCT 5520 

15 TGAGTAAAAA GGTGGTATGT AATTTATGCA AGGTATTTCT CCAGTTGGGA CTCAGGATAT 5580 

TAGTTAATGA GCCATCACTA GAAGAAAAGC CCATTTTCAA CTGCTTTGAA ACTTGCCTGG 5640 

GGTCTGAGCA TGATGGGAAT AGGGAGACAG GGTAGGAAAG GGCGCCTACT CTTCAGGGTC 5700 

TAAAGATCAA GTGGGCCTTG GATCGCTAAG CTGGCTCTGT TTGATGCTAT TTATGCAAGT 5760 

TAGGGTCTAT GTATTTAGGA TGCGCCTACT CTTCAGGGTC TAAAGATCAA GTGGGCCTTG 582 0 

20 GATCGCTAAG CTGGCTCTGT TTGATGCTAT TTATGCAAGT TAGGGTCTAT GTATTTAGGA 5880 

TGTCTGCACC TTCTGCAGCC AGTCAGAAGC TGGAGAGGCA ACAGTGGATT GCTGCTTCTT 594 0 

GGGGAGAAGA GTATGCTTCC TTTT AT CCAT GTAATTTAAC TGTAGAACCT GAGCTCTAAG 6000 

TAACCGAAGA ATGTATGCCT CTGTTCTTAT GTGCCACATC CTTGTTTAAA GGCTCTCTGT 6060 

ATGAAGAGAT GGGACCGTCA TCAGCACATT CCCTAGTGAG CCTACTGGCT CCTGGCAGCG 6120 

25 GCTTTTGTGG AAGACTCACT AGCCAGAAGA GAGGAGTGGG ACAGTCCTCT CCACCAAGAT 6180 

CTAAATCCAA ACAAAAGCAG GCTAGAGCCA GAAGAGAGGA CAAATCTTTG TTGTTCCTCT 6240 

TCTTTACACA TACGCAAACC ACCTGTGACA GCTGGCAATT TTATAAATCA GGTAACTGGA 6300 

AGGAGGTTAA ACTCAGAAAA AAGAAGACCT CAGTCAATTC TCTACTTTTT TTTTTTTTTT 63 60 

TCCAAATCAG ATAATAGCCC AGCAAATAGT GATAACAAAT AAAACCTTAG CTGTTCATGT 6420 

30 CTTGATTTCA ATAATTAATT CTTAATCATT AAGAGACCAT AATAAATACT CCTTTTCAAG 64 8 0 

AGAAAAGCAA AACCATTAGA ATTGTTACTC AGCTCCTTCA AACTCAGGTT TGTAGCATAC 654 0 

ATGAGTCCAT CCATCAGTCA AAGAATGGTT CCATCTGGAG TCTTAATGTA GAAAGAAAAA 6600 

TGGAGACTTG TAATAATGAG CTAGTTACAA AGTGCTTGTT CATTAAAATA GCACTGAAAA 6660 

TTGAAACATG AATTAACTGA TAATATTCCA ATCATTTGCC ATTTATGACA AAAATGGTTG 6720 

35 GCACTAACAA AGAACGAGCA CTTCCTTTCA GAGTTTCTGA GATAATGTAC GTGGAACAGT 6780 

CTGGGTGGAA TGGGGCTGAA ACCATGTGCA AGTCTGTGTC TTGTCAGTCC AAGAAGTGAC 6840 . 

ACCGAGATGT TAATTTTAGG GACCCGTGCC TTGTTTCCTA GCCCACAAGA ATGCAAACAT 6900 

CAAACAGATA CTCGCTAGCC TCATTTAAAT TGATTAAAGG AGGAGTGCAT CTTTGGCCGA 6960 

CAGTGGTGTA ACTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGTGTGTG TGTGGGTGTG 7020 

40 GGTGTATGTG TGTTTTGTGC ATAACTATTT AAGGAAACTG GAATTTTAAA GTTACTTTTA 70 80 

TACAAACCAA GAATATATGC TACAGATATA AGACAGACAT GGTTTGGTCC TATATTTCTA 7140 

GTCATGATGA ATGTATTTTG TATACCATCT TCATATAATA TACTTAAAAA TATTTCTTAA 7200 

TTGGGATTTG TAATCGTACC AACTTAATTG ATAAACTTGG CAACTGCTTT TATGTTCTGT 7260 

CTCCTTCCAT AAATTTTTCA AAATACTAAT TCAACAAAGA AAAAGCTCTT TTTTTTCCTA 7320 

45 AAATAAACTC AAATTTATCC TTGTTTAGAG CAGAGAAAAA TTAAGAAAAA CTTTGAAATG 73 80 

GTCTCAAAAA ATTGCTAAAT ATTTTCAATG GAAAACTAAA TGTTAGTTTA GCTGATTGTA 744 0 

TGGGGTTTTC GAACCTTTCA CTTTTTGTTT GTTTTACCTA TTTCACAACT GTGTAAATTG 7500 

CCAATAATTC CTGTCCATGA AAATGCAAAT TATCCAGTGT AGATATATTT GACCATCACC 7560 

CTATGGATAT TGGCTAGTTT TGCCTTTATT AAGCAAATTC ATTTCAGCCT GAATGTCTGC 762 0 

50 CTATATATTC TCTGCTCTTT GTATTCTCCT TTGAACCCGT TAAAACATCC TGTGGCACTC 

Seq ID NO: 215 Protein sequence; 
Protein Accession #: NP_002010.1 

55 

1 11 21 31 41 51 

I I I I I 1 

MVSYWDTGVL LCAIjIiSCLLL TGSSSGSKLK DPELSLKGTQ HIMQAGQTLH LQCRGEAAHK 60 

WSLPEMVSKE SERLSITKSA CGRNGKQFCS TLTLNTAQAN HTGFYSCKYL AVPTSKKKET 120 

60 ESAIYIFISD TGRPFVEMYS EIPEIIHMTE GRELVIPCRV TSPNITVTLK KFPLDTLIPD 180 

GKRIIWDSRK GFIISNATYK EIGLLTCEAT VNGHLYKTNY LTHRQTNTII DVQISTPRPV 24 0 

KLLRGHTLVL NCTATTPLNT RVQMTWSYPD EKNKRASVRR RIDQSNSHAN IFYSVLTIDK 3 00 

MQNKDKGLYT CRVRSGPSFK SVNTSVHIYD KAFITVKHRK QQVLETVAGK RSYRLSMKVK 360 

AFPSPEWWL KDGLPATEKS ARYLTRGYSL IIKDVTEEDA GNYTILIjSIK QSNVFKNLTA 4 20 

65 TLIVNVKPQI YEKAVSSFPD PALYPLGSRQ ILTCTAYGIP QPTIKWFWHP CNHNHSEARC 4 80 

DFCSNNEESF ILDADSNMGN RIESITQRMA IIEGKNKMAS TLWADSRIS GIYICIASNK 540 

VGTVGRNISF YITDVPNGFH VNLEKMPTEG EDLKL»SCTVN KFLYRDVTWI LLRTVNNRTM 600 

HYSISKQKMA ITKEHSITLN LTIMNVSLQD SGTYACRARN VYTGEEILQK KEITIRDQEA 660 

PYLLRNLSDH TVAISSSTTL DCHANGVPEP QITWFKNNHK IQQEFGIILG PGSSTLFIER 720 

70 VTEEDEGVYH CKATNQKGSV ESSAYLTVOG TSDKSNLELI TLTCTCVAAT LFWLLLTLLI 780 

RKMKRSSSEI KTDYLSIIMD PDEVPLDEQC ERLPYDASKW EFARERLKLG KSLGRGAFGK 840 

WQASAFGIK KSPTCRTVAV KMLKEGATAS EYKALMTELK ILTHIGHHLN WNLLGACTK 900 

QGGPLMVIVE YCKYGNLSNY LKSKRDLFFL NKDAALHMEP KKEKMEPGLE QGKKPRLDSV 960 

TSSESFASSG FQEDKSLSDV EEEEDSDGFY KEPITMEDLI SYSFQVARGM EFLSSRKCIH 1020 

75 RDLAARNILL SENNWKICD FGLARDIYKN PDYVRKGDTR LPLKWMAPES IFDKIYSTKS 1080 

DVWSYGVLLW EIFSLGGSPY PGVQMDEDFC SRLREGMRMR APEYSTPEIY QIMLDCWHRD 114 0 



281 
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PKERPRFAEL VEKLGDLLQA NVQQDGKDYI PINAILTGNS GFTYSTPAFS EDFFKESISA 1200 

PKFNSGSSDD VRYVNAFKFM SLERIKTFEE LLPNATSMFD DYQGDSSTLL ASPMLKRFTW 1260 

TDSKPKASLK IDLRVTSKSK ESGLSDVSRP SFCHSSCGHV SEGKRRFTYD HAELERKIAC 1320 
CS PPPDYNSV VLYSTPPI 



Seg ID NO: 216 DNA sequence 

Nucleic Acid Accession #: NM_024689 

Coding sequence: 76-624 (underlined sequences correspond to start and stop codons) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



CTCTTTGGCC 
AGCCCAGAGC 
TGGCTGGCCC 
TCCCTTTCTT 
GGTCTTGATA 
GAAATAAGAT 
TCTTATCCTG 
CTGGTCAGCA 
CCAAAGACCT 
CAGGCCAAGC 
AAGTTCCTGT 
AGAACTTCCA 
AGTAATCATT 
AGAATTCACC 
GACAACTCCA 
TGAGCCCACC 
CTGGACAGGT 
TTACCAAAGG 
CCCTAGTGAT 
CTTTTACTTT 
TTTATGTTTC 
TTTTTTGAGC 
GGGGAAGCTA 
TAGATTTCGG 
GTGTATCAGG 
AGATTTATTT 
TGGTAGGCCT 
ATTCCTTGTT 
GCCCACCCAG 
TCACATCTAT 
GAATACAGCC 
TTTTATCGAC 
AGCATCACCC 
CAATAGAGAC 
AGGATGAGGA 
TTTTTATTTT 
AAGTGATTTA 
ATGAAACCTT 
TAATTGTGGG 
TGTAAGTGGG 
GTCTCTTAAG 
TAGCAGTTAT 



11 
I 

AAGCCCTGCC 
CCAAG ATG GA 
TGCTGCTGTG 
CTCTGGTGCC 
AATGCAATGC 
CTGACAACTG 
CAAATTACTC 
AATATCAAAA 
GCAGCATTGA 
GCCTCACGCC 
GTATGCTGAG 
GAAAGTGTTA 
AAAGAGGCTT 
AACACACAGG 
AAGCCCCGGC 
CCAATCCAGA 
CTTCCCTATG 
CCCACATAAC 
GGATGAACTC 
TTAAGTACCT 
CATTCTGGTA 
AAACACTCGG 
CTTCTCCTCT 
TCTTCATTGC 
CAGGGTTCTA 
CAAAGAATTG 
GCAATCTGTA 
CCTTAAAAAA 
ATTACCTAGA 
GAAATGCCTT 
TAGCCAAGTT 
CGTCTTCAGA 
TGAACCAAAG 
ATTGACTGGT 
AACCAGGCAC 
TCACTGGGAG 
GAAACTCCAA 
ATTTTATTGG 
TTTGCACATG 
ACCTTGGGGA 
CCTGTTCCTG 
CTATTGTTGT 



21 



TCTGTACAGC 
GCCCCAGCTG 
GGTCTCAGCC 
CCAAGTCAGA 
CTGCATCGGG 
GCTGGCTTCC 
AGATGATTCC 
CGAGATCTCA 
GCGTGTCCTG 
GGACCTGGTG 
ATAACACCAG 
GCCTTCTCCC 
CTGCATCAAA 
CCCACCAGCA 
TCTTTCCACC 
TGTGATCCCC 
AGATAGAACC 
TTCTAAATTT 
TCTTATCTCT 
CCATCAGAGT 
AGAACTCTTT 
GGGTATGGAT 
ATTCAGATTT 
TGTCCATTTT 
CCAGAGAAAC 
ATTTACATGA 
AACCTTTGGG 
ATCTGTTTTT 
TAATCTCTTT 
CACAGCAACA 
GACACATAAA 
CTGTTAAGGA 
GCCCCTATCA 
TGGCTGGCTT 
GGGAGAGGGA 
GTGGTAAGTT 
AG C AATTGGT 
AAATG GTTGG 
GCCAGCACAT 
GGAGCTGCCT 
CTACAGTTAT 
GTATTAAACC 



31 
I 

CTCGAGTGGA 
GGGCCTGAGG 
CTGAGCTGTT 
ACCAGCTACA 
ACATCTATTT 
CACCTTGGAC 
AAAATCTGGC 
GACAGGAAAA 
CGGAAAACAG 
CAGGACTGTC 
TGAAAAAGCC 
AACTGTGTTA 
CCTTCACATG 
ACAGGCTACC 
ACACTGTGGT 
CTGTGATCTA 
TGATAAGGAG 
TGGTCTGGTC 
GGCTTCTAGA 
CATGAAATCA 
AAATGAGGAC 
GAAAGCCAAT 
CACTAAAATC 
TGTAATGAAC 
AGAACCAGTA 
TTGTGGGGAT 
CAGGAGCTGA 
GTTCTTAAGG 
TACTTAAAGT 
CCTAGATTAG 
ATTAACCATC 
TTGTGGTAGA 
AGTAACAATA 
CCCAAGGGAT 
GGGGCAACAG 
AGCCCTGTTG 
AATCCCCAAA 
TTTCCCAATT 
GCCAAACAGA 
C CAT CAT AAA 
AGAGGTTGCT 
ATTTCAACAC 



41 
I 

CAGCCAGAGG 
CTGCCGCCCT 
CTTTCTCCTT 
ATTTTGGAAG 
GCAAGAAGTT 
TGCCTCCCGA 
GCCCTGTGGA 
TCTGTGCCTC 
AGAGGTTCCA 
ACCAGGGCCA 
TGGCATGGAG 
TACCAACCAC 
CAGCTCCCAT 
TTTGCACAAT 
CCCCTAGATG 
CTTCTGGCAA 
CTAGGGCAAT 
TGAAGGAAAA 
GGGAAAAAAA 
CCTGTCAAGA 
ACTGCTGATT 
CGCAGGTCAA 
TTCCAAGATG 
GAGTGTTTTT 
GGAGATACAT 
TGGCAAGTCC 
TGCTGTAGTT 
GCTTTGAATG 
AAACTGATTG 
CATTCAATTG 
ACAGCAACAT 
GAACTGTGAC 
TAGCCAAGCA 
AGCACCAGAC 
AGGTCCAGGG 
CCCATGTATG 
ATGGGTGTAT 
CTGTTTGCCA 
AGTAGACAAA 
GGGAGGGGTT 
CAGAACCTTC 
AT 



51 



CTGCAGCTGG 
CCGCCCTGGC 
GCCAGCTTCT 
GACTTTCCTC 
CTTTAAAGAA 
TTCCTTGCTT 
GATCTTTAGA 
TGCATCAGCC 
GAAATGGCTG 
GAGAGAACTA 
CCCAGCACTG 
ATTTTCAAAT 
GCCACCCTCC 
ATTCTCTGAT 
GGGCTGTTGC 
GATTCTCAGT 
TCTGACAACA 
CCTGTTCTCG 
AAGCATACCT 
CTATCTATCT 
GCTGGTGATG 
ATGACTCCTT 
AAAGCAAATC 
CCTTTAGCTA 
ATACATGTCC 
AAAATCCATA 
TGCAGATAGA 
ATTGGATCAG 
TAGGTGCTAA 
AATAACTGGG 
GCCTGCTAAA 
AGCCACTCTC 
AAATTCCAGT 
AAGAAATGCA 
TTTGGTTATC 
CAGATGGGAG 
CTGGTTTGAA 
TTGGCCAATA 
GGTCTCACTC 
AGTAAAAATG 
TCAGCAAATA 



Seq ID NO: 217 Protein sequence ; 
Protein Accession #: NP_078965.1 

1 11 21 31 41 51 

l I l I I I 

MEPQLGPEAA ALRPGWLALL LWVSALSCSF SLPASSLSSL VPQVRTSYNF GRTFLGLDKC 
NACIGTSICK KFFKEEIRSD NWLASHLGLP PDSLLSYPAN YSDDSKIWRP VEIFRLVSKY 
QNEISDRKIC ASASAPKTCS IERVLRKTER FQKWLQAKRL TPDLVQDCHQ GQRELKFLCM 
IiR 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 



60 
120 
180 



Seq ID NO: 218 DNA sequence 

Nucleic Acid Accession #: AF075027.1 

Coding sequence: 3-269 (underlined sequences correspond to start and stop codons) 



70 | 



li 



21 



75 



GATTAATTAA 
TGGGATGGCT 
CTTTCTGTGC 
CTTTCTCCCT 
CCATAGATCT 
GACAAAAGTG 



GTGCTTTAAA 
GTAGCTTGAG 
CAAGGTGCTG 
GGAGTAAGGC 
CATGCATCAT 
GTGGGTTTTC 



CGGTCTTGGT 
TTGAATTTTA 
TGTTACGGGA 
CGAAGACTGG 
TAATGCCATA 
ATTGTCTTCA 



31 
I 

AAATATTCCG 
ACTGTCCTCA 
GAGAGTGACT 
ATTACTACAC 
TGACATTGCC 
CTGATTGTCA 



41 



CGGGAGCTGG 
TTCTGGGTTT 
GGAAAGTAAC 
GCCTAGACGT 
ATTTTCTTTC 
ATGCATTAAT 



51 
I 

GGAGGACCGT 60 

TGTCGCTCTG 120 

AAAGCTGAAT 180 

GACACTACAC 24 0 

TCAGTTCACG 300 

AAAGAAGATG 360 
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10 



TGTGGT 

Seq ID NO: 219 Protein sequence; 
Protein Accession #: AF075 027 

1 11 21 31 41 51 

I I I 1 I I 

ERKWQCHMAL MMHEIYGCSV TSRRWIQSS ALLQGERFSF VTPQSLSPVT QHLGTESRAT $0 
KPRMRTVKIQ LKLQPSQRSS PAPAEYLPRP FKALN 

Seq ID NO: 220 DNA sequence 
Nucleic Acid Accession #: AL133411.8 

Coding sequence: 1-1395 (underlined sequences correspond to start and stop codons) 

15 1 11 21 31 41 51 

I I I I I I 

ATGGGCAAGG ACTTCATGAC TAAAACACTA AAAGCAATGG CAACAAAAGC CAAAATTGAC 60 

AAATGGGATC TAATCAAATT AAAGAGCTTC CGCACAGCAA AAGAAACTAT TATCAGAGTG 120 

AACAGGCAAC CTACAGAATG GGAGAAAAAT TTTGCAATGT ATCCATCTGA CAAAGGGCTG 180 

20 ACATCCAGAA TCTATAAGGA ACTTAAACAA TTTTACAAGA AAAAACCAAA CAACGCCATC 240 

AAAAAGGACA TGGATGAAGC TGGAAACCGT CATTCTCAGA AAACTAACAC AGGAACAGAA 300 

AACCAAACAC CACATGTTCT CACTCATAAG TGGGAGTTGA ACAATGAGAA CACATGGACA 360 

CAGGGAGGGG AACATCACAC ACTGGGGCCT GTCAGAAGCC CCTCTGGCCT CCTGGCTGGC 420 

CTTGAACATG CTGGGAGGAA ATTACAATTC. ATCCATGGGC TGTTTACCCT TGAAAATGAA 480 

25 TGGGCCCAGG AACAATCCAT AATACAAAAG AAATATGCAT TATGGATTGG AACCAAGCAG 540 

ATCTGGGTGG CACAAACTCC TGGTGAATCT ATCTCCAGTT CACCAGCATT GCCTAATGTG 600 

CTACCTTTAA ATGAAGATGT TAATAAGCAG GAAGAAAAGA ATGAAGATCA TACTCCCAAT 660 

TATGCTCCTG CTAATGAGAA AAATGGCAAT TATTATAAAG ATATAAAACA ATATGTGTTC 720 

ACAACACAAA ATCCAAATGG CACTGAGTCT GAAATATCTG TGAGAGCCAC AACTGACCTG 780 

30 AATTTTGCTC TAAAAAACGA TAAAACTGTC AATGCAACTA CATATGAAAA ATCCACCATT 84 0 

GAAGAAGAAA CAACTACTAG CGAACCCTCT CATAAAAATA TTCAAAGATC AACCCCAAAC 900 

GTGCCTGCAT TTTGGACAAT GTTAGCTAAA GCTATAAATG GAACAGCAGT GGTCATGGAT 960 

GATAAAGATC AATTATTTCA CCCAATTCCA GAGTCTGATG TGAATGCTAC ACAGGGAGAA 1020 

AATCAGCCAG ATCTAGAGGA TCTGAAGATC AAAATAATGC TGGGAATCTC GTTGATGACC 1080 

35 CTCCTCCTCT TTGTGGTCCT CTTGGCATTC TGTAGTGCTA CACTGTACAA ACTGAGGCAT 1140 

CTGAGTTATA AAAGTTGTGA GAGTCAGTAC TCTGTCAACC CAGAGCTGGC CACGATGTCT 1200 

TACTTTCATC CATCAGAAGG TGTTTCAGAT ACATCCTTTT CCAAGAGTGC AGAGAGCAGC 1260 

ACATTTTTGG GTACCACTTC TTCAGATATG AGAAGATCAG GCACAAGAAC ATCAGAATCT 1320 

AAGATAATGA CGGATATCAT TTCCATAGGC TCAGATAATG AGATG CATGA AAACGATGAG 13 80 

40 TCGGTTACCC GGTGA 



45 



Seq ID NO: 221 Protein sequence 
Protein Accession #: AL133411.8 



1 11 21 31 41 51 

I I I I I I 

MGKDFMTKTIj KAMATKAKID KWDLIKLKSF RTAKETIIRV NRQPTEWEKN FAMYPSDKGL 60 

50 TSRIYKELKQ FYKKKPNNAI KKDMDEAGNR HSQKTNTGTE NQTPHVLTHK WELNNENTWT 120 

QGGEHHTLGP VRSPSGLLAG LEHAGRKLQF IHGLFTLENE WAQEQSIIQK KYALWIGTKQ 180 

IWVAQTPGES ISSSPALPNV LPLNEDVNKQ EEKNEDHTPN YAPANEKNGN YYKDIKQYVF 240 

TTQNPNGTES EISVRATTDL NFALKNDKTV NATTYEKSTI EEETTTSEPS HKNIQRSTPN 300 

VPAFWTMLAK AINGTAWMD DKDQLFHPIP ESDVNATQGE NQPDLEDLKI KIMLGISLMT 360 

55 LLLFWLLAF CSATLYKLRH LSYKSCESQY SVNPELATMS YFHPSEGVSD TSFSKSAESS 420 
TFLGTTSSDM RRSGTRTSES KIMTDIISIG SDNEMHENDE SVTR 

Seq ID NO: 222 DNA sequence 
Nucleic Acid Accession #: AL050295.1 

60 Coding sequence: 237-2073 (underlined sequences correspond to start and stop codons) 

1 11 21 31 41 51 

_ I I I I I I 

65 GAAGGGGACA GAAGGCAGTT CACCTCTGCT CCCGACAGCC TGGGAACCCG CAAGAGCCCC 60 

AGCATTTGAA GTCTGGTCTT GTGAAACCCC ACCCTCCTCT GGCTGTGTGA TTGAATGGGA 120 

TGCCCTCGAG GTACACCTCA CCTGAGAGGG TTTTGGGCAG ATCAGCAGTA AGGTGTTAAA 180 

TTTTAGAAGC CTGAAAACTC CAGAAGAGAA AGGCCAACCA ACTCAAACTT GAAGACATGA 240 

AATCCCCAAG GAGAACCACT TTGTGCCTCA TGTTTATTGT GATTTATTCT TCCAAAGCTG 300 

70 CACTGAACTG GAATTACGAG TCTACTATTC ATCCTTTGAG TCTTCATGAA CATGAACCAG 3 60 

CTGGTGAAGA GGCACTGAGG CAAAAACGAG CCGTTGCCAC AAAAAGTCCT ACGGCTGAAG 420 

AATACACTGT TAATATTGAG ATCAGTTTTG AAAATGCATC CTTCCTGGAT CCTATCAAAG 480 

CCTACTTGAA CAGCCTCAGT TTTCCAATTC ATGGGAATAA CACTGACCAA ATTACTGACA 540 

TTTTGAGCAT AAATGTGACA ACAGTCTGCA GACCTGCTGG AAATGAAATC TGGTGCTCCT 600 

75 GCGAGACAGG T.TATGGGTGG CCTCGGGAAA GGTGTCTTCA CAATCTCATT TGTCAAGAGC 660 

GTGAGGTCTT CCTCCCAGGG CACCATTGCA GTTGCCTTAA AGAACTGCCT CCCAATGGAC 720 
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10 



15 



20 



CTTTTTGCCT 
TTCAAGAAGA 
AAACAGCGTT 
GGTTCAAGTC 
AGTTAATACA 
ACTACAACTC 
TCATCTTTGA 
TGTCTTGGCG 
ACACCGCACT 
CAGGTGATGC 
AGAAGAAAAT 
GCGACAACAA 
TAGAATGGAA 
CTAGCTGCAG 
GAACAACAGT 
ACATAAAAGT 
TTTCTGAGGG 
TTTATTGGAA 
ATCTTGATGG 
CCTATCACTG 
TTCACCCGCT 
GCAGTGGTTC 
TCCATATGGG 



GCTTCAGGAA 
CCTCATGAAC 
CCGGAAGGGT 
TGGAAGTGTG 
TAAAGCCAAT 
CTTTCAAGCA 
AGGGGACACA 
CTATGAAGAA 
TTTCAACAAC 
AGGTGAATAT 
AGATGTTATG 
TCCTGTATCT 
GCAGGAAGGA 
CAGATACACC 
CATCTACACT 
GACATTCATC 
ACAAAACTTT 
CACTTCTGCT 
AGCAGAATCA 
CATATTTAGA 
GCCTCTAAAG 
CCATCACATC 
TTCCTCATCC 



GATGTTACCC 
ACTTCCTCCG 
TACGGAATTT 
GTTGTGACAT 
GAACAAGTTG 
GTTACTATCA 
GTCAGTCTGG 
CAGCAGTTGG 
ATGACTTCGG 
GTTTGCAAAC 
CCCATCCAAA 
TTGAACTGCT 
AAAATAAATA 
CTCAAGGCTG 
TGTGAGTTCA 
TCTGTGGCCA 
TCTATAAAAT 
GG AATTAAAA 
GTACTGACAG 
TATAAGAATT 
CTGAACATCA 
AAGTGCTGCA 
CTTCCTGCTG 



TGAACATGAG 
CCCTCTATAG 
TACCAGGCTT 
ATGAAGTCAA 
TACAGAGCCT 
ATGAAAGCAA 
TGTGTGAAAA 
AAATCCAGAA 
TGTCCAAGCT 
TGATATTAGA 
TTTTGGCAAA 
GCAGTCAGGG 
TTCCAGGAAC 
ATGGAACCCA 
TCAGTGCCTA 
ATCTAACAAT 
GCATCAGTGA 
TATACCAAAG 
TCAAGACCTC 
CATACAGTAT 
TGATTGATCC 
TAGAGGAGGA 
TAAAAAAAAA 



AGTCAGACTA 
GTCCTACAAG 
CAAGGGCGTG 
GACTACACCA 
CAATCAGACC 
TTTCTTTGTC 
GGAAGTTTTG 
CAGCAGCAGA 
CACCATCCAC 
CATTTTTGAA 
TGAAGAAATG 
TAATGTTAAT 
CCCTGAGACA 
GTGCCCAAGC 
TGGAGCCAGA 
AACCCCGGAC 
TGTGAGTAAC 
ATTTTATACC 
GACCAGGGAG 
TGCAACCAAA 
TTTGGAAGCT 
TGGAGACTAC 
AAAAAAAAAA 



AATGTAGGCT 780 

ACCGACTTGG 84 0 

ACTGTGACAG 900 

CCATCACTTG 960 

TACAAAATGG 1020 

ACACCAGAAA 1080 

TCCTCCAATG 1140 

TTCTCGATTT 1200 

AACATCACTC 1260 

TATGAGTGCA 1320 

AAGGTGATGT 1380 

TGGAGCAAAG 144 0 

GACATAGATT 1500 

GGGTCGTCTG 1560 

GGCAGTGCAA 1620 

CCAATTTCTG 1680 

TATGATGAGG 1740 

ACGAGGAGGT 1800 

TGGAATGGAA I860 

GACGTCATTG 1920 

ACTGTTTCAT 1980 

AAAGTTACTT 2040 
A 



25 



Seq ID NO: 223 Protein sequence : 
Protein Accession 8: CAB433 94.1 



30 



35 



40 



45 



50 



55 



60 



65 



70 



I 

MKSPRRTTLC 
EEYTVNIEIS 
SCETGYGWPR 
GFQEDLMNTS 
LELIHKANEQ 
NVSWRYEEQQ 
CKKKIDVMPI 
DSSCSRYTLK 
SVSEGQNFSI 
GTYHCIPRYK 
TFHKGSSSLP 



11 
I 

LMFIVIYSSK 
FENASFLDPI 
ERCLHNLICQ 
SALYRSYKTD 
WQSLNQTYK 
LEIONSSRFS 
QILANEEMKV 
ADGTQCPSGS 
KCISDVSNYD 
NSYSIATKDV 
AVKKKKKK 



21 



AALNWNYEST 
KAYLNSLSFP 
ERDVFLPGHH 
LETAFRKGYG 
MDYNSFQAVT 
IYTALFNNMT 
MCDNNPVSLN 
SGTTVIYTCE 
EVYWNTSAGI 
IVHPLPLKIiN 



31 
I 

IHPLSLHEHE 
IHGNNTDQIT 
CSCLKELPPN 
ILPGFKGVTV 
INESNFFVTP 
SVSKLTIHNI 
CCSQGNVNWS 
FISAYGARGS 
KIYQRFYTTR 
IMIDPLEATV 



41 

I 

PAGEEALRQK 
DILSINVTTV 
GPFCLLQEDV 
TGFKSGSWV 
EIIFEGDTVS 
TPGDAGEYVC 
KVEWKQEGKI 
ANIKVTFISV 
RYLDGAESVL 
SCSGSHHIKC 



51 



RAVATKSPTA 60 

CRPAGNEIWC 120 

TLNMRVRLNV 180 

TYEVKTTPPS 240 

LVCEKEVLSS 300 

KLILDIFEYE 360 

NIPGTPETDI 420 

ANLTITPDPI 480 

TVKTSTREWN 540 

CIEEDGDYKV 600 



Seq ID NO: 224 DNA sequence 

Nucleic Acid Accession #: NM_007268 

Coding sequence: 46-124 5 (underlined sequences correspond to start and stop codons) 



75 



GGTAGCAGGA 
GGCCTGCTAC 
CCAGAGAGTG 
CTGCAAGGCT 
ACCATCTTTC 
CTGCATGTGA 
GATGACCGGA 
GTGAGAGATA 
ACAACTGGCA 
CAGGCTCGGG 
GAACCCATCA 
TCAGGCTCCT 
GTGAAGTTTG 
ACCATGACAT 
GACATGGATG 
GCCATCATCC 
CTCTGTCGGA 
GAGGCCAACG 
GATGAGCCAA 
GAGTACCAGA 
CTGGATTATG 
CCAGGATCTG 
GCTACCTCTC 
TCACTGGCTT 
GCTCTGGGCC 
GGGAAGATGC 
GCATCTTGCC 
TATCCAGGAT 
ACAGGCCAGG 



11 
I 

GGCTGGAAGA 
TCCTGGGGCA 
TAACAGGACC 
ACACCCAAGT 
TACGTGACTC 
GCCACAAGGT 
GCCACTACAC 
AGATTACTGA 
GCGGTTATGG 
GTTCTCCTCC 
AAGTAGCAAC 
ATTTCTGCAC 
TGGTCAAAGA 
ACCCCTTGAA 
GCTACCTTGG 
TCATCATCTC 
AGACATCCCA 
ACTCTGGAGA 
CTTCCCAGAA 
TCATCGCCCA 
AGTTTCTGGC 
CTGACATAAT 
TTCCTGGATA 
TGCCCTGGAA 
CTTCTAGTAT 
CCATAGCACT 
ACCAGAAGAC 
CATTTCTCTT 
GTTCAGTTCT 



21 
I 

AAGGACAGAA 
CCTAACAGTG 
TTGGAAAGGG 
CTTGGTGAAG 
TTCTGGAGAC 
TCCAGGAGAT 
GTGTGAAGTC 
GCTCCGTGTC 
CTTCACGGTG 
CATCAGTTAT 
CCTAAGTACC 
TGCCAAGGGC 
CTCCTCAAAG 
AGCAACATCT 
AGAGACCAGT 
CTTGTGCTGT 
ACAAGAGCAT 
AACCATGAGG 
TCTGGGCAAC 
GATCAATGGC 
CACTGAGGGC 
TGCCTAGTCA 
GCCCAAAGTG 
TTTGCCAGAT 
CTCTGCCGGG 
AGGACTTGGT 
CCGAGGGAGG 
TCTTCAGGGC 
GCTCCTCCAC 



31 



I 



GTAGCTCTGG 
GACACTTATG 
GATGTGAATC 
TGGCTGGTAC 
CATATCCAGC 
GTATCCCTCC 
ACCTGGCAGA 
CAGAAACTCT 
CCCCAGGGAA 
ATTTGGTATA 
TTACTCTTCA 
CAGGTTGGCT 
CTACTCAAGA 
ACAGTGAAGC 
GCTGGGCCAG 
ATGGTGGTTT 
GTCTACGAAG 
GTGGCCATCT 
AACTACTCTG 
AACTACGCCC 
AAAAGTGTCT 
GTCCTTGCCT 
TCCGCCTACC 
GCATCTCAAG 
GGCTTCTGGT 
CATCATGCCT 
CTCAGCTCTG 
CAGACAGCTT 
TATAAGTCTA 



41 
I 

CTGTGATGGG 
GCCGTCCCAT 
TTCCCTGCAC 
AACGTGGCTC 
AGGCAAAGTA 
AATTGAGCAC 
CTCCTGATGG 
CTGTCTCCAA 
TGAGGATTAG 
AGCAACAGAC 
AGCCTGCGGT 
CTGAGCAGCA 
CCAAGACTGA 
AGTCCTGGGA 
GAAAGAGCCT 
TTACCATGGC 
CAGCCAGGGC 
TCGCAAGTGG 
ATGAGCCCTG 
GCCTGCTGGA 
GTTAAAAATG 
TCTGCATGGC 
AACACTGGAG 
TAAGCCAGCT 
ACTCCTCTCT 
ACAGACACTA 
CCAGCTCAGA 
TTAATTGAAA 
ATGTTCTGAC 



51 



GATCTTACTG 
CCTGGAAGTG 
CTATGACCCC 
AGACCCTGTC 
CCAGGGCCGC 
CCTGGAGATG 
CAACCAAGTC 
GCCCACAGTG 
CCTTCAATGC 
TAATAACCAG 
GATAGCCGAC 
CAGCGACATT 
GGCACCTACA 
CTGGACCACT 
GCCTGTCTTT 
CTATATCATG 
ACATGCCAGA 
CTGCTCCAGT 
CATAGGACAG 
CACAGTTCCT 
CCCCATTAGG 
CTTCTTCCCT 
CCGCTGGGAG 
GCTGGATTTG 
AAATACCAGA 
TTCAACTTTG 
GGACCAGCTA 
TTGTTATTTC 
TCTCTCCTGG 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
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TGCTCAATAA ATATCTAATC ATAACAGCAA AAAAAAAAAA AAAAAAA 

Seq ID NO: 225 Protein sequence: 
Protein Accession #: NP_009199.1 

1 11 21 31 41 51 

I I I I I I 

MGHjLGLLLL ghltvdtygr pilevpesvt GPWKGDVNLP CTYDPLQGYT QVLVKWLVQR 60 

GSDPVTIFLR DSSGDHIQQA KYQGRLHVSH KVPGDVSLQL STLEMDDRSH YTCEVTWQTP 120 

DGNQWRDKI TELRVQKLSV SKPTVTTGSG YGFTVPQGMR ISLQCQARGS PPISYIWYKQ 180 

QTNNQEPIKV ATLSTLLFKP AVIADSGSYF CTAKGQVGSE QHSDIVKFW KDSSKLLKTK 240 

TEAPTTMTYP LKATSTVKQS WDWTTDMDGY LGETSAGPGK SIiPVFAIILI ISLCCMWFT 300 

MAYIMLCRKT SQQEHVYEAA RAHAREANDS GETMRVAIFA SGCSSDEPTS QNLGNNYSDE 360 

PCIGQEYQII AQINGNYARL LDTVPLDYEF LATEGKSVC 



Seq ID NO: 226 DNA sequence 

Nucleic Acid Accession #: XM_64321 

Coding sequence; 1-2079 {underlined sequences correspond to start and stop codons) 



1 
I 

ATGG TCGCCA 
CCAGGACCAC 
GAGCCTGGTA 
GGGGTTGGGA 
ATCCGTCAGG 
CCATACGAAG 
CTCACAAATG 
CATAATCAAT 
TTCCCCTCCA 
CCTGAAGTTA 
TCCCTTTGTG 
GAAAGCAGAA 
CGCAAGTCAC 
GTTCACACCA 
CAGCCAATAC 
GGTGATCTTG 
CGGAGGAGCC 
CTCAAGAGGT 
CCCACATTCA 
TCCCTGCTCC 
GGGGGGCGCC 
AGGCCAGGGA 
TCCTCTTTAG 
TGCTTAGAAG 
GGTCCAGACA 
CAGCACCAGA 
AATGAATATG 
TATTCCCCAA 
GTCTCCCTTT 
GTCGTAGCAG 
CAGAAGCAGC 
CGCAAGTTCC 
CCCCCGGCCG 
TGCTCCTGCT 
TTCATTCCAC 



11 
I 

GTTCCGATCA 
GCCTCCGCTC 
CTTACAGGGA 
TCACGGGAAA 
AGGATGCCTT 
CTACCTTGCA 
GCTACCTGCC 
ATCCTAATGG 
CAGCCACCAC 
AACTAAAAAT 
GAGACCTTTT 
AAGAAAAGAG 
ACAAAATCCC 
TATCAGAAAA 
TATCTTCTGT 
TGTGGTCCAA 
ATCACTGTTC 
CTTTCATGGT 
AGGGAACTGC 
TTGGTCATTG 
GCTGGGTGTG 
AGGAACCTCT 
CCACCTCTCA 
ACTATGCAGG 
GCAGGACTGG 
GATATCTCCT 
ATTGCAAGCT 
CCCACATCCT 
CCAAGTTCCT 
TACGTAGGAG 
CCTGTCCTGC 
ACGTGATGGC 
CGCCCCGCAA 
CCCAGGATGT 
ATCCCTGGGT 



21 

I 

AGACAGAGCC 
TGCCCAGAGG 
GGGTGGTGGA 
CACAGTTCAA 
TGATAACAAA 
GCAAAGCTTT 
ATCAATCAGC 
AAATTCTAAA 
TTCGGTACCA 
AACCAAAACT 
AAATGAAGTA 
GAAAAAACCC 
CAAATTAGAA 
ACCAAGGGAA 
TCCAACAACA 
GGTGACGGTC 
CAGCTGCCTG 
TTCTTCCTTG 
CCAGATGGGC 
GGAAGGAACA 
GCAGCATCAG 
GAGACTCAGT 
GGGCTGCTGG 
GCGCCGCCAT 
AAGCCTTAGA 
GCAAGGCCTC 
AGAGACGAGA 
TCAGTCT.GAA 
CAAACGCAAA 
ATCCAATATG 
CAAGTACACG 
TCAGAAGAGG 
GGTGGCTGAC 
GTATCTGACT 
GCCCTTCGGC 



31 
I 

CCGTATCTTC 
CCAAAAGCAG 
GCCATCGTCC 
CAACCACCTC 
ATTGACATTG 
CAATACTCAC 
ATGTATGAAA 
CAGAAGACCA 
CAAACTGTGA 
ATCCAGAATG 
CAGGCAAGTG 
AAAAAGCATG 
CCAGAGGAAC 
GATCCAGTAC 
GAAGTGTCCA 
ACACCCTGTT 
GAGATCTTGG 
AAGTTCCTCA 
TGGTCACCTA 
GACCAGATGT 
AAGCCTCAGA 
TTCCTACGAT 
TGTTCGCCCC 
TTGACACTCA 
GCTGTCGGCA 
CTAGGTGGGT 
GAAGCGGCGT 
AGTGCCCCTA 
GCAAACAGCC 
CCTGGCACAA 
CCTGCCTGCC 
GGCCTGTCAG 
AGACGCCAGC 
GGAGTTTCTG 
TCCTCCTAG 



41 
I 

CAGGGACACT 
CCCAACAAGA 
TCACGTATGC 
AACTCACTGA 
CTGAAGATGG 
CTACAACAGA 
TTCAAACCAA 
CATTAAATTC 
TTCCAAAGAA 
GCAGGGAATT 
AGCACACGAA 
ACTCATCAAG 
AAAATAGACC 
TAAAAGAGGA 
CTGGTGTTAA 
GGGTGCCCCG 
TGCTGGTGCC 
CCTCCACGGG 
TGGCCTCCAC 
CATCCAGGGG 
TCCGCATCTC 
GTGAAGTGGA 
CAGACCACGT 
GAGCCCAGGA 
AGAGATACTG 
TCTTGGAAGA 
CCTCAACTCC 
ACCACTACTT 
ATTTCCTGCA 
GGGGGTGGGG 
ACGCACAATG 
GAAGATGTAG 
AGCTGCCGGG 
GATTAAAGGC 



51 
I 

AGACAAGATG 60 

GCCCGGCATT 120 

GCTGGGGATC 180 

CTCCGCCAGC 240 

TGGCCAGACA 300 

TCTTCCTCCA 360 

ATACCAGTCG 420 

TAGAAAACCC 4 80 

GAGTGGCTCA 540 

GTTCAAGTCT 600 

GTCAAAGCAT 660 

ATCTGAAGAG 720 

AAATGAGAGG 780 

AGCCCCAGTT 840 

GTTTCAAGTT 900 

CCTGCGAGGA 960 

AGCCCTCAGC 1020 

CAAACAGAAG 108 0 

GACCAATGTC 1140 

CCCGGAATTT 120 0 

CATCTGCCAC 1260 

GAGAAGAATC 1320 

CTGTGAGAAA 1380 

AGCCTTTCTT 144 0 

CAGGAACAGC 1500 

AAGGAATGCC 156 0 

AAGAATCCCG 1620 

TCCCTACCAC 1680 

CCTGTGTGCA 1740 

TGGCCACAAA 1800 

G GAG AC ATT C 1860 

GGGCCAGCAG 1920 

GGCTCCGGGC 198 0 

CAGTCGTGGC 204 0 



Seq ID NO: 227 Protein sequence: 
Protein Accession #: XP_064321.1 

1 11 21 31 41 51 

I I.I I I I 

MVASSDQDRA PYLPGTLDKM PGPRLRSAQR PKAAQQEPGI EPGTYREGGG AIVLTYALGI 60 

GVGITGNTVQ QPPQLTDSAS IRQEDAFDNK IDIAEDGGQT PYEATLQQSF QYSPTTDLPP 120 

LTNGYLPSIS MYEIQTKYQS HNQYPNGNSK QKTTLNSRKP FPSTATTSVP QTVIPKKSGS 180 

PEVKLKITKT IQNGRELFKS SLCGDXjLNEV QASEHTKSKH ESRKEKRKKP KKHDSSRSEE 240 

RKSHKIPKLE PEEQNRPNER VHTISEKPRE DPVLKEEAPV QPILSSVPTT EVSTGVKFQV 300 

GDLVWSKVTV TPCWVPRIjRG RRSHHCSSCL EIIjVLVPALS LKRSFMVSSL KFLTSTGKQK 360 

PTFKGTAQMG WSPMASTTNV SLLLGHWEGT DQMSSRGPEF GGRRWVWQHQ KPQIRISICH 420 

RPGKEPLRLS FLRCEVERRI SSLATSQGCW CSPPDHVCEK CLEDYAGRRH LTLRAQEAFL 4 80 

GPDSRTGSLR AVGKRYCRNS QHQRYLLQGL LGGFLEERNA NEYDCKLETR EAASSTPRIP 540 

YSPTHILQSE SAPNHYFPYH VSLSKFLKRK ANSHFLHLCA WAVRRRSNM PGTRGWGGHK 600 

QKQPCPAKYT PACHAQWETF RKFHVMAQKR GLSGRCRGQQ PPAAPRKVAD RRQQLPGAPG 660 
CSCSQDVYLT GVSGLKASRG FIPHPWVPFG SS 
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Seq ID NO: 228 DNA sequence 

Nucleic Acid Accession #: NM_006033 

Coding sequence: 253-1752 (underlined sequences correspond to start and stop codons) 



10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



60 



65 



70 



75 



AGCAGCGAGT 
CCGTTGACAC 
CCTTCTCTGC 
GCTGGAAACA 
TGGCGGGGCA 
TTTGCTGCGG 
AAACCCAAAG 
AAGGACCCAG 
TGCAGTTTCA 
ATCTTTGAAA 
AATGTAGTTG 
AATACCAGGG 
GATTTTTCTC 
TATGCAGGCA 
CCCATGTTTG 
GATGTCCTCC 
GGCCACATTG 
GTCTTGGGAT 
GCCGTCCACC 
TGCACTGACT 
AATAG CATTG 
AAAACCCGGG 
AGTTACAAGA 
GGAGATTCCC 
ACCTTCCTGG 
GAGGGGGCCT 
CCCCGCAACC 
CAGCGGAAAC 
GAGCTCTGGT 
GTGGAGCTTC 
TATCCAAGCC 
AGCCTTGACC 
GGAGGGGACT 
TCTGTCCACA 
GCCTCTCGTG 
GAGGAACGCT 
GAGCCTCAGT 
ATCTTGCTCG 
TCTTAGCCAT 
TATTTTTCTG 
ATGTGGTAAT 
TAGACCCTCT 
CTGGGGGAGG 
CTTCTTAGTT 
GTGTTTGATC 
GCCGGTTGCC 
ATGACATATG 
TCTAAGATTT 
CTTTTCTTGT 
ACACAATATC 
GTAGGGGGAG 
TCAGCCACAT 
TTCCTGGAGG 
AATATGCCCC 
ATCCTAAGCC 
TTATGGCTGA 
TGTACTTAGG 
TTAAACTTCT 
GTGTGCTTCT 
TATCCCTTAT 
TTTGGAATGT 
AAAATTCATT 
TATTTTAAAT 
CATTTTTTGT 
TTTTTGGATT 
TCTGTTTTTA 



11 
I 

CCTTGCCTCC 
TCGCTCCCTG 
CTCCAGTCCC 
CCAAGAGGTG 
GGATGAGCAA 
GGAGCCCCGT 
CTACACAGAC 
AGCATGAAGG 
ACATGACAGC 
ACTGGCTGCA 
TGGTTGACTG 
TGGTGGGACA 
TCGGGAATGT 
ACTTCGTGAA 
AAGGGGCCGA 
ACACCTACAC 
ACATCTACCC 
CAATTGCATA 
TCTTTGTTGA 
CCAATCGCTT 
GCTACAATGC 
CAGGCATGCC 
ACATGGGAGA 
AGACTCTGCC 
TCTACACCGA 
CTCAGTCTTG 
CCGGACGGGA 
TGACATTTTG 
TTCGCAAGTG 
CC TGA GGGTG 
CATGGAGGAA 
CTGGAGCACT 
GCGCTGCTAT 
CCTCCAGAGC 
CACACTGGAT 
GGCTCCGAAG 
GAGAAGTCCT 
GGCCCTAGCT 
TCCGTCCTGC 
TTCATTTTTT 
GGACATATTA 
TTCTGTTTGG 
CT AT AG GAT A 
ATTATGTGCC 
ACTAGCAGAA 
AGATATAACT 
TAATACACAT 
AATACAGTGC 
TGAACTTCTT 
TGAGACACTT 
AAGGGCAACT 
GCCTAGACTT 
CAAAGTCTAT 
AGACGTGAGA 
CTTTTATTAA 
GATTCGGGAG 
TTTTCTAAGG 
GAAGACATAA 
CTTGTTTCTG 
CCAAAATGCT 
TTGCATATAC 
GATGTGTCAG 
AGTTGTGTAC 
CTTGTTGGTG 
AGGGTTGCTC 
CTAATGGAAG 



21 



CGGCGGCTCA 
CCACCGCCCG 
CCAGCCCCTG 
GTTTTTGTTT 
CTCCGTTCCT 
ACCTTTTGGT 
TGAGGTCAAA 
ATGCTACCTC 
TAAAACCTTT 
CAAACTCGTG 
GCTCCCCCTG 
CAG CATTG CC 
CCACTTGATC 
AGGAACGGTG 
CATCCACAAG 
GCGTTCCTTC 
CAATGGGGGT 
TGGAACAATC 
CTCTCTGGTG 
CAAAAAGGGG 
CAAGAAAATG 
TTTCAGAGTT 
AATTGAGCCC 
ACTGGAAATA 
GGAGGACTTG 
GTACAACCTG 
GCTGAATATC 
TACAGAAGAC 
TCGGGATGGC 
CCCGGGCAAG 
AGTTACTGCT 
GGGAACAACT 
AGCTCTTGCT 
ACCAAGTCCA 
TGGTTTCTCA 
AGGCCCTGTG 
TCCGACAGGA 
GTTGGGGTTC 
TCCCCAGCTC 
AATTGAGCAA 
CTGAGCCTCT 
ATGGTGTATG 
TAAGCATTAG 
ACCTTCTTAG 
TAGCAAGCAG 
GCTTTGGAGC 
CTGTGTACAC 
TTTTTTTCCT 
GGAAAAGCCA 
ACACTTTTCA 
ATTATTATCC 
ATATACTAGT 
CTCTGAAACT 
CAAACAAGGA 
TGTATAACCA 
GAAGTGTGAC 
ACATTGTTTT 
CCAGTTGAGT 
TGATTGCTTT 
TGGAACCAGA 
ATAATGAGAT 
TTACACCTTA 
ATGAAGCATG 
CTCAAAAAGT 
AACCCATATT 
CTTTGCA 



31 
I 

GGACGAGGGC 
GGCTCCGTGC 
GCCGAGAGAA 
TTTAAAACTT 
CTGCTCTGTT 
CCAGAGGGAC 
CCATCTGTGA 
TCCGTCGGCC 
TTCATCATTC 
TCAGCCCTGC 
GCCCACCAGC 
AGGATGCTCG 
GGCTACAGCC 
GGCCGAATCA 
AGGCTCTCTC 
GGCTTGAGCA 
GACTTCCAGC 
ACAGAGGTGG 
AAT CAG GAGA 
AT CTGTCTGA 
AGGAACAAGA 
TACCATTATC 
ACCTTTTACG 
GTGGAGCGGA 
GGAGACCTCT 
TGGAAGGAGT 
AGGCGCATCC 
CCTGAGAACA 
TGGAGGATGA 
TCTTGCCAGC 
GAGGACCCAC 
GGTCTCCTGT 
GCCTCTCTTG 
GATTTGTGTG 
GTTGCTGGGC 
TAGAAGGCTG 
GCTGACTCAT 
TCATGGGTTG 
ACTCTCTGAA 
ATGTCTATTG 
CCATTTGGAA 
TGTATATGCA 
GGACCCTGAG 
TTATTATGTG 
AGTATCATTC 
AAATCTCTTC 
AGAAACCGGC 
CTTTGAAATA 
CCAATTCTAG 
AAAGATTTGT 
CTATTTTACA 
TAGTGGTGCA 
CCATGAAGAC 
C TTTTTTTTT 
GGAGAACATC 
ACCAAGCAGG 
AATCTGTATC 
CTTATTTCAA 
CTAGCCAAAG 
AGTGTTTCAA 
ATTTTGGGAA 
TCCACATAGC 
GTTTGTGGTA 
TTTGGATTTT 
ATTGGCTGTA 



41 

I 

AGATCTCGTT 
CGCCAAGTTT 
GGGTCTTACC 
CTGTTTCTTG 
TCTGGAGCCT 
GGCTGGAAGA 
GGTTTAACCT 
ACAGCCAGCC 
ACGGATGGAC 
ACACAAGAGA 
TTTACACGGA 
ACTGGCTGCA 
TCGGAGCGCA 
CAGGTTTGGA 
CGGACGATGC 
TTGGTATTCA 
CAGGCTGTGG 
TAAAATGTGA 
AGCCGAGTTT 
GCTGCCGCAA 
GGAACAGCAA 
AGATGAAAAT 
TCACCCTTTA 
TCGAGCAGAA 
TGAAGATCCA 
TTCGCAGCTA 
GGGTGAAGTC 
CCAGCATATC 
AAAACGAAAC 
AAGGCAGCAA 
CCAATGGAAG 
GATGGCTGGG 
AATAGCTCTA 
TAAGCAGCTG 
GAGCCTGTAC 
TCAGCTGCTC 
GTCAGGATGG 
CACTGACCAT 
GCACACATCA 
AACACTTAAA 
CCCAGTGGAG 
TGGGGAAAGG 
GCTTTAAGTG 
CCACCTCCCC 
ATGCTGGGGC 
TGTTTAGAGA 
ACCTGCCAGA 
TTTTACTTTA 
ATCTTGATTT 
GTATG CATTG 
AAACTGAGGC 
GCCAGGGAGA 
TTTTGCAGCC 
TATATAGAGC 
TGTGCCAACG 
AGAGGAAGAA 
GTGCCAAAGT 
GATATGTTCT 
CGAAGCTTGT 
ATTTTAGATT 
TAGGACCCGA 
CTGAGGGTAA 
ACTTATGTGA 
GGAGCATTTC 
CATCCTGGTC 



51 



CTGGGGCAAG 
TCATTTTCCA 
GGCCGGGATT 
GGAGGGGGTG 
CTGCTATTGC 
TAAGCTCCAC 
CCGCACCTCC 
CTTAGAAGAC 
GATGAGCGGT 
GAAAGACGCC 
TGCGGTCAAT 
GGAGAAGGAC 
CGTGGCCGGG 
TCCTGCCGGG 
AGATTTTGTG 
GATGCCTGTG 
ACTCAACGAT 
GCATGAGCGA 
TGCCTTCCAG 
GAACCGTTGT 
AATGTACCTA 
CCATGTCTTC 
TGGCACTAAT 
TGCCACCAAC 
GCTCACCTGG 
CCTGTCTCAA 
TGGGGAAACC 
CCCAGGCCGG 
CAGTCCCACT 
GACTTCCTGC 
GATTCTTCTC 
ACTCCTCGCG 
ACTCCAAACC 
GGTGCCTGGG 
TCTGCCTGAC 
AGCCTGCTTT 
CAGGCCTGGT 
ACTGCTTACG 
TTGGCTTTCC 
ATTAATTAGA 
TTGGGATTTC 
CACCTGGGGC 
GTTTCTATTT 
TATGAGTGAC 
CAGAATGATG 
GATAGAAGTT 
CAGAGCTGGT 
ATACCAGTGC 
GAATTAATAC 
CCTAATTAGA 
TTAGTGAGGT 
GGACTCAGAT 
AGTTCCCACC 
CATCCATAAA 
GTTGGACTTT 
TGATTTTCTT 
TGTATCACTG 
CAAGCCAATT 
ACAGGTTGAG 
ATTTTCAGAT 
GCCTAAACAC 
TTTTATACGA 
GGGGTTTTCC 
GGATTTTGGA 
ACTTCTGACT 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 



Seq ID NO: 22 9 Protein sequence: 
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Protein Accession #: NP__006024.1 

1 11 21 31 41 51 

I I I I I I 

MSNSVPLLCP WSLCYCFAAG SPVPFGPEGR LEDKLHKPKA TQTEVKPSVR FNLRTSKDPE 60 

HEGCYIiSVGH SQPLEDCSFN MTAKTFFIIH GWTMSGIFEN WLHKLVSALH TREKDANVW 120 

VDWIiPLAHQIi YTDAVNNTRV VGHSIARMLD WLQEKDDFSL GNVHblGYSL GAHVAGYAGN 180 

PVKGTVGRIT GLDPAGPMFE GADIHKRLSP DDADFVDVLH TYTRSFGLSI GIQMPVGHID 24 0 

IYPNGGDFQP GCGLNDVLGS IAYGTITEW KCEHERAVHL FVDSLVNQDK PSFAPQCTDS 300 

NRFKKG1CLS CRKNRCNSIG YNAKKMRNKR NSKMYLKTRA GMPFRVYHYQ MKIHVFSYKN 360 

MGEIEPTFYV TLYGTWAOSQ TLPLEIVERI EQNATNTFLV YTEEDLGDLL KIQLTWEGAS 420 

QSWYNLWKEF RSYLSQPRNP GRELNIRRIR VKSGETQRKL TFCTEDPENT SISPGRELWF 480 
RKCRDGWRMK NETSPTVELP 



987 



WO 02/079492 PCT/US02/04915 

It is understood that the examples described above in no way serve to limit the 
true scope of this invention, but rather are presented for illustrative purposes. All 
publications, sequences of accession numbers, and patent applications cited in this 
specification are herein incorporated by reference as if each individual publication or patent 
5 application were specifically and individually indicated to be incorporated by reference. 
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WHAT IS CLAIMED IS: 

1 1 . A method of detecting an angiogenesis-associated transcript in a cell in 

2 a patient, the method comprising contacting a biological sample from the patient with a 

3 polynucleotide that selectively hybridized to a sequence at least 80% identical to a sequence 

4 as shown in Tables 1 -8. 

1 2. The method of claim 1 , wherein the biological sample is a tissue 

2 sample. 

1 3. The method of claim 1, wherein the biological sample comprises 

2 isolated nucleic acids. 

1 4. The method of claim 3, wherein the nucleic acids are mRNA. 

1 5. The method of claim 3, further comprising the step of amplifying 

2 nucleic acids before the step of contacting the biological sample with the polynucleotide. 

1 6. The method of claim 1 , wherein the polynucleotide comprises a 

2 sequence as shown in Tables 1-8 . 

1 7. The method of claim 1 , wherein the polynucleotide is labeled. 

1 8. The method of claim 7, wherein the label is a fluorescent label. 

1 9. The method of claim 1, wherein the polynucleotide is immobilized on 

2 a solid surface. 

1 1 0. The method of claim 1 , wherein the patient is undergoing a therapeutic 

2 regimen to treat a disease associated with angiongenesis. 

1 11. The method of claim 1, wherein the patient is suspected of having 

2 cancer. 

1 12. An isolated nucleic acid molecule consisting of a polynucleotide 

2 sequence as shown in Tables 1-8. 

1 13. The nucleic acid molecule of claim 1 2, which is labeled. 

1 14. The nucleic acid of claim 13, wherein the label is a fluorescent label 
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1 1 5. An expression vector comprising the nucleic acid of claim 12. 

1 1 6. A host cell comprising the expression vector of claim 15. 

1 1 7. An isolated polypeptide which is encoded by a nucleic acid molecule 

2 having polynucleotide sequence as shown in Tables 1 -8 

1 1 8. An antibody that specifically binds a polypeptide of claim 17. 

.1 19. The antibody of claim 18, further conjugated or fused to an effector 

2 component. 

1 20. The antibody of claim 19, wherein the effector component is a 

2 fluorescent label. 

1 21 . The antibody of claim 19, wherein the effector component is a 

2 radioisotope. 

1 22. The antibody of claim 19, which is an antibody fragment. 

1 23. The antibody of claim 19, which is a humanized antibody 

r 

1 24. A method of detecting a cell undergoing angiogenesis in a biological 

2 sample from a patient, the method comprising contacting the biological sample with an 

3 antibody of claim 18. 

1 25. The method of claim 24, wherein the antibody is further conjugated or 

2 fused to an effector component, 

1 26. The method of claim 25, wherein the effector component is a 

2 fluorescent label. 

1 27. The method of detecting antibodies specific to angiogenesis in a 

2 patient, the method comprising contacting a biological sample from the patient with a 

3 polypeptide which is encoded by a nucleotide sequence of Tables 1-8. 
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